8373722: [TESTBUG] compiler/vectorapi/TestVectorOperationsWithPartialSize.java fails intermittently #28960
Conversation
👋 Welcome back xgong! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request.

❗ This change is not yet ready to be integrated.
@XiaohongGong The following label will be automatically applied to this pull request: […]

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.
Shall we change the calculation of the tolerance? e.g. […]
Thanks for looking at this issue. It's really a good question about the definition of a more reasonable tolerance. For a floating-point reduction test, the goal is to check that the API is implemented correctly, and what we really care about is how close the reduction result is to the mathematically expected value. In other words, the tolerance should be derived from the expected sum itself. For example, if the float array is […]. Using […]
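To make the idea concrete, here is a minimal, self-contained sketch of a sum-based tolerance; the array values and the factor of 10 are illustrative, not taken from the test:

```java
class SumBasedToleranceSketch {
    public static void main(String[] args) {
        float[] fa = {100.0f, 200.0f, 300.0f};  // hypothetical input
        // Golden value: sequential scalar addition.
        float expected = 0.0f;
        for (float v : fa) {
            expected += v;
        }
        // The tolerance scales with the magnitude of the expected sum
        // instead of being a fixed absolute constant.
        float tolerance = Math.ulp(expected) * 10.0f;
        System.out.println("expected=" + expected + ", tolerance=" + tolerance);
    }
}
```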
Understood. However, the test range (1.0, 5.0) is really too small.
FYI: the current sum-based tolerance may also be bigger than a max_abs-based one.
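A quick illustration of that point, with hypothetical values: a long array of equal positive elements makes the sum, and hence its ULP, much larger than the largest element.

```java
import java.util.Arrays;

class SumVsMaxAbs {
    public static void main(String[] args) {
        float[] fa = new float[64];
        Arrays.fill(fa, 1000.0f);
        float sum = 0.0f;
        float maxAbs = 0.0f;
        for (float v : fa) {
            sum += v;
            maxAbs = Math.max(maxAbs, Math.abs(v));
        }
        // ulp() grows with magnitude, so the sum-based window is wider here.
        System.out.println("sum-based tolerance:     " + Math.ulp(sum) * 10.0f);
        System.out.println("max_abs-based tolerance: " + Math.ulp(maxAbs) * 10.0f);
    }
}
```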
We used the […]
What I really want to avoid in this case is generating values that are near the extreme maximum or minimum of float. Expanding the value range would probably still be acceptable with the current tolerance, which is derived not only from the cross-lane sum but also includes a rounding-error factor of 10. BTW, consider that there are already enough API tests under […]
The question here is what you mean by […]. Just consider the following case: […]. The sequential scalar floating-point add will result in 0.0f.
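The concrete values are elided above, but a case consistent with the description later in this PR (sequential sum 0.0f, reordered sum Float.MIN_NORMAL) would be:

```java
class ReorderDemo {
    public static void main(String[] args) {
        float a = Float.MAX_VALUE;
        float b = Float.MIN_NORMAL;
        float c = -Float.MAX_VALUE;
        float sequential = (a + b) + c;  // b is absorbed by a, then a cancels: 0.0f
        float reordered  = (a + c) + b;  // a and c cancel exactly first: b survives
        System.out.println(sequential);  // 0.0
        System.out.println(reordered);   // 1.17549435E-38
    }
}
```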
I think we should refer to the Java language spec, which calculates the values in sequential order, right?
I'm not sure. Did you find a related description in the spec? If that is true, the current sum-based […]
```java
random.fill(random.uniformFloats(1.0f, 5.0f), fa);
random.fill(random.uniformDoubles(1.0, 5.0), da);
```
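For readers without the test library at hand, the fill above behaves roughly like the following plain java.util.Random sketch; the generator names in the diff belong to the test's own library, so this is only an approximation:

```java
import java.util.Arrays;
import java.util.Random;

class UniformFillSketch {
    public static void main(String[] args) {
        Random random = new Random();
        float[] fa = new float[64];
        for (int i = 0; i < fa.length; i++) {
            // Uniform in [1.0, 5.0): bounded well away from float extremes.
            fa[i] = 1.0f + random.nextFloat() * 4.0f;
        }
        System.out.println(Arrays.toString(fa));
    }
}
```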
Ideally our tolerance window should be narrow, and increasing the tolerance range to accommodate outliers, as you mentioned in your issue description, may defeat the purpose.

Unlike auto-vectorization, which adheres to the strict-ordering JLS semantics, the Vector API relaxes the reduction order to give backends leeway to use a parallel reduction that does not strictly follow the sequential order.

There are multiple considerations involved: the fallback implementation performs the reduction sequentially, the inline expander always relaxes the strict ordering, and intrinsification of Add/Mul reductions is only supported on AArch64, x86 and RISC-V.

Computing the expected value using a parallel reduction could be another alternative, but then we may face similar problems on targets which do not intrinsify unordered reductions.

Tolerance modeling is a complex topic and involves both relative and absolute error. The current 10-ULP absolute limit is not generic enough to handle the entire spectrum of values. What you have enforced now is a range-based tolerance; did you try widening the input value range and confirming whether the 10-ULP tolerance limit is sufficient?
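For reference, one common way to combine relative and absolute error is sketched below; this is an illustration of the reviewer's point, not code from this PR:

```java
class MixedToleranceSketch {
    // absTol guards values near zero; relTol scales with the magnitude.
    static boolean close(float expected, float actual, float absTol, float relTol) {
        return Math.abs(expected - actual)
                <= Math.max(absTol, relTol * Math.abs(expected));
    }

    public static void main(String[] args) {
        System.out.println(close(600.0f, 600.00006f, 1e-6f, 1e-6f));  // true
    }
}
```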
Yeah, I'm trying to extend the value range to 1~3000. The tests are still running... Since the result largely depends on the random values, I ran this test 500 times on SVE/NEON/X86 machines respectively (1500 times in total), and have not observed any failure so far. Is that fine with you? I will update the test once all tests pass. Thanks for looking at this change!
Even with range 1~3000, we may still see failures with a 1000-ULP factor, according to the following program, right?
```java
class T {
    public static void main(String[] args) {
        float ROUNDING_ERROR_FACTOR_ADD = 1000.0f;
        Float a = 1.0f + (ROUNDING_ERROR_FACTOR_ADD + 1) * Math.ulp(1.0f);
        Float b = 3000.0f;
        Float expected = a + b - b;  // left-to-right: a's tiny offset is absorbed at magnitude b
        Float actual = a + (b - b);  // reassociated: b - b cancels exactly, the offset survives
        float tolerance = Math.ulp(expected) * ROUNDING_ERROR_FACTOR_ADD;
        if (Math.abs(expected - actual) > tolerance) {
            System.out.println("Error: Out of tolerance!");
        }
    }
}
```
Oops, if the range is 1~3000, there are no negative floats, so the case in the above program cannot happen. Just ignore it.
I found the reference here: https://docs.oracle.com/javase/specs/jls/se7/html/jls-15.html#jls-15.7.1. I used the Java Playground and saw this result: […]
So that's exactly the issue I want to avoid. We don't have a more reasonable golden value since we don't know the calculation order in the Vector API, right?
Agreed. I would suggest the testing range also cover negative floats, not only positive ones.
Here is an example which shows that the tolerance may still not be big enough even with […]. Note: the test range of […]

```java
class T {
    public static void main(String[] args) {
        float ROUNDING_ERROR_FACTOR_ADD = 10000000.0f;
        Float a = 0.0f + (ROUNDING_ERROR_FACTOR_ADD + 1) * Math.ulp(0.0f);
        Float b = 1.0f;
        Float expected = a + b - b;  // left-to-right: a is absorbed at magnitude b, giving 0.0f
        Float actual = a + (b - b);  // reassociated: b - b cancels exactly, a survives
        float tolerance = Math.ulp(expected) * ROUNDING_ERROR_FACTOR_ADD;
        if (Math.abs(expected - actual) > tolerance) {
            System.out.println("Error: Out of tolerance!");
        }
    }
}
```

So I'm afraid the sum-based tolerance should be improved.
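One possible improvement, sketched under the assumption that any summation order may occur: derive the tolerance from the classical worst-case bound for floating-point summation, |computed − exact| ≤ (n − 1) · u · Σ|xᵢ|, where u = 2⁻²⁴ is the unit roundoff for float. The names here are illustrative:

```java
class AbsSumToleranceSketch {
    public static void main(String[] args) {
        float[] fa = {1.0e8f, -1.0e8f, 1.0f};  // hypothetical input
        float sumAbs = 0.0f;
        for (float v : fa) {
            sumAbs += Math.abs(v);
        }
        float u = 0x1.0p-24f;  // unit roundoff for float
        // Valid (to first order) for any association order of the sum.
        float tolerance = (fa.length - 1) * u * sumAbs;
        System.out.println("tolerance=" + tolerance);
    }
}
```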
The range is […]

The test fails intermittently with the following error: […]
The root cause is that the Vector API `reduceLanes()` does not guarantee a specific calculation order for floating-point reduction operations [1]. When the array contains extreme values, this can produce results outside the tolerance range compared to sequential scalar addition.

For example, given array elements: […]

Sequential scalar addition produces: […]

However, `reduceLanes()` might compute: […]

The difference between the two calculations is `Float.MIN_NORMAL` (1.17549435E-38), which exceeds the tolerance of `Math.ulp(0.0f) * 10.0f = 1.4E-44`. Even with a 10x rounding-error factor, the tolerance is insufficient for such edge cases.

Since `reduceLanes()` does not require a specific calculation order, differences from scalar results can be significantly larger when special or extreme maximum/minimum values are present. Using a fixed tolerance is inappropriate for such corner cases.

This patch fixes the issue by initializing the float array in the test with random normal values within a specified range, ensuring the result gap stays within the defined tolerance.
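For illustration, here is a self-contained sketch of the kind of check involved, using the public Vector API; the species, the value range, and the factor of 10 are assumptions, not necessarily the test's actual values. Run with `--add-modules jdk.incubator.vector`.

```java
import java.util.Random;
import jdk.incubator.vector.FloatVector;
import jdk.incubator.vector.VectorOperators;
import jdk.incubator.vector.VectorSpecies;

class ReductionCheckSketch {
    static final VectorSpecies<Float> SPECIES = FloatVector.SPECIES_PREFERRED;

    public static void main(String[] args) {
        Random random = new Random();
        float[] fa = new float[SPECIES.length()];
        for (int i = 0; i < fa.length; i++) {
            fa[i] = 1.0f + random.nextFloat() * 2999.0f;  // uniform in [1, 3000)
        }
        // Golden value: sequential scalar addition.
        float expected = 0.0f;
        for (float v : fa) {
            expected += v;
        }
        // Vector reduction: the lane order is unspecified by the API.
        float actual = FloatVector.fromArray(SPECIES, fa, 0)
                                  .reduceLanes(VectorOperators.ADD);
        float tolerance = Math.ulp(expected) * 10.0f;
        if (Math.abs(expected - actual) > tolerance) {
            throw new AssertionError("expected=" + expected + ", actual=" + actual);
        }
    }
}
```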
Tested locally on my AArch64 and X86_64 machines 500 times, and I didn't observe the failure again.
[1] https://docs.oracle.com/en/java/javase/25/docs/api/jdk.incubator.vector/jdk/incubator/vector/FloatVector.html#reduceLanes(jdk.incubator.vector.VectorOperators.Associative)
Progress
[…]

Issue
JDK-8373722: [TESTBUG] compiler/vectorapi/TestVectorOperationsWithPartialSize.java fails intermittently

Reviewing
Using git

Checkout this PR locally:
```
$ git fetch https://git.openjdk.org/jdk.git pull/28960/head:pull/28960
$ git checkout pull/28960
```
Update a local copy of the PR:
```
$ git checkout pull/28960
$ git pull https://git.openjdk.org/jdk.git pull/28960/head
```

Using Skara CLI tools

Checkout this PR locally:
```
$ git pr checkout 28960
```
View PR using the GUI difftool:
```
$ git pr show -t 28960
```

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/28960.diff

Using Webrev

Link to Webrev Comment