-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Would making diff_
thresholds a percentage instead of absolute values make more sense?
#206
Comments
Hi @zakkak just chiming in here - The Jfr perf test thresholds are specified as a relative change ( |new - old| / old ). Maybe something similar could make sense elsewhere too. |
Definitely makes sense, requires recording JVM run as a baseline, but that already happens, see the notion of |
@Karm what percentage would you consider acceptable? |
There are 2 things:
I'd focus on 2) and I'd hardcode values from Q 2.13.8.Final, M 22.3.3.1-Final run on a reference system. |
Right now,
diff_*
thresholds for performance regression testing are defined as absolute numbers, e.g.:mandrel-integration-tests/apps/jfr-native-image-performance/threshold.properties
Line 1 in e67f2fd
This sometimes results in test failures when running on different machines than the one used to tune the thresholds.
However, I am thinking that checking if the increase is within an acceptable range, e.g. 5%, would probably make more sense. After all a 50ms increase on a 10ms run is huge, while on a 5s run it's negligible.
I wonder if switching to percentages instead would also allow us to perform the regression testing (only for diffs between runs) on various machines (including github runners) while not losing accuracy.
cc @Karm @jerboaa
The text was updated successfully, but these errors were encountered: