[ML] Bad values for the variance scale #24
Comments
The exceptions do not happen if influencer z is removed.
Update: The root cause has been identified: the code that counts influencer occurrences per bucket has a bug. The fix is simple (one line of code). The change affects results, hopefully for the better; I will analyze the diff to be sure. I traced the code history back to version 5.5, so this is not a recent regression; it was likely introduced by PR 144 in the old repo. As said, the fix is simple, but I plan to take some more time for related code improvements and test cases.
Fix counting of influencers per bucket for metric population analyses. Prior to this fix, the count was always set to 1. Fixes #24
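To illustrate the class of bug described in the commit message, here is a minimal sketch. The actual code is not shown in this issue, so the `Record` type and both function names are hypothetical and only demonstrate the difference between "count always set to 1" and accumulating one count per occurrence.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical record: one metric value tagged with an influencer field value.
struct Record {
    std::string influencer;
    double value;
};

// Buggy variant (illustrative): every influencer seen in the bucket ends up
// with a count of 1, no matter how many records carried it.
std::map<std::string, std::size_t> countInfluencersBuggy(const std::vector<Record>& bucket) {
    std::map<std::string, std::size_t> counts;
    for (const auto& record : bucket) {
        counts[record.influencer] = 1;
    }
    return counts;
}

// Fixed variant (illustrative): accumulate one count per record.
std::map<std::string, std::size_t> countInfluencers(const std::vector<Record>& bucket) {
    std::map<std::string, std::size_t> counts;
    for (const auto& record : bucket) {
        ++counts[record.influencer];
    }
    return counts;
}
```

With a bucket holding two records for influencer `a` and one for `b`, the buggy variant reports 1 for `a` while the fixed variant reports 2. Any statistic that is normalised by such a count would therefore be off in the buggy case.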
A user data set has shown up two issues with the variance scale calculation in version 6.2.2 of the analytics:
In particular, we are seeing the following error messages logged:
Error calculating joint distribution: Bad variance scale -5.75
Error calculating joint distribution: Bad variance scale inf
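Both logged values fail a basic sanity check on a variance scale. The sketch below assumes that a usable variance scale is a finite, strictly positive multiplier; the function name is hypothetical and this is not the check used in the analytics code.

```cpp
#include <cmath>

// Assumption: a variance scale multiplies a variance, so it should be
// finite and strictly positive. Negative, zero, infinite and NaN values
// are all rejected.
bool isValidVarianceScale(double scale) {
    return std::isfinite(scale) && scale > 0.0;
}
```

Under this assumption, both `-5.75` and `inf` are rejected, while a scale such as `1.0` is accepted.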
There is no prospect of getting hold of the data set; however, the data characteristics sound benign. There were two detectors:
For x we have min: 0, max: 4.34571, avg: 2.0736 and cardinality of y is 430.
This issue is to investigate routes by which this problem could occur. The initial areas to investigate are
CTimeSeriesDecomposition::scale
and the calculation of the count variance scale, particularly for influencers.
cc @LucaWintergerst