[ML] First pass implementation of support functionality for change detection and modelling #9

tveasey · 2018-03-08T14:47:35Z

Description
This implements 1) a naive Bayes classifier, using our distribution models, which will be used for modelling the probability of a change, and 2) a change detector framework, currently supporting detecting level shifts and time shifts, which works by comparing BIC of the various possible hypotheses against one another and a null hypothesis that there is no change. (Note that this work is going to initially be implemented on a feature branch to enable incremental review of the changes and so we can evaluate on all our QA data sets before merging to master.)

Effects
The functionality are not used in this change, so it has no effect on our results.

droberts195 · 2018-03-09T17:32:56Z

lib/maths/unittest/CTimeSeriesChangeDetectorTest.h

+        static CppUnit::Test *suite();
+
+    private:
+        using TGenerator = double (*)(ml::core_t::TTime);


Is the (*) necessary here with a using? You didn't need it two lines below.

No, you're completely right. I think I may just be a conversion from typedef error.

Actually, this has to be a function pointer type (otherwise I can't create a container of them). The other is just a function type. I'll switch all this code to use std::function since I think the intention is then clearer.

…ge-modelling-part-1

droberts195

LGTM

hendrikmuhs · 2018-03-09T20:21:49Z

lib/maths/CTimeSeriesChangeDetector.cc

+    }
+
+    double logLikelihood;
+    if (count >= 5 && m_ResidualModel->jointLogMarginalLikelihood(


deserves an explaination, why 5? If count < 5 do we need samples? Seems like not.

The mean can vary early on, so the effective number of parameters for small n is greater than 1 (the mean) for the case one assumes that there might be a level shift. I started off trying to correct for the bias, but decided in the end it was easier just to allow some updates for the mean to stabilise before first updating the log-likelihood. The exact value of this constant is not really important, but ideally one wants to choose it as small as possible without incurring a significant boost to the sum log-likelihood for this hypothesis. This was empirically a good choice, i.e. for the case that there is no level shift I found that the sum log-likelihood was very similar with and without an assumed mean shift for many independent runs. I commented on this in CUnivariateTimeSeriesLevelShiftModel but will cross reference that comment here.

…tection and modelling (#9) This implements 1) a naive Bayes classifier, using our distribution models, which will be used for modelling the probability of a change, and 2) a change detector framework, currently supporting detecting level shifts and time shifts, which works by comparing BIC of the various possible hypotheses against one another and a null hypothesis that there is no change.

droberts195 · 2018-12-18T11:53:16Z

Removing version label as this is a feature branch PR and it causes confusion when generating release notes. (For interest this was eventually merged to 6.4 and above in #92.)

tveasey added 5 commits February 21, 2018 18:40

Improvements to trend modelling and periodicity testing for forecasting.

32633b6

Ground work utilities for change detection and modelling

62b4316

Merge branch 'master' into feature/change-modelling-part-1

0df49e3

Don't bump state

e66a1b1

Sanitise argument so we don't get out-of-bounds access

5265ee1

tveasey added >enhancement v7.0.0 labels Mar 8, 2018

tveasey requested a review from droberts195 March 8, 2018 14:47

droberts195 reviewed Mar 9, 2018

View reviewed changes

tveasey added 2 commits March 9, 2018 17:40

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

82961d4

…ge-modelling-part-1

Review comments

74969b2

droberts195 approved these changes Mar 9, 2018

View reviewed changes

hendrikmuhs reviewed Mar 9, 2018

View reviewed changes

Fuller explanation of the delay to update log-likelihoods

6a83e02

tveasey merged commit 5587587 into elastic:feature/forecast-enhancements-part-2 Mar 12, 2018

sophiec20 added the :ml label Apr 4, 2018

droberts195 removed the v7.0.0 label Dec 18, 2018

droberts195 mentioned this pull request Dec 18, 2018

[DOCS] Updates changelog for 7.0.0-alpha2 #347

Merged

davidkyle mentioned this pull request Jun 20, 2023

[NLP] Catch exceptions thrown during inference and report as errors #2542

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] First pass implementation of support functionality for change detection and modelling #9

[ML] First pass implementation of support functionality for change detection and modelling #9

tveasey commented Mar 8, 2018

droberts195 Mar 9, 2018

tveasey Mar 9, 2018

tveasey Mar 9, 2018

droberts195 left a comment

hendrikmuhs Mar 9, 2018 •

edited

Loading

tveasey Mar 12, 2018 •

edited

Loading

droberts195 commented Dec 18, 2018

[ML] First pass implementation of support functionality for change detection and modelling #9

[ML] First pass implementation of support functionality for change detection and modelling #9

Conversation

tveasey commented Mar 8, 2018

droberts195 Mar 9, 2018

Choose a reason for hiding this comment

tveasey Mar 9, 2018

Choose a reason for hiding this comment

tveasey Mar 9, 2018

Choose a reason for hiding this comment

droberts195 left a comment

Choose a reason for hiding this comment

hendrikmuhs Mar 9, 2018 • edited Loading

Choose a reason for hiding this comment

tveasey Mar 12, 2018 • edited Loading

Choose a reason for hiding this comment

droberts195 commented Dec 18, 2018

hendrikmuhs Mar 9, 2018 •

edited

Loading

tveasey Mar 12, 2018 •

edited

Loading