[ML] Implements an absolute goodness-of-fit test to accept a change #21

tveasey · 2018-03-21T09:42:03Z

The key issue we had with change detection, prior to this PR, was that all the tests were relative, i.e. in terms of relative evidence to no change. This meant that if the time series changed in a way which was not well described by one of the possible changes we consider, it was quite possible to accept a change versus the hypothesis that the time series hadn't changed. This led to degraded adaption of the model: whose parameters should be rapidly relearnt as we do now in this case.

This PR implements an absolute "goodness-of-fit" test for each change, by additionally testing versus its expected BIC given the residual distribution. It means we will only accept changes which are a reasonably accurate description of the change currently occurring in the time series.

…ge-modelling-part-3

edsavage

Looks good Tom.

edsavage · 2018-03-26T09:44:13Z

lib/maths/CTimeSeriesChangeDetector.cc

@@ -555,6 +627,16 @@ void CUnivariateTimeShiftModel::addSamples(std::size_t count,
        {
            this->addLogLikelihood(logLikelihood);
        }
+        for (const auto &weight : weights)
+        {
+            double expectedLogLikelihood;


I know this is safe in this context but I would prefer that this variable was initialized.

I tend not to explicitly initialise variables which are immediately set by a subsequent function call. Also, we tend to run into this case more than we might otherwise because we mandated to use bool return types to indicate success/failure rather than exceptions.

I wonder, since C++11 cleaned up initialisation, whether we should mandate instead that variables are, as a minimum, value initialised, i.e. double expectedLogLikelihood{}; for example. Unfortunately, there are will be a lot of cases which violate this at present, but something we could think about adding to the coding standards and then fixing in a targeted change.

edsavage · 2018-03-26T09:47:01Z

include/maths/CTimeSeriesChangeDetector.h

@@ -350,6 +366,9 @@ class MATHS_EXPORT CUnivariateTimeShiftModel final : public CUnivariateChangeMod
        //! The BIC of applying the time shift.
        virtual double bic() const;

+        //! The expected BIC of applying the change.
+        virtual double expectedBic() const;


Going forward I think it would be best practice to use the 'override' keyword in cases such as this.

Visual Studio 2013 doesn't support override, so bear in mind this will create a backporting headache. I agree for 7.0-only code we should use it though.

I noticed using override on one method generates warnings for every other virtual function that is not marked override (-Winconsistent-missing-override) in the compilation unit. It should probably be done in a single commit

I'm inclined to agree. I think this is something we should adopt, but also let's make this in a single change and target at 7.0 only as there would be no need to back port.

) This implements an absolute "goodness-of-fit" test for each change, by additionally testing a change versus its expected BIC given the residual distribution. It means we will only accept changes if they are a reasonably accurate description of the change currently occurring in the time series.

droberts195 · 2018-12-18T11:53:47Z

Removing version label as this is a feature branch PR and it causes confusion when generating release notes. (For interest this was eventually merged to 6.4 and above in #92.)

tveasey added 6 commits March 19, 2018 17:03

Implement an absolute test for suitability of change hypotheses

9b91ed5

Remove debug

0dbf15b

Fix restore

628eb53

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

a9531fa

…ge-modelling-part-3

Smooth decision to accept change over various factors

c388c9b

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

a87b74e

…ge-modelling-part-3

tveasey added >enhancement v7.0.0 :ml labels Mar 21, 2018

tveasey requested a review from edsavage March 21, 2018 09:43

tveasey added 2 commits March 23, 2018 14:37

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

030d613

…ge-modelling-part-3

Bad merge

fbb511f

edsavage approved these changes Mar 26, 2018

View reviewed changes

Explicitly initialise likelihoods

d1e466e

tveasey merged commit 1eb8f8a into elastic:feature/forecast-enhancements-part-2 Mar 26, 2018

droberts195 removed the v7.0.0 label Dec 18, 2018

droberts195 mentioned this pull request Dec 18, 2018

[DOCS] Updates changelog for 7.0.0-alpha2 #347

Merged

davidkyle mentioned this pull request Jun 20, 2023

[NLP] Catch exceptions thrown during inference and report as errors #2542

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Implements an absolute goodness-of-fit test to accept a change #21

[ML] Implements an absolute goodness-of-fit test to accept a change #21

tveasey commented Mar 21, 2018

edsavage left a comment

edsavage Mar 26, 2018

tveasey Mar 26, 2018

edsavage Mar 26, 2018

droberts195 Mar 26, 2018

davidkyle Mar 26, 2018

tveasey Mar 26, 2018

droberts195 commented Dec 18, 2018

[ML] Implements an absolute goodness-of-fit test to accept a change #21

[ML] Implements an absolute goodness-of-fit test to accept a change #21

Conversation

tveasey commented Mar 21, 2018

edsavage left a comment

Choose a reason for hiding this comment

edsavage Mar 26, 2018

Choose a reason for hiding this comment

tveasey Mar 26, 2018

Choose a reason for hiding this comment

edsavage Mar 26, 2018

Choose a reason for hiding this comment

droberts195 Mar 26, 2018

Choose a reason for hiding this comment

davidkyle Mar 26, 2018

Choose a reason for hiding this comment

tveasey Mar 26, 2018

Choose a reason for hiding this comment

droberts195 commented Dec 18, 2018