
Move predict from Turing #716

Draft · wants to merge 10 commits into master

Conversation

@sunxd3 (Member) commented Nov 12, 2024

This PR aims to move the predict function from the Turing.jl repo to here (DynamicPPL). This PR won't change the way that predict is fundamentally implemented. (Later, in #651, we will transition to using fix to implement predict.)
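
For context, a minimal sketch of the fix-based idea (the model and the fixed value below are illustrative, not from the PR): fix the parameter values of one posterior draw in the model, then sample the remaining variables.

```julia
using DynamicPPL, Distributions

@model function demo(y)
    μ ~ Normal()
    y ~ Normal(μ, 1)
end

m = demo(missing)          # y is unobserved, so it is the quantity to predict
m_fixed = fix(m; μ = 0.3)  # pin μ to the value from one posterior draw
y_draw = rand(m_fixed)     # re-running the model now only samples the non-fixed variables
```

predict would then essentially repeat this for every posterior draw.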

The challenges of this PR are:

  1. predict returns an MCMCChains.Chains (see the usage sketch after this list)
  2. the implementation in Turing.jl uses the Chains-generation pipeline in Turing.jl (the same pipeline called at the end of sample)
  3. it doesn't really make sense to move all the Chains-related util functions into DynamicPPL
  4. so we need to separate out a subset of the util functions and add them to DynamicPPL
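
For context, a hedged sketch of how predict is used today at the Turing.jl level (the model, data, and sampler settings are illustrative):

```julia
using Turing

@model function linreg(x, y)
    σ ~ truncated(Normal(), 0, Inf)
    β ~ Normal()
    for i in eachindex(x)
        y[i] ~ Normal(β * x[i], σ)
    end
end

x = randn(20)
chain = sample(linreg(x, 0.5 .* x .+ 0.1 .* randn(20)), NUTS(), 200)

# Pass a model whose observations are `missing` to predict them;
# the result comes back as an MCMCChains.Chains.
y_pred = predict(linreg(x, Vector{Union{Missing,Float64}}(undef, 20)), chain)
```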

What I have done so far:

  1. moved the predict function and recovered the subset of the util functions needed to make it functional
  2. sample in tests now uses the LogDensityFunction interface (see the sketch after this list)
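
To illustrate point 2, a hedged sketch of sampling through the LogDensityFunction interface (the model, the choice of AdvancedMH, and the sample count are illustrative; the PR's actual tests may differ):

```julia
using DynamicPPL, Distributions, AbstractMCMC, AdvancedMH, LogDensityProblems, LinearAlgebra

@model function demo(y)
    μ ~ Normal()
    y ~ Normal(μ, 1)
end

ldf = DynamicPPL.LogDensityFunction(demo(0.7))
d = LogDensityProblems.dimension(ldf)

# Wrap the log density for AdvancedMH and run a random-walk Metropolis sampler;
# the result is a plain vector of transitions rather than an MCMCChains.Chains.
density_model = AdvancedMH.DensityModel(θ -> LogDensityProblems.logdensity(ldf, θ))
transitions = sample(density_model, AdvancedMH.RWMH(MvNormal(zeros(d), I)), 1_000)
```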

Modifications made to the moved util functions:

  1. AbstractMCMC.bundle_samples is renamed to _bundle_samples; unused keyword arguments are removed
  2. the Transition type is copied from the Turing.jl repo, but the stat field is removed as it is never used in predict (see the sketch after this list)
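
Roughly, the copied type then has this shape (a hedged illustration; field names follow Turing.jl's Transition, minus the removed stat field):

```julia
# Sketch only: the PR's actual definition may differ in details such as type parameters.
struct Transition{T,L<:Real}
    θ::T    # the parameter values of this sample
    lp::L   # the log joint probability of this sample
end
```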

That said, most of the functions in this PR should be identical to, or straightforwardly identifiable in, the Turing.jl code.

@sunxd3 marked this pull request as draft November 13, 2024 08:52
sunxd3 and others added 2 commits November 13, 2024 09:21
@sunxd3 (Member Author) commented Nov 14, 2024

Some tests still fail: the mean of the predictions looks correct, but the variance seems too high. I'm not certain where it goes wrong, so this needs further investigation.

The reason is that some tests implicitly rely on the variance of the posterior samples. Discarding some initial samples fixes this. Turing does this by default, but via LogDensityFunction we need to do the discarding explicitly (see the sketch below).
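
For example (a hedged sketch; the model, sampler, and cutoffs are illustrative): sampling through the LogDensityFunction interface yields a plain vector of draws, so the warm-up portion has to be dropped by hand, either by slicing or via AbstractMCMC's discard_initial keyword.

```julia
using DynamicPPL, Distributions, AbstractMCMC, AdvancedMH, LogDensityProblems, LinearAlgebra

@model function demo(y)
    μ ~ Normal()
    y ~ Normal(μ, 1)
end

ldf = DynamicPPL.LogDensityFunction(demo(0.7))
density_model = AdvancedMH.DensityModel(θ -> LogDensityProblems.logdensity(ldf, θ))
spl = AdvancedMH.RWMH(MvNormal(zeros(LogDensityProblems.dimension(ldf)), I))

# Drop the first half of the draws as warm-up before computing posterior statistics...
transitions = sample(density_model, spl, 2_000)
posterior_draws = transitions[1_001:end]

# ...or ask AbstractMCMC to discard them up front.
posterior_draws2 = sample(density_model, spl, 1_000; discard_initial=1_000)
```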

sunxd3 and others added 4 commits November 18, 2024 11:06
@coveralls commented Nov 18, 2024

Pull Request Test Coverage Report for Build 12007336979

Details

  • 47 of 48 (97.92%) changed or added relevant lines in 2 files are covered.
  • 25 unchanged lines in 4 files lost coverage.
  • Overall coverage increased (+0.2%) to 84.55%

Changes Missing Coverage          Covered Lines   Changed/Added Lines   %
ext/DynamicPPLMCMCChainsExt.jl    34              35                    97.14%

Files with Coverage Reduction     New Missed Lines   %
src/model.jl                      1                  94.44%
src/varinfo.jl                    6                  86.3%
src/simple_varinfo.jl             6                  86.6%
src/threadsafe.jl                 12                 57.76%
Totals Coverage Status
Change from base Build 11934706726: 0.2%
Covered Lines: 3601
Relevant Lines: 4259

💛 - Coveralls


codecov bot commented Nov 18, 2024

Codecov Report

Attention: Patch coverage is 97.91667% with 1 line in your changes missing coverage. Please review.

Project coverage is 84.55%. Comparing base (ba490bf) to head (53b6749).

Files with missing lines          Patch %   Lines
ext/DynamicPPLMCMCChainsExt.jl    97.14%    1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #716      +/-   ##
==========================================
+ Coverage   84.35%   84.55%   +0.19%     
==========================================
  Files          30       30              
  Lines        4211     4259      +48     
==========================================
+ Hits         3552     3601      +49     
+ Misses        659      658       -1     

☔ View full report in Codecov by Sentry.

@sunxd3 (Member Author) commented Nov 18, 2024

We had a quick discussion on this at today's meeting. Tor raised that we should probably implement predict so that it takes a generic Vector as the second argument (instead of just a Chains); this is because predict works with sample, and sample can produce non-Chains return types.

Also, although we don't use fix in this PR yet, it is worthwhile to have a nicer and better-thought-out implementation.

@torfjelde (Member) commented:

> Vector as the second argument

Specifically, I was thinking Vector{<:VarInfo} :) But otherwise, this sounds very good 👍
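
For reference, a hedged sketch of what that suggestion might look like as a method signature (the name predict_from_varinfos and the body are purely illustrative; working out the actual implementation is the point of this PR):

```julia
using DynamicPPL, Random

function predict_from_varinfos(
    rng::Random.AbstractRNG,
    model::DynamicPPL.Model,
    varinfos::AbstractVector{<:DynamicPPL.AbstractVarInfo},
)
    return map(varinfos) do varinfo
        # For each posterior draw: fix its values in `model` and sample the
        # remaining (unobserved) variables. Deliberately left unimplemented here.
        error("illustrative sketch only")
    end
end
```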

sunxd3 and others added 3 commits November 21, 2024 12:42
The following review thread is on these lines of the diff:

    varinfos::AbstractArray{<:AbstractVarInfo};
    include_all=false,
    )
    predictive_samples = Array{PredictiveSample}(undef, size(varinfos))
A reviewer (Member) commented:
Do we really need the PredictiveSample here?

My original suggestion was just to use Vector{<:OrderedDict} for the return-value (an abstractly typed PredictiveSample doesn't really offer anything beyond this, does it?)

@sunxd3 (Member Author) replied:

I haven't thought too deeply about this. A new type is certainly easier to dispatch on, but it may not be necessary. Let me look into it.
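
For concreteness, a hedged sketch contrasting the two return-value shapes discussed in this thread (the PredictiveSample field shown here is illustrative, not the PR's actual definition):

```julia
using DynamicPPL, OrderedCollections

# (a) A dedicated wrapper type, one instance per posterior draw:
struct PredictiveSample
    values::OrderedDict{VarName,Any}  # predicted value for each variable name
end

# (b) The suggestion: return the per-draw dictionaries directly as a
#     Vector{<:OrderedDict}, with no wrapper type around them.
predictions = [
    OrderedDict{VarName,Any}(@varname(y) => 0.1),
    OrderedDict{VarName,Any}(@varname(y) => -0.4),
]
```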

@torfjelde (Member) commented:

Otherwise stuff is starting to look nice though :)

Labels: None yet
Projects: None yet
3 participants