SMC return inferencedata and perform convergence checks #4814

ricardoV94 · 2021-06-28T09:06:44Z

This PR makes sample_smc return InferenceData and run convergence checks by default.

@aloctavodia I am not familiar at all with InferenceData objects so let me know if I am doing something wrong or not returning as much information as we could / should.

You mentioned in #4802 (comment) that we could store the betas and the log_likelihood in sample_stats. However, the log_likelihood already appears automatically in the returned idata. Is that one correct? Should we add ours manually instead, since it is already pre-computed? Or did you mean the log_marginal_likelihood?

These are the variables stored previously in trace.report:

trace.report._n_draws = draws
trace.report._n_tune = 0
trace.report.log_marginal_likelihood = np.array(log_marginal_likelihoods)
trace.report.log_pseudolikelihood = log_pseudolikelihood
trace.report.betas = betas
trace.report.accept_ratios = accept_ratios
trace.report.nsteps = nsteps
trace.report._t_sampling = time.time() - t1

Also, I am running the same convergence checks as in the regular pm.sample. Should we keep doing this, implement SMC-specific checks instead, or do nothing at all?
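For readers unfamiliar with what such a convergence check measures, here is a minimal sketch of the classic (non-split) Gelman-Rubin R-hat in plain NumPy. It is an illustration only: the checks run by pm.sample go through ArviZ and use more refined estimators, and the function name `basic_rhat` and the simulated chains are assumptions made up for this example.

```python
import numpy as np


def basic_rhat(samples):
    """Classic (non-split) Gelman-Rubin R-hat for an array of shape (chain, draw)."""
    samples = np.asarray(samples, dtype=float)
    n_draws = samples.shape[1]
    # Mean of the per-chain variances (within-chain variance).
    within = samples.var(axis=1, ddof=1).mean()
    # Variance of the per-chain means, scaled by the number of draws.
    between = n_draws * samples.mean(axis=1).var(ddof=1)
    # Pooled estimate of the posterior variance.
    var_estimate = (n_draws - 1) / n_draws * within + between / n_draws
    return float(np.sqrt(var_estimate / within))


rng = np.random.default_rng(0)
# Four simulated chains that all target the same distribution.
mixed = rng.normal(size=(4, 1000))
# The same chains, each shifted to a different location (no agreement).
stuck = mixed + np.arange(4)[:, None] * 5.0

rhat_mixed = basic_rhat(mixed)
rhat_stuck = basic_rhat(stuck)
```

Values close to 1 suggest the chains agree; values well above 1 suggest they do not. Whether this kind of diagnostic is fully appropriate for SMC is exactly the open question here.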

ricardoV94 · 2021-06-28T09:13:28Z

Apologies for the reviewer spam; I just thought that all of you might have a better understanding of InferenceData than me :D

pymc3/smc/sample_smc.py

aloctavodia · 2021-06-28T12:26:55Z

Just for the record and summarizing our previous conversation.

We should keep the diagnostics, as we have preliminary simulations showing they are useful (even though they were not designed with SMC in mind).
For ABC (but not "plain SMC") we should overwrite the log-likelihood with the values from "log_pseudolikelihood" (I wonder whether this is still necessary with the changes introduced in Refactor pm.Simulator (WIP) #4802... maybe not!)
We should store the log_marginal_likelihood, betas, accept_ratios and nsteps in sample_stats. The _t_sampling should be added to the attributes automatically, and the same goes for _n_tune (even though it does not make sense for SMC).
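A rough sketch of what gathering these statistics could look like, assuming every chain went through the same number of stages. The `chain_results` structure, the helper name `collect_sample_stats` and all numbers are hypothetical; only the statistic names come from the old trace.report attributes, and this is not necessarily how the PR implements it.

```python
import numpy as np

# Hypothetical per-chain results; values are made up for illustration.
chain_results = [
    {
        "log_marginal_likelihood": -12.3,
        "betas": [0.0, 0.4, 1.0],
        "accept_ratios": [0.50, 0.42, 0.31],
        "nsteps": [25, 25, 30],
    },
    {
        "log_marginal_likelihood": -12.1,
        "betas": [0.0, 0.5, 1.0],
        "accept_ratios": [0.48, 0.40, 0.35],
        "nsteps": [25, 27, 30],
    },
]


def collect_sample_stats(chain_results):
    """Stack per-chain statistics into arrays keyed by statistic name."""
    per_stage_names = ("betas", "accept_ratios", "nsteps")
    # Per-stage statistics become arrays of shape (chain, stage).
    stats = {
        name: np.array([result[name] for result in chain_results])
        for name in per_stage_names
    }
    # Assumption: one log marginal likelihood value per chain.
    stats["log_marginal_likelihood"] = np.array(
        [result["log_marginal_likelihood"] for result in chain_results]
    )
    return stats


stats = collect_sample_stats(chain_results)
```

A dictionary of this form is the kind of input that could then be handed to whatever builds the sample_stats group.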

ricardoV94 · 2021-06-28T13:37:18Z

We should store the log_marginal_likelihood, betas, accept_ratios and nsteps in sample_stats. The _t_sampling should be added to the attributes automatically, and the same goes for _n_tune (even though it does not make sense for SMC).

I am reading here https://arviz-devs.github.io/arviz/schema/schema.html?highlight=sample_stats#sample-stats, and it seems sample_stats expects one measure per posterior sample, but most of those SMC measures do not work like this. Should I just ignore the text and save them there anyway?

ricardoV94 · 2021-06-28T15:52:27Z

Pushed changes to include the sampler stats as well. It feels pretty hacky, because I have to deal with the case where chains have different numbers of draws. Let me know if you have a better suggestion.

aloctavodia · 2021-06-28T16:50:50Z

I think that sample_stats should be more general, but that's on the ArviZ side.

Different chains should have the same number of draws.

ricardoV94 · 2021-06-28T17:06:45Z

I think that sample_stats should be more general, but that's on the ArviZ side.

Different chains should have the same number of draws.

What I was calling draws here are the "stages". Sorry for the confusion. Those can vary between chains.
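To make the problem concrete: if chains finish after different numbers of stages, the per-stage statistics form a ragged list that cannot be stacked directly into a "chain" x "stage" array. One possible workaround, sketched below, is to pad the shorter chains with NaN. The helper name `pad_stage_stats` and the example betas are made up for illustration, and this is not necessarily what the PR does.

```python
import numpy as np


def pad_stage_stats(per_chain_values, fill_value=np.nan):
    """Pad ragged per-chain stage values into a (chain, stage) float array."""
    n_chains = len(per_chain_values)
    max_stages = max(len(values) for values in per_chain_values)
    # Start from an array full of the fill value, then copy each chain in.
    padded = np.full((n_chains, max_stages), fill_value, dtype=float)
    for chain, values in enumerate(per_chain_values):
        padded[chain, : len(values)] = values
    return padded


# Hypothetical betas: the first chain needed 3 stages, the second needed 4.
betas = [[0.0, 0.3, 1.0], [0.0, 0.1, 0.5, 1.0]]
padded_betas = pad_stage_stats(betas)
```

The trade-off is that integer statistics such as nsteps become floats, and downstream code has to skip the NaN entries.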

michaelosthege

LGTM

ricardoV94 · 2021-06-29T13:23:28Z

The current sample_stats dimensions are "chain" x "draw" by default. Should I rename them to "chain" x "stage", or is it not worth the trouble?

aloctavodia · 2021-06-30T11:57:17Z

I think it is not worth the trouble at this point.

ricardoV94 added the request discussion and SMC (Sequential Monte Carlo) labels Jun 28, 2021

ricardoV94 force-pushed the smc_inferencedata branch from 5d0aa63 to e738c67 June 28, 2021 09:08

ricardoV94 changed the title ~~SMC inferencedata and convergence checks~~ SMC return inferencedata and perform convergence checks Jun 28, 2021

ricardoV94 requested review from aloctavodia, OriolAbril and michaelosthege June 28, 2021 09:11

Return InferenceData and run convergence checks in sample_smc by default

75a3cf9

ricardoV94 force-pushed the smc_inferencedata branch from e738c67 to 75a3cf9 June 28, 2021 09:29

michaelosthege reviewed Jun 28, 2021

pymc3/smc/sample_smc.py (resolved)

pymc3/smc/sample_smc.py (resolved)

aloctavodia reviewed Jun 28, 2021

pymc3/smc/sample_smc.py (resolved)

Fix failing tests

fdfe339

Add SMC sample_stats to InferenceData

a453776

ricardoV94 marked this pull request as ready for review June 28, 2021 16:42

michaelosthege approved these changes Jun 28, 2021

ricardoV94 removed the request discussion label Jun 29, 2021

aloctavodia approved these changes Jun 30, 2021

ricardoV94 merged commit 4505f14 into pymc-devs:main Jun 30, 2021

ricardoV94 deleted the smc_inferencedata branch June 30, 2021 12:31
