Model API cleanup #6309
Merged: 6 commits merged into main from model_api_cleanup on Nov 18, 2022

Conversation

ricardoV94 (Member) commented Nov 17, 2022:

Trying to standardize mapping names a bit more to facilitate model transformations such as in pymc-devs/pymc-extras#91

Closes #6305
Closes #5076

Major / Breaking Changes

  • Sampling of transformed variables from prior_predictive is no longer allowed (see the sketch after this list)

Bugfixes / New features

  • ...

Docs / Maintenance

  • Rename several internal Model variables
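
To illustrate the breaking change flagged above, a minimal sketch (the model and variable names are made up, and the exact exception raised is not asserted here):

```python
import pymc as pm

with pm.Model() as model:
    sigma = pm.HalfNormal("sigma", 1.0)
    y = pm.Normal("y", 0.0, sigma, observed=[0.1, -0.3, 0.5])

    # Constrained free RVs and observed RVs can still be requested as before
    prior = pm.sample_prior_predictive(var_names=["sigma", "y"])

    # Requesting the transformed value variable is no longer supported; before
    # this PR it was returned when listed explicitly in var_names.
    # pm.sample_prior_predictive(var_names=["sigma_log__"])  # now raises an error
```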

@ricardoV94 force-pushed the model_api_cleanup branch 3 times, most recently from 8a81537 to 43e3f05 on November 17, 2022 at 10:15
@ricardoV94 force-pushed the model_api_cleanup branch 2 times, most recently from de31798 to 0121fd9 on November 17, 2022 at 10:30
codecov bot commented Nov 17, 2022

Codecov Report

Merging #6309 (60f15e1) into main (d4ff7ae) will increase coverage by 2.31%.
The diff coverage is 94.44%.

Additional details and impacted files


@@            Coverage Diff             @@
##             main    #6309      +/-   ##
==========================================
+ Coverage   91.87%   94.18%   +2.31%     
==========================================
  Files         111      111              
  Lines       23917    23908       -9     
==========================================
+ Hits        21973    22518     +545     
+ Misses       1944     1390     -554     
Impacted Files (Coverage Δ):
pymc/initial_point.py 100.00% <ø> (ø)
pymc/tests/distributions/test_continuous.py 99.76% <ø> (-0.01%) ⬇️
pymc/model.py 89.76% <85.71%> (-0.29%) ⬇️
pymc/backends/arviz.py 90.71% <100.00%> (+2.90%) ⬆️
pymc/data.py 80.08% <100.00%> (ø)
pymc/model_graph.py 78.82% <100.00%> (ø)
pymc/sampling/forward.py 95.45% <100.00%> (-0.11%) ⬇️
pymc/sampling/jax.py 98.19% <100.00%> (+0.85%) ⬆️
pymc/sampling/mcmc.py 92.27% <100.00%> (ø)
pymc/smc/kernels.py 97.41% <100.00%> (+0.03%) ⬆️
... and 8 more

@ricardoV94 added the major label (Include in major changes release notes section) on Nov 17, 2022
@ricardoV94 added this to the v4.4.0 milestone on Nov 17, 2022
@ricardoV94 changed the title from "Model api cleanup" to "Model API cleanup" on Nov 17, 2022
@ricardoV94 force-pushed the model_api_cleanup branch 2 times, most recently from 9246301 to 29e8e05 on November 17, 2022 at 14:18
michaelosthege (Member) left a comment:

Trusting that you'll fix that one test 👍

This also disables prior_predictive sampling of transformed variables
This property was initially added just to handle deterministics created by automatic imputation, in order to ensure that the combined tensor of missing and observed components shows up in prior and posterior predictive sampling. At the same time, it allowed hiding the deterministic during MCMC sampling, saving memory for large datasets. This last benefit is lost for the sake of simplicity. If a user is concerned, they can manually split the observed and missing components of a dataset when defining their model.
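
For reference, a rough sketch of that manual split (data, priors, and variable names are illustrative, not taken from this PR):

```python
import numpy as np
import pymc as pm

data = np.array([1.0, 2.0, np.nan, 4.0])
observed_mask = ~np.isnan(data)

with pm.Model() as model:
    mu = pm.Normal("mu", 0.0, 1.0)
    sigma = pm.HalfNormal("sigma", 1.0)
    # Observed component: only the non-missing entries enter the likelihood
    y_observed = pm.Normal("y_observed", mu, sigma, observed=data[observed_mask])
    # Missing component: modelled explicitly as free draws from the same
    # distribution, instead of relying on automatic imputation of a NaN/masked array
    y_missing = pm.Normal("y_missing", mu, sigma, shape=int((~observed_mask).sum()))
```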
@@ -343,7 +362,7 @@ def sample_prior_predictive(
     var_names : Iterable[str]
         A list of names of variables for which to compute the prior predictive
         samples. Defaults to both observed and unobserved RVs. Transformed values
-        are not included unless explicitly defined in var_names.
+        are not allowed.
pipme (Contributor) commented:

Any reason why transformed (unconstrained) values are not allowed here? I need to get prior samples in the transformed space, what should I do? Thanks!

(MCMC sampling with the observations removed from the likelihood is not a good option. I guess I could also use sample_prior_predictive to get prior parameter samples and then map them to the unconstrained space, but that does not seem easy to do with Transforms in an efficient, vectorized way, i.e. it seems I would have to use a for-loop.)

ricardoV94 (Member, Author) commented Jul 11, 2024:

A helper to go from unconstrained back to constrained draws is needed elsewhere (especially since we don't save them in InferenceData after sampling), so we can add that, and it will also cover your use case.

I think there is an open issue for that.

ricardoV94 (Member, Author):

This issue: #6721

pipme (Contributor) commented Jul 11, 2024:

Thanks for the quick reply! Transforming efficiently between unconstrained and constrained draws would be super useful for our use case. What would be needed to implement it? In the meantime, what would be a good workaround? (I need to transform a lot of prior samples from constrained to unconstrained. In fact, unconstrained prior samples are even more useful to me than constrained ones, so it would be better if I could get them directly.) Could sample_prior_predictive be changed back to include transformed variables like sd_log__?

For the InferenceData, pm.sample(..., idata_kwargs={"include_transformed": True}) seems to include the transformed samples.

pipme (Contributor) commented Jul 11, 2024:

Hi, please see my comment above. Any reason why transformed (unconstrained) values are not allowed in sample_prior_predictive? I need to get prior samples in the transformed space, what should I do? Thanks!

ricardoV94 (Member, Author):

No, we shouldn't change sample_prior_predictive, but you can of course change it in your local installation if that's an option for you. Or you can do something like this yourself: https://discourse.pymc.io/t/logp-questions-synthetic-dataset-to-evaluate-modeling/12129/6?u=ricardov94

I'll open a PR soon to add the functionality.
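
For illustration, a rough sketch of one possible workaround (not necessarily what the linked thread does), assuming the model exposes the rvs_to_transforms mapping and that the transform's forward method accepts the RV's inputs, as in recent PyMC versions:

```python
import pymc as pm

with pm.Model() as model:
    sigma = pm.HalfNormal("sigma", 1.0)
    idata = pm.sample_prior_predictive(500)

# Constrained prior draws, shape (chain, draw)
sigma_draws = idata.prior["sigma"].values

# Map the constrained draws to the unconstrained space in one vectorized call;
# for HalfNormal the registered transform is a log transform, so this should
# reproduce what the sigma_log__ value variable would contain.
transform = model.rvs_to_transforms[sigma]
sigma_unconstrained = transform.forward(sigma_draws, *sigma.owner.inputs).eval()
```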

Labels: maintenance, major (Include in major changes release notes section)
Projects: none yet
3 participants