MCKP gene iterator speedup #264

sjfleming
Member

There is an iterator that produces logical (boolean) arrays used to index the COO sparse posterior object, so that it can be broken up into "independent" chunks for MCKP estimation. (Independence is achieved simply by keeping all entries for a given gene within a single chunk, since each gene is independent of every other gene during MCKP estimation.)

Previously, it was implemented as a generator that yielded one logical array at a time. The implementation was a bit clumsy.

Now, it is implemented as a simple list, computed as fast as I can figure. The speedup for large posterior COO objects is at least a factor of 100. This is very important: for large datasets the old iterator was taking quite a long time in some cases, seemingly on the order of 10 minutes or more.
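
To illustrate the idea (this is not the code in this PR), here is a minimal NumPy sketch of building such a list of logical arrays in one vectorized pass. The function name `gene_chunk_masks`, the `gene_index` array (the gene index of each COO entry), and the `n_chunks` parameter are all hypothetical names chosen for the example.

```python
import numpy as np


def gene_chunk_masks(gene_index, n_chunks):
    """Return a list of boolean masks over COO entries, one mask per chunk.

    Hypothetical helper: `gene_index[i]` is assumed to be the gene that
    COO entry `i` belongs to. Every entry of a given gene lands in the
    same chunk, so chunks can be processed independently.
    """
    gene_index = np.asarray(gene_index).ravel()
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    if gene_index.size == 0:
        return []

    # Count entries per gene and remember which gene each entry maps to.
    _, inverse, counts = np.unique(
        gene_index, return_inverse=True, return_counts=True
    )

    # Assign genes (in sorted order) to chunks, balancing chunks by the
    # cumulative number of entries rather than by the number of genes.
    cumulative = np.cumsum(counts)
    total = cumulative[-1]
    gene_chunk = (cumulative - 1) * n_chunks // total

    # Look up the chunk of each entry, then build one mask per used chunk.
    entry_chunk = gene_chunk[inverse]
    return [entry_chunk == c for c in np.unique(gene_chunk)]


# Example: seven COO entries spread over four genes, split into two chunks.
masks = gene_chunk_masks(np.array([0, 0, 1, 2, 2, 2, 3]), n_chunks=2)
for mask in masks:
    print(mask)
```

Because the result is a plain list, it can be indexed, counted with `len()`, and reused across passes, which a generator does not allow. The chunk count may come out smaller than `n_chunks` when a few genes hold most of the entries.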

sjfleming merged commit 7fd0dac into sf_memory_efficient_posterior_generation on Aug 30, 2023