-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE REQUEST] .CXG (TileDb compliant) format support from single-cell-explorer to cellxgene Annotate #2655
Comments
Hi @Neah-Ko ! Thanks for your question! Responding inline:
We currently do not have plans to incorporate this feature in Annotate in the near future, that being said our roadmap has not been fully established for the latter half of 2024, so its not totally out of the realm of possibilities (if it becomes enabling or required for some other higher priority features, this would incentivize us to commit some resources towards development).
We really appreciate this offer and if this is a high priority/urgent feature for you, I think it makes sense for you to work on it. If you do end up implementing the feature, we'd definitely like to hear about your experience and technical discovery work on it. We can't promise that the PR would be merged into the main branch since our engineering resources are already committed to higher priority work, and we would likely not have time to review. That being said, hearing about your experience and learnings while implementing it would be invaluable for us and we definitely encourage you to report back with any information that might inform a future direction/implementation of the feature in Annotate. We sincerely value your understanding and enthusiasm for contributing to the CELLxGENE community. Please do continue to share your ideas and feedback with us. |
Hello @MaximilianLombardo So I went on with frontporting the feature, with way more success than I initially expected to be honest :) You may find the code at #2656
Note: The Dockerfile is made to be rebuilt to test changes on sources. For me, installing cellxgene in editable mode didn't worked because then react doesn't find the templates anymore. I had a look on stackoverflow and it may or may not be arranged by tweaking pyproject.toml BehaviorSo far, it seems that both file formats can be read in, with no concerning error in the logs ⭐ In principle, I merged the different parts into an up to date version of CXG, so if something breaks it is probably going to be on the CXG side. I've made some modest attempts at factoring, but it probably deserves another pass. I am going to detail some of my choices and findings: FBS schemaAs mentioned above, I merged into Annotate. I decided to try without regenerating the schema. To support cxg dataset, I had to propagate some To dig more into it, I would need access to at least the compilation chains sent to Conversioncellxgene reads the dataset when converted with the primitives in However, I have to load it setting the flag
It doesn't reads when conversion is done using cxgtool ConfigurationThe configuration scheme evolved from single-cell-explorer to here. At the moment I am not fully certain that all parts are going to be consumed and not taken from the default. I'm almost duplicating the CxgDataset parametersTo enable annotation support and user genesets creation I had to tweak the function In general I could have overlooked something from the many configuration files. Let me know what you think of it, I am going to scan and bring minor improvements where I can next week and try to merge and run both data_adaptors unit tests. |
Hello @MaximilianLombardo , UpdateAs I mentioned above, the pull request now also features I can see that on the At the moment, I don't think I can properly support them in the pull request as they are calling some If I could have access to some documentation on how the schema is re-generated, that would be really helpful. SuggestionWhile browsing for In my opinion that would be a great backend for cellxgene, benefiting from TileDb optimizations and able to output an Anndata object. That could also contribute towards reducing cellxgene RAM footprint, which is the top challenge at the moment. We are pretty committed on using cellxgene for visualization here, we would benefit from improvements. So let me know if I may contribute. Best, |
+1 to this request! |
1 similar comment
+1 to this request! |
Hello and happy new year cellxgene team.
I am working on cellxgene integration to our software stack for my structure @bag-cnag.
By browsing the repositories here and there, I noticed that single-cell-explorer which is the
Is working with your
.cxg/
file structure format which is TileDb based. Annotate however, only takes as input.h5ad
files backed by Anndata.We are considering multiple options to host our files, and are interested by the possible performances offered by TileDb.
Question
Can we expect this feature to be available on Annotate ?
From my understanding the codebase on single-cell-explorer is essentially a fork of annotate, thus it should be in principle integrable.
I will most likely have to work on a PoC on this feature. If you are interested, I could extend a pull request.
The text was updated successfully, but these errors were encountered: