-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Convert chemicals to ChEBI rather than Reacto #221
Comments
N.b. Once we are sure that all chemicals do have ChEBI IDs, need to clean up REACTO to remove chemical IDs |
Separate from the missing (unresolving) ChEBI identifiers that were already spotted. I need a way to check the integrity of the identifiers that are resolving. The best method that I can think of is to open the model in the graph editor, output the GPAD and cross check the label in the graph editor with the Chebi identifier in the GPAD and then cross-check those with Chebi. I will also check them with respect to the cross references in Reactome. This is a labor-intensive manual process, but I think it is necessary to ensure that things happened correctly. I will start a spreadsheet and link it to this ticket. I'm not sure how many I will check, but will look at several different pathways and several different kinds of reactions. |
@ukemi that shouldn't have to be done manually. I can probably whip up a way to check automatically once given the GPAD. This is not to stop any manual work that could proceed while the automated check is in progress (that's my usual procedure anyway). |
That would be awesome @nataled! @dustine32 do you know if there are products generated from the development server? If so, is there a GPAD that @nataled could use? Even if it is a mega-file, it would be straightforward to filter on annotations from Reactome models. We might actually want to put something like this in place beyond just for this project. |
Item for Monday "weeds" - what exactly are the integrity problems (wrong charge states of ionizable compounds? other?)? In principle, this is really a Reactome curation integrity issue: we should only be using correct ChEBI instances in the first place, so the follow-on question is how to change Reactome curation and QA practice to fix them at the source. And, as suggested on Wednesday, get rid of ChEBI terms used to identify polynucleotides where SO terms would work. And, probably, also identify classes of ChEBI instances that Reactome needs to annotate weird cases - perhaps we really need those electrons and photons - to add to Jim's list of ChEBI terms legal for GO-CAM. Your spreadsheet will be a good resource for starting to sort this out. |
It's way simpler than that. In my own worrisome way, I just want to make sure that the process worked. That is, when I see a chemical in a model, it is the chemical that was in the original Reactome pathway and the label on the chemical is correct though the chain of Reactome ID->ChebiID in Reactome->ChebiID in GO-CAM->GO-CAM graph label. |
I did a bit of this today and I am convinced that the integrity of the information being transferred is intact: |
This came up during the QC checks of the last release. The initial discussion is pasted below. Also see the discussion from:
#176 (comment) onward.
http://noctua-dev.berkeleybop.org/editor/graph/gomodel:R-HSA-1474151
The text was updated successfully, but these errors were encountered: