Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add chemical datatypes #1941

Merged
merged 3 commits into from
Mar 21, 2016
Merged

Conversation

bgruening
Copy link
Member

This will add a bunch of molecule datatypes to Galaxy. It was developed as part of the ChemicalToolBox and I feel it's time to move them into core.

The converters rely on a binary called OpenBabel which can will be installed during runtime, if conda dependency resolution is enabled.

I marked it as WIP as I have some smaller corrections to do, but would appreciate a first review and it it's ok to move them into main.

@bgruening bgruening self-assigned this Mar 17, 2016
@bgruening bgruening added this to the 16.04 milestone Mar 17, 2016
<sniffer type="galaxy.datatypes.molecules:InChI"/>
<sniffer type="galaxy.datatypes.molecules:FPS"/>
<sniffer type="galaxy.datatypes.molecules:CML"/>
<!-- TODO: see molecules.py <sniffer type="galaxy.datatypes.molecules:SMILES"/>-->
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you update the doctests for guess_ext with sniffer tests for these datatypes - at least the ones you have test data in the repo for:

https://github.com/galaxyproject/galaxy/blob/dev/lib/galaxy/datatypes/sniff.py#L259

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually, I already have some doctests. But I need to extend them and add more test data. Still struggling to find small once. Will do so!

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Your doctests are testing the sniff method - putting them up in guess_ext would test the actual datatype conf file - that the sniffer is in the right order, etc...

@jmchilton
Copy link
Member

Conda is paying dividends all over the place, this is awesome @bgruening.

@bgruening bgruening force-pushed the chemical_datatypes branch 2 times, most recently from 3e634c7 to 1187e79 Compare March 18, 2016 16:16
@bgruening
Copy link
Member Author

@jmchilton can you please look over it. I'm not sure this all is needed. Why do I need to edit registry.py?

@bgruening bgruening changed the title WIP: add chemical datatypes add chemical datatypes Mar 19, 2016
@martenson
Copy link
Member

@bgruening Does lib/galaxy/datatypes/test/2zbz.pdb have to have 4k lines?
Besides that it looks very fine to me.

@bgruening
Copy link
Member Author

@martenson I can search for a smaller file. Do you know why I need to touch the registry.py? Shouldn’t be the ordering in datatypes_conf be sufficient?

@martenson
Copy link
Member

@bgruening I thought the conf is sufficient. Why do you think you need to touch registry.py?

@bgruening
Copy link
Member Author

@martenson
Copy link
Member

@bgruening I cannot comment on what the best way is but I see no harm done by adding it there.

@bgruening bgruening force-pushed the chemical_datatypes branch from 1187e79 to c390efb Compare March 20, 2016 17:04
@bgruening bgruening force-pushed the chemical_datatypes branch from c390efb to 1e9b0e6 Compare March 20, 2016 17:07
@bgruening
Copy link
Member Author

@martenson I have found a smaller file and rebased this PR.

@jmchilton
Copy link
Member

The registry.py changes should not be needed, does this fail without them?

@martenson
Copy link
Member

@jmchilton I think that is what @bgruening said

@jmchilton
Copy link
Member

Okay - I see that now - I'll pull this down and hack on it.

@jmchilton
Copy link
Member

Clearly I hadn't made the change to test the datatypes conf that I thought I did - I've included the fix with bgruening#9. I wonder if this is the problem with #1156...

Fix datatype sniff test to use sample and revert molecule registry ha…
@bgruening
Copy link
Member Author

Thanks @jmchilton! Seems to work as expected! Can we merge this and trigger a new test run in #1156.

jmchilton added a commit that referenced this pull request Mar 21, 2016
@jmchilton jmchilton merged commit 5ead8a9 into galaxyproject:dev Mar 21, 2016
@bgruening bgruening deleted the chemical_datatypes branch March 21, 2016 15:50
@bgruening
Copy link
Member Author

Thanks for the merge and the review and fixing and for everything! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants