-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Subsetting the database when using poppunk_visualise --cytoscape with --include-files #196
Comments
This should be done by this bit of code which masks the not-included nodes: https://github.com/johnlees/PopPUNK/blob/46aff5d5715b26a7582c733d6956cb4c78748a99/PopPUNK/plot.py#L488-L495 But maybe when we print the graphml it prints the masked nodes too? @nickjcroucher do you remember if this is the case? |
All gets saved with the same function - masking can be a little tricky in graph-tool. Will take a look. |
Also @muppi1993 highlighted that running with just |
I think we just need a
Agree, will also add in 2.5.0 |
poppunk_visualise --ref-db poppunk_clusters --output cytoscape_5 --cytoscape --network-file /poppunk_clusters/poppunk_clusters_refs_graph.gt --include-files strains.csv --external-clustering meta.csv I have the same issue when running this code -- several strains from the reference database are still included in the final cytoscape output even though it should just include the strains from the query/strain list |
Versions
poppunk 2.4.0
poppunk_sketch 1.7.4
Command used and output returned
Output: one .graphml and two .csv files
Describe the bug
I tried to only include a subset of the dataset in the output with --include-files, which worked fine for the --microreact output. However, the .graphml network contains all isolates from the database rather than just those listed in gps_cluster3_list.txt.
The text was updated successfully, but these errors were encountered: