You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
At the moment (due to the mash interface) samples are referred to by the filename of their assembly, but it would be convenient to be able to give each sample an arbitrary name in the --r-files or --q-files input:
sample1 assemblies/sample1.contigs.fa
Easy enough to update the PopPUNK side, but the mash sketch names are always the file name. Two possible ways of doing this I think:
Create temporary symlinks of the assemblies at their given names. This would best be done inside a tmp directory to avoid any clashes (copying the resulting sketches into the output dir)
Keep a table of file names to sample names as part of the database. Less hacky but more work.
The text was updated successfully, but these errors were encountered:
For a local system, I don’t see a problem with the first option – if anything, have a renamed directory of sequences might be helpful for the user (rather than having to manually look up which FASTA files correspond to which isolates). I don’t know if there’s a loss of flexibility on multi-user systems, or if there are any special characters mash won’t accept as inputs?
From: John Lees <[email protected]<mailto:[email protected]>>
Reply-To: johnlees/PopPUNK <[email protected]<mailto:[email protected]>>
Date: Thursday, 28 February 2019 at 20:14
To: johnlees/PopPUNK <[email protected]<mailto:[email protected]>>
Cc: Subscribed <[email protected]<mailto:[email protected]>>
Subject: [johnlees/PopPUNK] Allow assemblies to have separately defined names (#43)
At the moment (due to the mash interface) samples are referred to by the filename of their assembly, but it would be convenient to be able to give each sample an arbitrary name in the --r-files or --q-files input:
sample1 assemblies/sample1.contigs.fa
Easy enough to update the PopPUNK side, but the mash sketch names are always the file name. Two possible ways of doing this I think:
* Create temporary symlinks of the assemblies at their given names. This would best be done inside a tmp directory to avoid any clashes (copying the resulting sketches into the output dir)
* Keep a table of file names to sample names as part of the database. Less hacky but more work.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub<#43>, or mute the thread<https://github.com/notifications/unsubscribe-auth/APz7QyVjAgX7ycJg0JIviFa3IY1u-x0fks5vSDidgaJpZM4bXpbI>.
At the moment (due to the mash interface) samples are referred to by the filename of their assembly, but it would be convenient to be able to give each sample an arbitrary name in the
--r-files
or--q-files
input:Easy enough to update the PopPUNK side, but the mash sketch names are always the file name. Two possible ways of doing this I think:
The text was updated successfully, but these errors were encountered: