-
Notifications
You must be signed in to change notification settings - Fork 605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DRAGEN: use sample name cleaning patterns instead of hardcoded ones #1994
Conversation
…Small refactoring Fixes #1865
Suggestion is to have a top level config attribute called So something like: dragen_config:
filename_suffixes:
- _tumor |
Actually, there is no need to have So removing merging the list of DRAGEN cleaning patterns with the main least, and removing |
@ewels, good to take another look now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome, thanks 👌🏻 It was a good discussion earlier, I'm still a little on the fence! Let's keep it in the back of our minds when we do some of the heavier refactoring in v2. Happy to reconsider the best approach.
I think there are a couple which are maybe a bit risky, in that I could imagine them legitimately coming up in some sample names (.time
, .gc
, .vc
) - would it work to shift them to the other config where they're only removed from the end of sample names and not the middle onwards? I've done that to compromise with other similarly risky ones in the past.
Add
fn_clean_exts
for DRAGEN sample name cleaning, instead of hardcoding patterns into the code.Fixes #1865