-
Notifications
You must be signed in to change notification settings - Fork 441
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update STAR version + new options for STARsolo #5060
Update STAR version + new options for STARsolo #5060
Conversation
@mtekman , @wm75, @JasperO98 , @pavanvidem, @ieguinoa |
Just to be clear:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good on my side
@lldelisle I can look at it in the evening. I have the following features/params/UI rearrangements that I have in my TODO for STARSolo.
this can be fixed with adding
Do you think is it reasonable to add (some of) them to this PR? |
@pavanvidem sure. |
I just realize that the Chemistry parameter propose "Cell Ranger v2" while Cell Ranger is the name of the software, for me it should be "Chromium chemistry v2"... |
I will work on it. If you wish please contribute :) |
Good cath, it is correct. |
@pavanvidem , here is what I have in mind, tell me if this would work for you:
|
FYI, I just started working on the above options sequentially. |
Perfect, tell me if you need help. |
make `--soloCBwhitelist` optional and add limits macro for Smart-Seq
Would it be possible to include
Maybe also check if those two old issues are still relevant: |
Good catch @bernt-matthias , thanks. Can you target your new PR to my branch? |
last week I created a patch on your fork: lldelisle#2 |
I just pushed 033cd4a .. hape that is fine |
plus: - removed a few redundant name attribs - increases profile
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@lldelisle Fantastic! I have a final comment on cell filtering. Does it make sense to allow droplet-based filtering for Smart-Seq? Technically it is possible. But do people use this option? Maybe I'm trying to optimize too much. I'm already very happy with the current state of the wrapper and it can be merged. Thanks a lot for your efforts!!
Maybe you are right, it does not make a lot of sense but then this suppose to put the conditional block |
As I said, it is all good and It does not break anything. Ready to merge from my side. |
@bgruening I think @pavanvidem is not a 'member' and his approval does not allow me to merge the PR. Could you approve it or you prefer a second review? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a bit of (important) housekeeping.
For everything else I trust you and other reviewers :)
tools/rgrnastar/macros.xml
Outdated
@@ -5,7 +5,8 @@ | |||
the index versions in sync, but you should manually adjust the +galaxy | |||
version number. --> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tools-iuc/data_managers/data_manager_star_index_builder/data_manager/rna_star_index_builder.xml
Line 1 in 02d1d48
<tool id="rna_star_index_builder_data_manager" name="rnastar index versioned" tool_type="manage_data" version="@IDX_VERSION@" profile="19.05"> |
is what this comment refers to.
This PR should, because of the linked macros file, trigger deployment of a new version of the DM, too, so you need to bump the DM version to version="@IDX_VERSION@+galaxy1"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You mean we should change the version because we changed STAR version or because of something else?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess bumping the IDX_VERSION_SUFFIX
does not hurt, but I do not understand why we should do it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe to indicate that we use the STAR version @TOOL_VERSION@ instead of the STAR version @IDX_VERSION@ to build the index...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bernt-matthias @lldelisle Just to explain things: the reason for symlinking the macros file is to keep the IDX_VERSION in one place only so that when you update the tool wrapper to a STAR version that requires a newer index format, you'd automatically deploy also a DM that can create these indexes.
The "downside" is that any changes to the tool wrapper macros file will silently affect the DM. So in this case the next version of the DM will use the 2.7.10b version of star for building indexes. These should be identical to ones built with older versions, but it's good to bump the DM wrapper version to be able to trace things back.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems legit.
Could a more expressive filter help here. For instance we could just store the star version that was used to create an entry in the datatable ... and then just filter datatable entries for a min (or max) required star version.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think it is easy to add a new column in a table, this suppose to change the table (which happened when we add a new column for the 'genomeVersion')...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, that would require a new table. I'm also not so sure whether that would improve the situation much.
min and max version checks in tool wrappers also need quite some discipline to maintain, and the max check in particular doesn't work backwards, i.e., at the time of writing a tool wrapper version the max value is typically unknown still so there's always at least one wrapper version that will display all newer index versions.
What would be comparably easy to do is to remove the symlink and have the DM use its own macro, which then needs to be maintained separately, but would maybe come with fewer surprises.
Anyway, I don't think this should hold back this PR any longer. If we want to decouple the DM from the tool wrapper, we should do it in its own PR where the decision will be more discoverable than as part of a giant PR like this one.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree.
5af8af0
to
296660c
Compare
Let see if I did not break everything... |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the fixes, looks great to me now!
Youhou! Who wants to click or find something to change? |
Merging is ok with you @bernt-matthias ? |
Sure. Thanks @lldelisle |
🎉 this was a HUGE PR. |
A great collaboration. I love you galaxy people. |
This worth probably a blog post... |
The most important thing is that this was a huge progress. Thanks! I'm using this tool very frequently for 10x and Smart-seq data. I also have some iCell (needs the optional barcode whitelist param), cel-seq and large Smart-seq (needs limit params) data in the queue. I can already tell in the next weeks if we broke something ;-) |
YEAH! 💯 |
Have we fixed all those issues #5060 (comment).. If so we can close them. |
Hmm. Can't find the workflow run for the merge. Can someone double check? |
At least the updates did not arrive at the TS yet. |
Arg, the commit message contained |
I take care. Thanks @bernt-matthias |
Sorry I just found a bug see #5144 |
In fact 2 bugs... 😔 I PR a fix in #5145 |
FOR CONTRIBUTOR: