Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mem for all tools with cores and rescale some memory based on usegalaxy.org usage #76

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

natefoo
Copy link
Member

@natefoo natefoo commented Nov 4, 2024

Some of this may be controversial so I'd certainly like to hear from other usegalaxy.* admins before this is merged. =)

fastqc is now scalable cores and memory based off input_size - probably I should fix this for compressed data a la my bwa-mem2 rule but perhaps TPV should have an estimated_input_size?

Data I used to make adjustments can be found in this gist. There is also a discussion about how to do a better job of this long term in #75.

Any tool I didn't have data for that has cores but no mem I left a comment on. fastp I don't know what to do with, since its usage is way outside of what I allocate for it, so I might try a more restricted query for that.

Copy link
Collaborator

@mira-miracoli mira-miracoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot! this can save so much resources 🌳 🌍

tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Outdated Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
tools.yml Show resolved Hide resolved
Copy link
Collaborator

@mira-miracoli mira-miracoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please have a look on line 1241 before merging, I think there is a value missing

tools.yml Outdated
@@ -1428,12 +1395,14 @@ tools:
mem: 24
toolshed.g2.bx.psu.edu/repos/iuc/iqtree/iqtree/.*:
cores: 10
mem: 7
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is low setting for mem. Max memory for Galaxy Australia's iqtree jobs is regularly in the 8-19Gb range (19 is the what we give it in TPV)

@natefoo
Copy link
Member Author

natefoo commented Nov 8, 2024

Thanks @mira-miracoli and @cat-bro! I think it should be good to go now.

@natefoo
Copy link
Member Author

natefoo commented Nov 8, 2024

Oh, I guess I need to apply formatting. It removes all the comments though, which IMO can be useful - @nuwang you mentioned making roundtrip the default in galaxyproject/total-perspective-vortex#140, should I do that (or make it the method used by format, at least)? Or do we want to enforce no comments in the shared DB?

Copy link
Member

@bgruening bgruening left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great stuff @natefoo

tools.yml Outdated
toolshed.g2.bx.psu.edu/repos/iuc/fasta_stats/fasta-stats/.*:
mem: 11.4
toolshed.g2.bx.psu.edu/repos/iuc/fastp/fastp/.*:
cores: 4
mem:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

something missing here

toolshed.g2.bx.psu.edu/repos/devteam/fastq_paired_end_interlacer/fastq_paired_end_interlacer/.*:
mem: 30.4
toolshed.g2.bx.psu.edu/repos/devteam/fastq_paired_end_joiner/fastq_paired_end_joiner/.*:
mem: 11.4
toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/.*:
cores: 8
cores: min(max(int(input_size * 4), 1), 16)
mem: min(max(int(input_size * 7), 5.6), 64)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I 4GB input file would already get 16 cores? That seems to be a lot. We use 3 cores and never had complains so far.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mainly I was just scaling 4GB/core but it certainly could be overkill. Of course the true question is whether cores scale in an inverse linear relationship with time, which we'd have to do some testing to know. Since I'm happy to allocate more cores if I have to allocate the memory anyway and it gets the job done faster.

tools.yml Outdated
@@ -588,8 +536,8 @@ tools:
cores: 2
mem: 7.6
toolshed.g2.bx.psu.edu/repos/devteam/tophat2/tophat2/.*:
cores: 10
mem: 90
cores: 16
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

half both cores and mem? tophat should not get any priority IMHO

toolshed.g2.bx.psu.edu/repos/iuc/chira_merge/chira_merge/.*:
cores: 1
mem: 60
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mira-miracoli if we don't have this tool in our config, we use the memory from the shared DB isn't it? So we are using 60GB I assume?

@pavanvidem since this is your tool, any recommendation?

@nuwang
Copy link
Member

nuwang commented Nov 9, 2024

Oh, I guess I need to apply formatting. It removes all the comments though, which IMO can be useful - @nuwang you mentioned making roundtrip the default in galaxyproject/total-perspective-vortex#140, should I do that (or make it the method used by format, at least)? Or do we want to enforce no comments in the shared DB?

It would be great if we can preserve comments, at least when formatting I think. However, I just left a comment on that other thread about the impending pydantic transition.

@natefoo
Copy link
Member Author

natefoo commented Dec 4, 2024

@nuwang can we make a new release of TPV? Then a reformat and the noqas should allow this to pass.

@nuwang
Copy link
Member

nuwang commented Dec 17, 2024

@natefoo Just published a release. Sorry about the delay, just returned from vacation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants