Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate if --cores-per-socket usage needs to be added to MATLAB docs #349

Open
b-reyes opened this issue Sep 24, 2024 · 0 comments
Open
Labels
medium-priority Not urgent, but should be addressed

Comments

@b-reyes
Copy link
Contributor

b-reyes commented Sep 24, 2024

We had a user who stated the following:

I was concerned about running some jobs using Matlab on blanca, the same job could take 
30 min on one node and 4h30 on another one and I didn't understand why.

Until I find out that despite submitting jobs by asking --ntasks-per-node=N, matlab was 
starting a parallel pool of n workers only, with n < N, for instance N = 28, n = 16.

And this is actually normal given that Matlab launches a number of workers which equals at
maximum the number of physical cores. 

So I was in this unknown situation when I'm allocated 28 logical cores but can have only 16 
workers, corresponding to the 16 physical cores of the node, e.g. bnode0101, and thus limiting
 the extent of my parallelization in a significant way.

I found out the solution to be allocated the correct number of physical cores I want, by adding
 the option --cores-per-socket 28 in my sbatch command.

I didn't make any test on Alpine yet, but I noticed that some alpine nodes had also a reduced 
number of physical cores.

I think a note on all this would be helpful for other users to add in the document.

We should confirm the behavior they state. If confirmed, we should add this information to our MATLAB page.

@b-reyes b-reyes added the medium-priority Not urgent, but should be addressed label Sep 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
medium-priority Not urgent, but should be addressed
Projects
None yet
Development

No branches or pull requests

1 participant