parallel runs failed using execute from PsN (but successful using nmfe75 script) #259

Open
JoannaPeng opened this issue Apr 30, 2024 · 7 comments

Comments

@JoannaPeng

I just installed the NONMEM license and PsN on my new computer (Windows 11 Pro). Parallel runs work when I call nmfe75 directly with the command line below. The pnm file (mpiwini8.pnm) is the one that came with the NONMEM license, without any modifications.

 nmfe75 control5.mod control5.lst -parafile=c:\nm75g64\run\mpiwini8.pnm [nodes]=4

But when I tried to do the parallel run through PsN using the command line below, only 1 process was running, so the run was effectively not parallelized across the 4 nodes. How can I fix this problem?

   execute control5.mod -parafile=c:\nm75g64\run\mpiwini8.pnm -nodes=4    

PsN parallel runs failed

@rikardn
Member

rikardn commented May 2, 2024

This is interesting. I am not sure about what is going on here.

One thing to check is which nmfe command PsN is actually using here. Unfortunately PsN does not store this in any files. Could you add the line print("NMFE COMMAND: ", $command, "\n"); in the file nonmemrun.pm at line 182? The file should be in your PsN lib installation directory. Then try running the execute command above again and report the output.

@JoannaPeng
Author

Here is the output after adding the line to the file nonmemrun.pm at line 182:

image

Would appreciate your input!

@rikardn
Member

rikardn commented May 3, 2024

This seems to be in order. You could try running this exact command

C:\nm75g64/run/nmfe75.bat psn.mod psn.lst -background "-parafile=C:\nm75g64\run\mpiwini8.pnm" "[nodes]=4"

and see if it runs in parallel outside of PsN.
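To see how this command differs from the one typed by hand at the top of the thread, the two argument lists can be compared directly. The following is a minimal Python sketch, not PsN's implementation: `build_nmfe_args` and `extra_options` are hypothetical helpers that only assemble and compare arguments in the shape shown in this thread.

```python
def build_nmfe_args(nmfe, model, listing, parafile, nodes, background=False):
    """Assemble an nmfe argument list in the shape seen in this thread.

    Hypothetical helper for illustration only; it does not call NONMEM or PsN.
    """
    args = [nmfe, model, listing]
    if background:
        args.append("-background")
    args.append(f"-parafile={parafile}")
    args.append(f"[nodes]={nodes}")
    return args


def extra_options(candidate, reference):
    """Return options present in `candidate` but not in `reference`.

    The first three items (executable, model, listing) are skipped, and the
    comparison ignores case because Windows paths are case-insensitive.
    """
    reference_opts = {opt.lower() for opt in reference[3:]}
    return [opt for opt in candidate[3:] if opt.lower() not in reference_opts]


if __name__ == "__main__":
    # Command typed by hand in the first comment.
    manual = build_nmfe_args(
        "nmfe75", "control5.mod", "control5.lst",
        r"c:\nm75g64\run\mpiwini8.pnm", 4,
    )
    # Command reported for PsN in this comment.
    psn = build_nmfe_args(
        r"C:\nm75g64/run/nmfe75.bat", "psn.mod", "psn.lst",
        r"C:\nm75g64\run\mpiwini8.pnm", 4, background=True,
    )
    print("Options only in the PsN command:", extra_options(psn, manual))
```

Apart from the file names and the quoting, which the shell strips before nmfe sees the arguments, the only option that differs between the two commands is `-background`. The sketch says nothing about whether that option matters for the parallel run.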

@JoannaPeng
Author

I ran the following command:

image

The command window showed the following, and there are "worker1", "worker2", and "worker3" folders as part of the output. So it did run in parallel outside of PsN.

image
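The check used here, looking for "worker" folders in the run output, can also be scripted so that the same test is applied to the directory where PsN ran NONMEM. This is a minimal Python sketch under two assumptions: that worker folders are named `worker` followed by a number, as reported above, and that you pass the directory in which nmfe was actually executed. `find_worker_dirs` is a hypothetical helper, not part of PsN or NONMEM.

```python
import re
import sys
from pathlib import Path

# Folder names such as worker1, worker2, ... as reported in this thread.
WORKER_PATTERN = re.compile(r"worker(\d+)")


def find_worker_dirs(run_dir):
    """Return the names of worker folders in `run_dir`, ordered by number."""
    found = []
    for entry in Path(run_dir).iterdir():
        match = WORKER_PATTERN.fullmatch(entry.name)
        if entry.is_dir() and match:
            found.append((int(match.group(1)), entry.name))
    return [name for _, name in sorted(found)]


if __name__ == "__main__":
    # Directory to inspect; defaults to the current directory.
    run_dir = sys.argv[1] if len(sys.argv) > 1 else "."
    workers = find_worker_dirs(run_dir)
    if workers:
        print(f"Found {len(workers)} worker folder(s): {', '.join(workers)}")
    else:
        print("No worker folders found; the run may not have been parallel.")
```

In this thread, the run with `[nodes]=4` outside of PsN produced worker1, worker2, and worker3, so three worker folders is the result to compare against.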

@rikardn
Member

rikardn commented May 6, 2024

Thanks. This means that deeper debugging would be needed. I don't think this can be prioritized for the time being. Just out of curiosity: which MPI library did you install?

@JoannaPeng
Author

Understood. I have Microsoft MPI. Hope that answers your question about the MPI library.

@rikardn
Member

rikardn commented May 8, 2024

Ok, thanks for the feedback. It could be that Microsoft MPI has made it easier to use MPI on Windows and that running PsN with any MPI has never been properly tested before.
