Rerunning regression test by submitting job_card in the run directory fails #2247
Comments
@DusanJovic-NOAA looks like I made a change from $PWD/modulefiles to @[PWD]/modulefiles, which should not have happened. If you change fv3_slurm.IN_hera to use $ instead of @[] on line 20, I think that would solve this issue. I also have to fix an issue with using -l and -n together, and I will try to bring this fix into that change.
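To illustrate why the two spellings behave differently after a move, here is a minimal sketch. It assumes the @[PWD] placeholder is filled in with the directory where the job card was generated; the sed call is only a stand-in for the real at-parse expansion, and the directory and file names are made up for the example.

```bash
# Hypothetical directories standing in for the original and the relocated run directory.
work=$(mktemp -d)
orig_dir="${work}/rt_orig/control_p8_intel"
new_dir="${work}/noscrub/control_p8_intel"
mkdir -p "${orig_dir}" "${work}/noscrub"

# Generate two one-line "job cards" inside the original run directory.
# The sed substitution is an assumed stand-in for at-parse filling in @[PWD].
cd "${orig_dir}"
printf '%s\n' 'echo @[PWD]/modulefiles' | sed "s|@\[PWD\]|${PWD}|g" > card_atparse.sh
printf '%s\n' 'echo $PWD/modulefiles' > card_shell.sh

# Relocate the run directory and run both cards from the new location.
cd "${work}"
mv "${orig_dir}" "${new_dir}"
cd "${new_dir}"
atparse_path=$(bash card_atparse.sh)   # path was frozen when the card was generated
shell_path=$(bash card_shell.sh)       # path is evaluated when the card runs
echo "at-parse: ${atparse_path}"
echo "shell:    ${shell_path}"
```

In this sketch the at-parse version keeps pointing at the original directory, while the $PWD version follows the run directory to its new location.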
Ok, that seems to work. Why is MACHINE_ID exported? Previously it wasn't, and it isn't in the other job card templates. Also, the srun line now looks like:
Why is the number of tasks (-n option) a quoted string? It should be an integer. In the job card template, this line:
has been changed to:
Why did you quote all these at-parse placeholders? This was done only in the hera templates; in all other templates they are as before.
Actually, thanks for bringing that up. You're right, the quoted strings using the at-parse variables do not need the quotes. The hera job card is a test for switching all of those fv3/compile IN files to bash. Having one in development helps me see what mistakes this might introduce. Bash likes quoted variables, but with at-parse the quotes are not needed, as bash doesn't see the placeholder as a variable. I'll get those quotes removed in that branch.
Bash likes variables that are not used directly in the script to be exported to make sure they are used externally properly. If the export is meaningless and the variable is not used in the script, then it should be removed.
Unrelated to the job card template issues, but now by default
to:
broke that. Please remove that. And again, the -j make option accepts integers, not quoted strings (BUILD_JOBS).
But in this case MACHINE_ID is used in this script: it is used by module-setup.sh, which is sourced. So MACHINE_ID does not have to be exported. Please remove the export. After all, it isn't exported in any other job card template, and everything works without any issues.
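The point about sourcing can be checked with a small sketch. The helper file below is a hypothetical stand-in for module-setup.sh (it only reports what it sees), not the real script.

```bash
work=$(mktemp -d)
# Hypothetical stand-in for module-setup.sh: prints MACHINE_ID, or "unset" if it is not visible.
printf '%s\n' 'echo "${MACHINE_ID:-unset}"' > "${work}/show_machine_id.sh"

unset MACHINE_ID    # make sure nothing is inherited from the environment
MACHINE_ID=hera     # assigned, but deliberately not exported

# Sourcing runs the file in the current shell, so the plain variable is visible.
sourced=$(. "${work}/show_machine_id.sh")
# A child process only receives exported variables, so the plain variable is not visible.
child=$(bash "${work}/show_machine_id.sh")
echo "sourced: ${sourced}"
echo "child:   ${child}"
```

So an export would only matter if the variable had to reach a separately executed program rather than a sourced file.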
I've removed the quoted at-parse variables.
Then why is MACHINE_ID not exported in other job card templates?
I haven't converted those to shellcheck, yet. Hera was a test to see if it had issues with the move from /bin/sh to /bin/bash and the changes that shellcheck wanted. I tested as much as I could on Hera but was not sure with using the at-parse if it would have any negative impacts for other projects. With Hera getting the most usage externally, it seemed a great place to introduce the change. The intent is to convert them all over if Hera does not see major issues.
These were fixed in #2241
Description
During development and debugging I often need to rerun the same test over and over. The simplest way of doing this is to run the desired regression test once, save the run directory, move that directory somewhere it will not be purged (a noscrub directory, for example) and then submit the job card from that run directory. This workflow is currently broken after the last PR that cleaned up rt.sh.
To Reproduce:
./rt.sh -n 'control_p8 intel' -k
mv ../rt_xxxxxxxx/control_p8_intel /new/location/control_p8_intel
cd /new/location/control_p8_intel
sbatch job_card
This will first fail with the error:
MACHINE_ID: unbound variable
This happens because MACHINE_ID=hera is commented out in the job_card:
https://github.com/ufs-community/ufs-weather-model/blob/develop/tests/fv3_conf/fv3_slurm.IN_hera#L17-L23
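The error message is the one bash prints when an undefined variable is expanded while the nounset option is active. The sketch below reproduces it in isolation; it assumes the job card runs with `set -u` (or an equivalent), which the issue does not show, and the one-line scripts are only illustrations.

```bash
# Expand MACHINE_ID under `set -u` with no assignment; bash aborts at the expansion.
unset_msg=$(env -u MACHINE_ID bash -c 'set -u; echo "machine: ${MACHINE_ID}"' 2>&1)
unset_status=$?

# Same script with the assignment restored (the "uncommented" case).
set_msg=$(env -u MACHINE_ID bash -c 'set -u; MACHINE_ID=hera; echo "machine: ${MACHINE_ID}"' 2>&1)

echo "status=${unset_status} message=${unset_msg}"
echo "${set_msg}"
```

The first command exits with a non-zero status and an "unbound variable" message, and the second runs normally once the assignment is present.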
After uncommenting that line and resubmitting job card it fails again with:
This error happens because in the job_card the module use location is hard-coded to the original run directory:
I am not submitting this job from that original location; I moved the entire run directory to a different location. I see these job_card template changes are made only on Hera; all other job cards are untouched. Why?
Additional context
Output