
drude gpu sample fails CI #3104

Closed
jngrad opened this issue Aug 25, 2019 · 4 comments

jngrad (Member) commented Aug 25, 2019

Two samples have been randomly failing tests recently:

  • Grand Canonical has large deviations from target concentration (for the last 2 weeks)
  • Drude in BMIM PF6 has broken bonds (for the last 2 days)
jngrad added the DevOps label Aug 25, 2019
bors bot added a commit that referenced this issue Aug 30, 2019
3107: Fix failing grand_canonical sample test and add documentation to samples r=jngrad a=jonaslandsgesell

This PR adds documentation to the sample files:

* widom_insertion.py
* wang_landau_reaction_ensemble.py
* grand_canonical.py
* reaction_ensemble.py

The PR also partly fixes #3104: the problem with the failing test was that the supplied excess chemical potential did not match the target concentration (illustrated below).
I now provide a matching pair of concentration and excess chemical potential.

Co-authored-by: Jonas Landsgesell <[email protected]>
Co-authored-by: Jean-Noël Grad <[email protected]>
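
For context, a minimal sketch of the consistency requirement behind this fix (plain Python, not taken from grand_canonical.py; all numerical values are placeholders): the excess chemical potential fed to the grand canonical scheme has to be the one measured at the target concentration, otherwise the run equilibrates at whatever concentration actually corresponds to the supplied value.

# Minimal sketch, not the sample code: why concentration and excess chemical
# potential must be supplied as a matching pair. All values are placeholders.
import numpy as np

kT = 1.0      # reduced temperature (assumption)
c_ref = 1.0   # reference concentration of the standard state (assumption)

def total_mu(c, mu_ex):
    """Ideal-gas part plus excess part: mu = kT*ln(c/c_ref) + mu_ex."""
    return kT * np.log(c / c_ref) + mu_ex

# mu_ex measured (e.g. by Widom insertion) at concentration c_meas:
c_meas, mu_ex = 0.05, -0.3   # placeholder pair
c_target = 0.10              # concentration the sample test asks for

# If c_target != c_meas, the pair is inconsistent and the grand canonical run
# drifts away from c_target -- the symptom seen in the failing sample test.
print(total_mu(c_meas, mu_ex), total_mu(c_target, mu_ex))
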
jngrad (Member, Author) commented Sep 5, 2019

jngrad changed the title from "drude and grand canonical samples fail CI" to "drude sample fails CI" on Oct 23, 2019
KaiSzuttor changed the title from "drude sample fails CI" to "drude gpu sample fails CI" on Dec 9, 2019
jngrad (Member, Author) commented Dec 10, 2019

Finally found the source of the error: bad P3M parameters from the tuning function. There seems to be a pattern in the P3M parameters of simulations that crash: their r_cut value is roughly half of the value obtained in simulations that run fine. To reproduce it locally or on a coyote with ubuntu-python3:cuda-10.1:

make local_samples
cd testsuite/scripts/samples
sed -i  "/system.actors.add(p3m)/i p3m._params = {'cao': 7, 'inter': 32768, 'r_cut': 2.5491526892051883, 'alpha': 1.286486160729783, 'accuracy': 0.0009884728050820963, 'mesh': [120, 120, 120], 'epsilon': 0.0, 'mesh_off': [0.5, 0.5, 0.5], 'tune': True, 'check_neutrality': True, 'prefactor': 1389.3612645, 'alpha_L': 47.67025070635568, 'r_cut_iL': 0.06879447050636864, 'cao_cut': [0.0, 0.0, 0.0], 'a': [0.0, 0.0, 0.0], 'ai': [0.0, 0.0, 0.0], 'inter2': 0, 'cao3': 0, 'additional_mesh': [0.0, 0.0, 0.0]}" local_samples/drude_bmimpf6.py
rm -f local_samples/drude_bmimpf6_gpu_processed.py; ../../../pypresso test_drude_bmimpf6_with_gpu.py

The LJ sigmas are in the range 3.4-5.0. In simulations that don't crash, the tuned P3M r_cut is in the range 3.3-5.1, while in crashed simulations it is in the range 2.5-2.8. If this is the real cause, we could add a lower bound on r_cut in the parameters of the tuning function.
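
A possible shape of that workaround, as a rough sketch (it assumes the P3M actor keeps explicitly supplied parameters fixed during tuning; the toy system below is not the BMIM PF6 sample):

# Rough sketch of pinning r_cut when creating the P3M actor, so the tuner
# cannot pick a real-space cutoff below the largest LJ sigma of the mixture.
# Toy system for illustration only; the real setup lives in drude_bmimpf6.py.
import espressomd
from espressomd import electrostatics

system = espressomd.System(box_l=[40.0, 40.0, 40.0])
system.time_step = 0.01
system.cell_system.skin = 0.4
system.part.add(pos=[1.0, 1.0, 1.0], q=1.0)
system.part.add(pos=[20.0, 20.0, 20.0], q=-1.0)

sigma_max = 5.0  # largest LJ sigma in the sample (sigmas range from 3.4 to 5.0)

# Parameters passed explicitly are, to my understanding, not touched by the
# tuner, so only mesh, cao and alpha get tuned while r_cut stays at sigma_max.
p3m = electrostatics.P3M(prefactor=1389.3612645,
                         accuracy=1e-3,
                         r_cut=sigma_max)
system.actors.add(p3m)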

Note: checkpointing the particle positions/forces/velocities obtained from a badly tuned run (i.e., one that eventually crashed) into a simulation with good P3M parameters does not lead to a crash.
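
In case someone wants to repeat that transplant test, here is one way to do it with plain numpy instead of the checkpointing module (the .pos/.v/.f slice attributes follow the 4.x particle interface and should be double-checked):

# Sketch of the transplant test: dump the particle state from the run with the
# bad P3M tuning, then load it into a fresh setup that uses good parameters.
import numpy as np

def dump_particle_state(system, path="state_bad_tuning.npz"):
    # Called right after the (bad) tuning in the first run.
    np.savez(path,
             pos=np.copy(system.part[:].pos),
             v=np.copy(system.part[:].v),
             f=np.copy(system.part[:].f))

def load_particle_state(system, path="state_bad_tuning.npz"):
    # Called in the second run, after setting up good P3M parameters.
    state = np.load(path)
    system.part[:].pos = state["pos"]
    system.part[:].v = state["v"]
    system.part[:].f = state["f"]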

fweik (Contributor) commented Apr 10, 2020

Do we have any theory why a low P3M r_cut leads to crashes? That does not make sense to me.

KaiSzuttor (Member) commented

Closing in favor of #3842.
