-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to repro dreamer_v3_100k_ms_pacman experiment reward results #228
Comments
Hi @brodequin-slaps, thanks for reporting this. PS. We run that experiment with cc: @belerico |
Could it be the seed? I've also seen a large variance in Hafner results' on some environments given different seeds. We can maybe try to run another experiment with a different seed? |
Also, which SheepRL version or commit are you using? |
Hi @belerico , Using commit Since using the same seed as upstream, it would make sense to me that the results obtained would match those advertised (especially if ran with Will also try different seeds after that |
Sure, I try to run other experiments. Thanks |
(.venv) sam@sam:~/dev/ml/sheeprl$ python -c "import torch; print(torch.__version__)"
2.2.1+cu121 # Torch
(.venv) sam@sam:~/dev/ml/sheeprl$ nvidia-smi
Wed Mar 6 08:36:26 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.104.07 Driver Version: 537.34 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA GeForce GTX 1080 On | 00000000:02:00.0 On | N/A |
| 49% 64C P0 152W / 200W | 7696MiB / 8192MiB | 100% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+ |
Hi @brodequin-slaps, can you try out this branch? python sheeprl.py \
exp=dreamer_v3_100k_ms_pacman \
fabric.devices=1 \
fabric.accelerator=cuda \
torch_use_deterministic_algorithms=True \
torch_backends_cudnn_benchmark=False \
torch_backends_cudnn_deterministic=True \
cublas_workspace_config=":4096:8" where While I haven't tried specifically with Dreamer-V3, I've run some simple and fast expriments with PPO: where:
P.S. the script has been run with:
|
Hi @belerico , Tried the However, they don't correspond to your experiments, maybe something's different in our setup |
Hi @brodequin-slaps, I don't think that one can achieve the perfect determinism on completely different hardware: https://discuss.pytorch.org/t/how-to-get-determistic-behavior-with-different-gpus/125640 |
Hi,
Ran the default main branch on the dreamer_v3_100k_ms_pacman experiment (seed 5), but could not repro the rewards advertised
Advertised curve:
When I run it locally with defaulted everything:
Wondering what could explain the difference?
Edit: Found out about deterministic mode which is disabled by default. Will update with the deterministic run results once finished
Edit: Finished run:
The text was updated successfully, but these errors were encountered: