SAC/TD3 issue #64
dhruvkm2402
started this conversation in
General
Replies: 2 comments
-
This looks similar to my problem, you can take a look at my question, it may be helpful for you. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Hi @dhruvkm2402 and @HumbleLee Some practices that can be used to control the limits of the actions taken by the policy/actor may include:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @Toni-SM ,
I was trying to analyze TD3 and SAC performance for my scenario in Omniverse ISAAC gym. But with TD3 and SAC, after a few steps it outputs only one action which is 1. Are there some additional changes I need to make? PPO works as expected depending on the reward formulation.
I'll share the code that I referenced from Multi-Agent example.
Beta Was this translation helpful? Give feedback.
All reactions