You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In train.py, I see a central agent,SL agent and RL agents. They are running in different CPU cores with multiprocessing package. And RL agents get the weights of policy and value network from central agent with a Queue. I see train_a3c.py is very similar to train.py. I wonder if these two files are both implementations of A3C algorithm?
The text was updated successfully, but these errors were encountered:
In train.py, I see a central agent,SL agent and RL agents. They are running in different CPU cores with multiprocessing package. And RL agents get the weights of policy and value network from central agent with a Queue. I see train_a3c.py is very similar to train.py. I wonder if these two files are both implementations of A3C algorithm?
The text was updated successfully, but these errors were encountered: