train.py is an implementation of A3C or A2C ? #6

wanziyu · 2022-06-05T07:49:50Z

In train.py, I see a central agent，SL agent and RL agents. They are running in different CPU cores with multiprocessing package. And RL agents get the weights of policy and value network from central agent with a Queue. I see train_a3c.py is very similar to train.py. I wonder if these two files are both implementations of A3C algorithm?

pengyanghua · 2022-06-05T22:04:33Z

@wanziyu I think both are A3C algorithm. You can run "diff train.py train_a3c.py" to see the differences.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train.py is an implementation of A3C or A2C ? #6

train.py is an implementation of A3C or A2C ? #6

wanziyu commented Jun 5, 2022 •

edited

Loading

pengyanghua commented Jun 5, 2022

train.py is an implementation of A3C or A2C ? #6

train.py is an implementation of A3C or A2C ? #6

Comments

wanziyu commented Jun 5, 2022 • edited Loading

pengyanghua commented Jun 5, 2022

wanziyu commented Jun 5, 2022 •

edited

Loading