Reproduce the following reinforcement learning methods:

- Nature-DQN in: Human-level Control Through Deep Reinforcement Learning
- Double-DQN in: Deep Reinforcement Learning with Double Q-learning
- Dueling-DQN in: Dueling Network Architectures for Deep Reinforcement Learning
- A3C in: Asynchronous Methods for Deep Reinforcement Learning. (I used a modified version, which I call "Batch-A3C", where each training batch contains transitions from different simulators; see the sketch below.)
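As a rough illustration of the Batch-A3C modification, here is a minimal sketch of how one training batch can be assembled from several simulators. Everything in it (the `DummySimulator` class, `collect_batch`, the random policy) is an illustrative assumption, not code from this repository:

```python
import random

# Sketch of the "Batch-A3C" idea: instead of one simulator per worker
# (as in vanilla A3C), every training batch mixes transitions from many
# simulators, so the network does one large forward/backward pass per
# update instead of many small per-thread ones.

class DummySimulator:
    """Stand-in for one game instance; rewards are random for illustration."""
    def __init__(self):
        self.state = 0

    def current_state(self):
        return self.state

    def step(self, action):
        self.state += 1
        reward = random.random()
        done = self.state >= 100
        return self.state, reward, done

    def reset(self):
        self.state = 0

def collect_batch(simulators, policy, batch_size):
    """Round-robin over simulators until one batch of transitions is full."""
    batch = []
    while len(batch) < batch_size:
        for sim in simulators:
            s = sim.current_state()
            a = policy(s)                         # sample an action from the policy
            s_next, r, done = sim.step(a)
            batch.append((s, a, r, s_next, done))
            if done:
                sim.reset()
    return batch[:batch_size]

# Usage: many cheap simulators feed one learner.
sims = [DummySimulator() for _ in range(16)]
batch = collect_batch(sims, policy=lambda s: random.randrange(4), batch_size=128)
```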
The performance claimed in the papers can be reproduced, on the several games I have tested.
DQN typically took 1 day of training to reach a score of 400 on Breakout (the same as reported in the paper). My Batch-A3C implementation took less than 2 hours. Both were trained on one GPU, with an extra GPU for simulation.
Double-DQN runs at 18 batches/s on a TitanX; at a batch size of 64 that amounts to 1152 frames/s. Note that I wasn't using the network architecture in the paper; switching to that network could make it run about 2x faster.
Download an Atari ROM to `$TENSORPACK_DATASET/atari_rom/` (defaults to `~/tensorpack_data/atari_rom/`), e.g.:

```bash
mkdir -p ~/tensorpack_data/atari_rom
wget https://github.com/openai/atari-py/raw/master/atari_py/atari_roms/breakout.bin -O ~/tensorpack_data/atari_rom/breakout.bin
```
Start Training:

```bash
./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.
```
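For reference, the algorithms selectable via `--algo` differ mainly in how the one-step training target is computed. Below is a minimal NumPy sketch of the Nature-DQN and Double-DQN targets; the function and array names (`q_online_next`, `q_target_next`) are assumptions for illustration, not this repo's actual code:

```python
import numpy as np

# Both functions assume `q_online_next` / `q_target_next` are (batch, n_actions)
# arrays of Q-values for the next states, and `done` is a 0/1 float array.

def nature_dqn_target(reward, done, q_target_next, gamma=0.99):
    # Nature-DQN: the target network both selects and evaluates the action.
    return reward + gamma * (1.0 - done) * q_target_next.max(axis=1)

def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    # Double-DQN: the online network selects the action, the target network
    # evaluates it, which reduces the overestimation bias of Nature-DQN.
    best = q_online_next.argmax(axis=1)
    chosen = q_target_next[np.arange(len(best)), best]
    return reward + gamma * (1.0 - done) * chosen
```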
Watch the agent play:

```bash
./DQN.py --rom breakout.bin --task play --load trained.model
```
A pretrained model on Breakout can be downloaded here.
A3C code and models for Atari games in OpenAI Gym are released in `examples/A3C-Gym`.