Skip to content

Latest commit

 

History

History
 
 

DeepQNetwork

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 

breakout

video demo

Reproduce the following reinforcement learning methods:

Claimed performance in the paper can be reproduced, on several games I've tested with.

DQN

DQN typically took 1 day of training to reach a score of 400 on breakout game (same as the paper). My Batch-A3C implementation only took <2 hours. Both were trained on one GPU with an extra GPU for simulation.

Double-DQN runs at 18 batches/s (1152 frames/s) on TitanX. Note that I wasn't using the network architecture in the paper. If switched to the network in the paper it could run 2x faster.

How to use

Download an atari rom to $TENSORPACK_DATASET/atari_rom/ (defaults to ~/tensorpack_data/atari_rom/), e.g.:

mkdir -p ~/tensorpack_data/atari_rom
wget https://github.com/openai/atari-py/raw/master/atari_py/atari_roms/breakout.bin -O ~/tensorpack_data/atari_rom/breakout.bin

Start Training:

./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.

Watch the agent play:

./DQN.py --rom breakout.bin --task play --load trained.model

A pretrained model on breakout can be downloaded here.

A3C code and models for Atari games in OpenAI Gym are released in examples/A3C-Gym