Candy

Candy: Self-driving in Carla Environment.

What is candy? A model with the structure: Hierarchical Observation -- Plan&Policy -- Hierarchical Actions

We use VAE/GAN/Glow for world representation, and do RL/IL/Planning/MCTS upon it.

Demo

Performance

Car drifting (more to be uploaded):

City navigation:

VAE

Real:

Reconstructed: (With hidden state of size 50, running for 1 hour on a single GTX1080Ti)

Running Candy

(This project is still working in progress.)

Download Carla-0.8.2 from here.

Start CarlaUE4 engine in server mode, using commands from here.

  ./CarlaUE4.sh -windowed -ResX=800 -ResY=600 -carla-server -benchmark -fps=10

Install Carla PythonClient using:
```
  pip install ~/carla/PythonClient
```
Install Openai baselines under instructions here.

Install required packages:

  pip install numpy tensorflow-gpu msgpack msgpack-numpy pyyaml tqdm gym opencv-python scipy pygame pillow

Start the program by running:

  CUDA_VISIBLE_DEVICES=0 python main.py -m Town01 -l

Visualization: After running the following command, open localhost:6006 on the browser.
```
  tensorboard -logdir=./logs
```

Candy Features

Combining imitation learning and reinforcement learning. Candy can learn make its first turn in 40 minutes(Single GTX1080Ti) from scratch (randomize policy network).
VAE unsupervised learning for world model construction.
Persistent training process and flexible architecture.

Todo

Ideal Features

Curiosity-based Attention, Supervised Attention, loop-control-Attention, Interpretable Attention.
VAE + modelbased planning + video prediction + MCTS.
GQN, what-where in any place (Better generalization).
guiding commands following (HRL, Multi-tasking).
From implicit to explicit: Meta-learning, Rule Learning (Experiments from imaginary room).
Stronger world model with enhanced VAE(maybe with attention).

Code Components

main.py: Main file. It Deals with Carla environment.
carla_wrapper.py: Wrap main.py, buffer information for the model.
candy_model.py: Main model file. It is for building the big graph of the model.
modules/* : Building blocks of the model, used in candy_model.py.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
modules		modules
ray_candy		ray_candy
screenshots		screenshots
.gitignore		.gitignore
README.md		README.md
args.yaml		args.yaml
candy_model.py		candy_model.py
carla_wrapper.py		carla_wrapper.py
deamon.sh		deamon.sh
main.py		main.py
persistence_vae_vi.py		persistence_vae_vi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Candy

Demo

Performance

VAE

Running Candy

Candy Features

Todo

Ideal Features

Code Components

About

Releases

Packages

Contributors 2

Languages

createamind/candy

Folders and files

Latest commit

History

Repository files navigation

Candy

Demo

Performance

VAE

Running Candy

Candy Features

Todo

Ideal Features

Code Components

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages