Transformer

This repo contains the Transformer architecture as described in Attention is All You Need. All of the code to run the model is housed in transformer.py.

Requirements

This build assumes Python >= 3. We also want to use the TensorFlow Beta 2.0.0 (2019). Check the TensorFlow Install Instructions for up to date instructions.

# For CPU

pip install --upgrade pip
pip install tensorflow==2.0.0-rc0 tensorflow-datasets

# For GPU

pip install --upgrade pip
pip install tensorflow-gpu==2.0.0-rc0 tensorflow-datasets

If you have Docker installed, use the provided docker image to run your model! The following will build and then run the image. You do not need to rebuild the container every time.

# Build and Run

sh build && sh run

To re-run just use:

sh run

Run

To use this model create a config JSON file like this:

{
  "seed": 420365,
  "checkpoint": false,
  "layers": 1,
  "attn-heads": 16,
  "model-depth": 32,
  "dff": 64,
  "dropout": 0.1,
  "epochs": 300,
  "batch-size": 1
}

// output to config.json

From this point, you can run the following command. This will use your config.json by default. Make sure you set up your data pipeline in transformer.py beforehand.

python transformer.py

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
model		model
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build		build
config.json		config.json
run		run
transformer.py		transformer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformer

Requirements

Run

About

Releases

Packages

Languages

kadengriffith/Transformer

Folders and files

Latest commit

History

Repository files navigation

Transformer

Requirements

Run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages