Due to a few requests, I've made a similar model more easily available on Replicate! https://replicate.com/meghabyte/arrival-logograms
Train a text-to-image model to generate Arrival-inspired logograms for novel concepts!
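If you just want to try it without any training, a minimal sketch of querying the hosted model with the `replicate` Python client looks roughly like the following. The input field name (`prompt`) and output format are assumptions on my part, so check the model page for the exact schema, and note that `REPLICATE_API_TOKEN` needs to be set in your environment.

```python
# Rough sketch of querying the hosted model via the Replicate Python client.
# Assumptions: the model accepts a "prompt" input and returns image URL(s);
# see https://replicate.com/meghabyte/arrival-logograms for the actual schema.
# Requires the REPLICATE_API_TOKEN environment variable to be set.
import replicate

output = replicate.run(
    "meghabyte/arrival-logograms",  # you may need to pin a specific version hash
    input={"prompt": "a logogram of resilience"},
)
print(output)
```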
Arrival is one of my favorite science fiction movies, particularly due to its focus on the interactive nature of language. As shown below, the logograms designed for the alien language in the film are beautiful smoke-like circles, and were even released in a separate GitHub repository with additional analysis from Wolfram Research!
For a while, I have wanted a tattoo of one of the logograms, but the officially released logograms mainly cover concepts like *weapon*, *ship grounded*, and *there is no linear time*, which, while relevant for a science fiction film, don't feel particularly personal or compelling for something as permanent as a tattoo. So I decided to fine-tune Stable Diffusion, a text-to-image diffusion model, on the 38 logograms (a small dataset!) released by Wolfram, so that I could generate logograms for novel concepts that I'd be more willing to have permanently inked on me 😛 I ended up getting a tattoo of a logogram generated for the concept *resilience* above my right elbow, and I describe the process below!
Here are the steps to repeat the overall process, which are adapted from here.
- Install HuggingFace `diffusers`, which provides a variety of pretrained vision models, from source:
  `pip install git+https://github.com/huggingface/diffusers`
- Install the `accelerate` library:
  `pip install accelerate`
- Install all dependencies:
  `pip install -U -r requirements.txt`
- Verify all training data is located in the `data/train/` directory. To add more logograms, place the image in this folder and update the file `data/train/metadata.jsonl`, which contains the mapping between each filename and its text caption (see the example entry after this list).
- Run the following command to fine-tune a Stable Diffusion v1-4 model:
accelerate launch train.py \
--pretrained_model_name_or_path="CompVis/stable-diffusion-v1-4" \
--train_data_dir="./data" \
--use_ema \
--resolution=512 --center_crop --random_flip \
--train_batch_size=1 \
--gradient_accumulation_steps=4 \
--gradient_checkpointing \
--mixed_precision="fp16" \
--max_train_steps=50 \
--learning_rate=1e-03 \
--max_grad_norm=1 \
--lr_scheduler="constant" --lr_warmup_steps=0 \
--output_dir="model"
In general, I found training for 30-50 steps with a learning rate of 1e-03 most reasonable. I had hoped training for more steps would make the model generate higher-contrast samples that are even more similar to the training examples (due to overfitting), but training longer led to a lot more complex threads and "splatters" around the circles, creating a messier look. Training for fewer steps leads to simpler patterns, but the background texture is not plain white, as shown next.
After fine-tuning has finished, run `python generate.py` to generate samples. Modify the `PROMPTS` variable to the list of text concepts you want to generate logograms for. The script will generate 20 samples per prompt.
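For reference, the generation step boils down to something like the minimal sketch below, assuming the fine-tuned weights were saved to `model/` by the training command above; the prompt list, sampling defaults, and output file names are placeholders rather than exactly what `generate.py` does.

```python
# Minimal generation sketch with diffusers, assuming fine-tuned weights in ./model.
# Prompt phrasing and file naming are placeholders -- generate.py may differ.
import torch
from diffusers import StableDiffusionPipeline

PROMPTS = ["resilience", "curiosity"]  # concepts to generate logograms for

pipe = StableDiffusionPipeline.from_pretrained("model", torch_dtype=torch.float16).to("cuda")

for prompt in PROMPTS:
    for i in range(20):  # 20 samples per prompt, matching the script's behavior
        image = pipe(prompt).images[0]
        image.save(f"{prompt}_{i}.png")
```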
In general, the generated logograms aren't on a clean white background and have many artifacts, but I think they capture the overall circular shape and "smokiness" aesthetic quite well. I quickly cleaned them up using the Magic Wand tool in Photoshop. Here are the original generated logograms alongside their cleaned versions for different unseen concepts!
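If you'd rather skip the manual Photoshop step, a rough programmatic alternative is to flatten near-white background pixels to pure white; the threshold below is a guess and will need tuning per image, and the file name is a placeholder.

```python
# Rough alternative to the Photoshop cleanup: flatten light background texture
# to pure white. The threshold (200) is a guess and may need tuning per image.
import numpy as np
from PIL import Image

img = np.array(Image.open("resilience_0.png").convert("L"))  # placeholder file name
img[img > 200] = 255
Image.fromarray(img).save("resilience_0_clean.png")
```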
I ended up picking the concept *resilience* for my tattoo, which feels a lot more meaningful and personal than the existing logograms. The tattoo artist said that some of the smoky lines would merge together at the size I wanted for the tattoo, so he drew his own version inspired by the generated one. Here is what it eventually looked like! 😊