
Smol GPT

Multimodal instruction-following model for text generation that runs on your CPU. Less than 14 GB of RAM required.

Disclaimer

The model hallucinates; this is meant for fun. We do not recommend using it in production unless you know what you are doing.

The implementation may contain bugs, and the int4 quantization is not optimal; this may lead to worse performance than the original model.
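For intuition, here is a minimal absmax round-to-nearest int4 quantizer in NumPy. This is an illustrative sketch, not the quantization code this repo actually uses, and the function names are made up for the example.

    import numpy as np

    def quantize_int4(w):
        # Symmetric absmax scaling into the signed int4 range [-8, 7].
        scale = np.abs(w).max() / 7.0
        q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
        return q, scale

    def dequantize_int4(q, scale):
        return q.astype(np.float32) * scale

    w = np.random.randn(256, 256).astype(np.float32)
    q, scale = quantize_int4(w)
    err = np.abs(w - dequantize_int4(q, scale)).max()
    print(f"max abs reconstruction error: {err:.4f}")

A single per-tensor scale like this is what makes naive int4 lossy: one outlier weight stretches the scale and coarsens everything else, which is part of why smarter schemes (e.g. GPTQ, listed under Features below) help.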

Usage

  1. git clone https://github.com/nolanoOrg/smol-gpt && cd smol-gpt
  2. pip install -r requirements.txt
  3. cd cpp && make
  4. cd ..
  5. python3 app.py (may take a few minutes to download and load the model)
  6. Open http://127.0.0.1:4241/ in your browser.

Contributing (Possible future directions)

Contributions are welcome. Please open an issue or a PR. New features will be community driven. The following features could be added to the model with relatively little effort:

Features

  • Chat/conversation mode: supported by the model, but not yet by the app (see the sketch after this list).
  • Increase the input/output length.
  • GPTQ quantization.
  • Interesting prompts.
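One way to add chat mode is to keep a running history and fold it into each prompt. This is a hypothetical sketch: `generate` stands in for the app's text-generation call and is not the repo's actual API.

    # Hypothetical sketch of chat mode as prompt concatenation.
    # `generate` stands in for the app's text-generation call.
    def chat_turn(generate, history, user_msg):
        prompt = ""
        for user, assistant in history:
            prompt += f"User: {user}\nAssistant: {assistant}\n"
        prompt += f"User: {user_msg}\nAssistant:"
        reply = generate(prompt)
        history.append((user_msg, reply))
        return reply

    history = []
    # chat_turn(model_generate_fn, history, "What is in this image?")

Since the context window is limited, a real implementation would also truncate or summarize old turns once the prompt grows too long.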

Performance (Speed and Memory)

  • Reduce RAM usage by 4x (down to ~4 GB):
    • The current Flask implementation loads the BERT and CLIP models twice for some reason.
    • Offload the T5 encoder after computing the hidden representations (see the sketch after this list).
    • Shift the vision and BERT models to int4/int8 and offload them after use.
  • Speed up inference by 4x:
    • mmap-based loading speedup.
  • Support Smoller GPT for running multimodal models in 4 GB of RAM.
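The encoder-offload idea, sketched with the Hugging Face Flan-T5 reference model rather than this repo's quantized C++ path: run the encoder once, keep only its hidden states, and release the encoder weights before decoding.

    import gc
    import torch
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tok = AutoTokenizer.from_pretrained("google/flan-t5-base")
    model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-base")

    inputs = tok("Summarize: smol models can be mighty.", return_tensors="pt")
    with torch.no_grad():
        enc_out = model.encoder(input_ids=inputs.input_ids,
                                attention_mask=inputs.attention_mask)

    # The decoder only needs the cached encoder hidden states, so the
    # encoder weights can be dropped before generation.
    model.encoder = None
    gc.collect()

    out = model.generate(encoder_outputs=enc_out,
                         attention_mask=inputs.attention_mask,
                         max_new_tokens=32)
    print(tok.decode(out[0], skip_special_tokens=True))

The same pattern would apply to the vision and BERT stages: each is used once per request, so none of them needs to stay resident while the decoder runs.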

Unknowns about the model

  • Performance on multiple collated images.
  • Whether coupling it with OCR would let it reason about text in images.

License

MIT

Misc

Following BLIP-2, the model uses CLIP and BERT, with Flan-T5 for instruction following. A sketch of this pipeline is given below.
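For reference, here is the same architecture run in full precision with the Hugging Face BLIP-2 implementation and the public Salesforce Flan-T5 checkpoint. This is the reference stack, not this repo's quantized CPU code.

    import requests
    import torch
    from PIL import Image
    from transformers import Blip2Processor, Blip2ForConditionalGeneration

    # Public BLIP-2 + Flan-T5 checkpoint (reference implementation).
    processor = Blip2Processor.from_pretrained("Salesforce/blip2-flan-t5-xl")
    model = Blip2ForConditionalGeneration.from_pretrained(
        "Salesforce/blip2-flan-t5-xl")

    url = "http://images.cocodataset.org/val2017/000000039769.jpg"
    image = Image.open(requests.get(url, stream=True).raw)

    inputs = processor(images=image,
                       text="Question: what is in the picture? Answer:",
                       return_tensors="pt")
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=20)
    print(processor.decode(out[0], skip_special_tokens=True))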
