Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Models #7

Merged
merged 32 commits into from
May 19, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
24bc461
Update models.json
Smartappli May 19, 2024
f39674a
Update dependabot.yml
Smartappli May 19, 2024
1c1cf43
Create CODEOWNERS
Smartappli May 19, 2024
78e0638
Create FUNDING.yml
Smartappli May 19, 2024
a480bfb
Create release-drafter.yml
Smartappli May 19, 2024
3a76659
Create bug-report.yml
Smartappli May 19, 2024
e12c8d8
Create feature-request.yml
Smartappli May 19, 2024
3445a6e
Create question.yml
Smartappli May 19, 2024
d970140
Create sweep-template.yml
Smartappli May 19, 2024
467fddd
Create black.yml
Smartappli May 19, 2024
f239651
Update model-check.yml
Smartappli May 19, 2024
2003273
Create flake8.yml
Smartappli May 19, 2024
2ccb045
Create ruff.yml
Smartappli May 19, 2024
e78b1c5
Delete .github/workflows/flake8.yml
Smartappli May 19, 2024
d4bd8ca
Update models.json
Smartappli May 19, 2024
ba57b28
:art: Format Python code with psf/black
Smartappli May 19, 2024
5e46137
Merge pull request #6 from Smartappli/black
Smartappli May 19, 2024
8d9e6af
Update README.md
Smartappli May 19, 2024
0db9488
Merge pull request #8 from Smartappli/main
Smartappli May 19, 2024
7a70baf
Update model-check.yml
Smartappli May 19, 2024
5a9bcc1
Create requirements.txt
Smartappli May 19, 2024
b4968ee
Update models.json
Smartappli May 19, 2024
f7d5f2f
Update models.json
Smartappli May 19, 2024
4332fea
Update models.json
Smartappli May 19, 2024
f71cc4a
Update models.json
Smartappli May 19, 2024
6a00d61
Update models.json
Smartappli May 19, 2024
c5dba93
Update models.json
Smartappli May 19, 2024
ad486cb
Update models.json
Smartappli May 19, 2024
14b81aa
Update models.json
Smartappli May 19, 2024
f684c54
Update models.json
Smartappli May 19, 2024
be8fd34
Update models.json
Smartappli May 19, 2024
1673bbe
Update README.md
Smartappli May 19, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .github/CODEOWNERS
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
@Smartappli
1 change: 1 addition & 0 deletions .github/FUNDING.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
github: [Smartappli]
73 changes: 73 additions & 0 deletions .github/ISSUE_TEMPLATE/bug-report.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,73 @@
name: "☢️ Bug Report"
title: "\U0001F41B [Bug]: "
description: Create a report to help us improve Serge.
labels: ["☢️ Bug"]

body:
- type: markdown
attributes:
value: |
Thanks for taking the time to fill out this bug report.

- type: textarea
id: description
attributes:
label: Bug description
description: Please provide a bug description with expected beavior vs. actual behavior.
placeholder: Don't put your logs here!
validations:
required: true

- type: textarea
id: steps
attributes:
label: Steps to reproduce
description: Please provide a step by step guide to recreate the issue.
placeholder: |
Please provide steps to reproduce the behavior.

1. Step 1
2. Step 2
3. Step 3
validations:
required: true

- type: textarea
id: environment
attributes:
label: Environment Information
description: Please provide the following information about your environment.
placeholder: |
Docker version: (run `docker --version`)
OS: (e.g., Windows, macOS, Linux)
OS version: (e.g., Windows 10, macOS 11.4, Ubuntu 20.04)
CPU Model: (e.g., Intel Core i7-9700K, AMD Ryzen 5 3600)
Browser: (if relevant)
validations:
required: true

- type: textarea
id: screenshots
attributes:
label: Screenshots
description: Please provide screenshots if relevant.
validations:
required: false

- type: textarea
id: logs
attributes:
label: Relevant log output
description: Please copy and paste any relevant log output. This will be automatically formatted into code, so no need for backticks.
render: shell

- type: checkboxes
id: confirmations
attributes:
label: Confirmations
description: Please verify you meet those criterias to make it easier to help you.
options:
- label: I'm running the latest version of the main branch.
required: true
- label: I checked existing issues to see if this has already been described.
required: true
37 changes: 37 additions & 0 deletions .github/ISSUE_TEMPLATE/feature-request.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
name: "✏️ Feature Request"
title: "\U0001F680 [Feature]: "
description: Suggest an idea to improve this project.
labels: ["✏️ Feature"]

body:
- type: markdown
id: notice
attributes:
value: |
### Notice
- Don't forget you can ask your questions in our [Discord server](https://discord.gg/62Hc6FEYQH).
- If you think this is just a bug, open the issue with the **☢️ Bug Report** template.
- type: textarea
id: description
attributes:
label: "Feature Description"
description: "A clear and detailed description of the feature you would like to see added."
placeholder: "Explain your feature clearly, and in detail."
validations:
required: true
- type: textarea
id: additional-context
attributes:
label: "Additional Context (optional)"
description: "If you have something else to describe, write them here."
placeholder: "Write here what you can describe differently."
- type: checkboxes
id: terms
attributes:
label: "Checklist:"
description: "By submitting this issue, you confirm that:"
options:
- label: "I have checked for existing issues that describe my suggestion prior to opening this one."
required: true
- label: "I understand that improperly formatted feature requests may be closed without explanation."
required: true
32 changes: 32 additions & 0 deletions .github/ISSUE_TEMPLATE/question.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
name: "🤔 Question"
title: "\U0001F917 [Question]: "
description: Ask a question so we can help you
labels: ["🤔 Question"]

body:
- type: markdown
id: notice
attributes:
value: |
### Notice
- Don't forget you can ask your questions in our [Discord server](https://discord.gg/q2sAWmzF).
- If you think this is just a bug, open the issue with the **☢️ Bug Report** template.
- If you have a suggestion for a Serge feature you would like to see, open the issue with the **✏️ Feature Request** template.
- type: textarea
id: description
attributes:
label: "Question Description"
description: "A clear and detailed description of the question."
placeholder: "Explain your question clearly, and in detail."
validations:
required: true
- type: checkboxes
id: terms
attributes:
label: "Checklist:"
description: "By submitting this issue, you confirm that:"
options:
- label: "I have checked for existing issues that describe my questions prior to opening this one."
required: true
- label: "I understand that improperly formatted questions may be closed without explanation."
required: true
15 changes: 15 additions & 0 deletions .github/ISSUE_TEMPLATE/sweep-template.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
name: Sweep Issue
title: 'Sweep: '
description: For small bugs, features, refactors, and tests to be handled by Sweep, an AI-powered junior developer.
labels: sweep
body:
- type: textarea
id: description
attributes:
label: Details
description: Tell Sweep where and what to edit and provide enough context for a new developer to the codebase
placeholder: |
Unit Tests: Write unit tests for <FILE>. Test each function in the file. Make sure to test edge cases.
Bugs: The bug might be in <FILE>. Here are the logs: ...
Features: the new endpoint should use the ... class from <FILE> because it contains ... logic.
Refactors: We are migrating this function to ... version because ...
17 changes: 15 additions & 2 deletions .github/dependabot.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,5 +3,18 @@ updates:
- package-ecosystem: "github-actions"
directory: "/"
schedule:
# Check for updates to GitHub Actions every week
interval: "weekly"
interval: "daily"
labels:
- "🤖 Dependencies"
- package-ecosystem: "pip"
directory: "/"
schedule:
interval: "daily"
labels:
- "🤖 Dependencies"
- package-ecosystem: "docker"
directory: "/"
schedule:
interval: "daily"
labels:
- "🤖 Dependencies"
72 changes: 72 additions & 0 deletions .github/release-drafter.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,72 @@
name-template: '$RESOLVED_VERSION'
tag-template: '$RESOLVED_VERSION'
categories:
- title: '❗ Breaking Changes:'
labels:
- '❗ Breaking Change'
- title: '🚀 New Features:'
labels:
- '✏️ Feature'
- title: '🐛 Fixes:'
labels:
- '☢️ Bug'
- title: '📚 Documentation:'
labels:
- '📒 Documentation'
- title: '🧹 Updates:'
labels:
- '🧹 Updates'
- title: '🤖 Dependencies:'
labels:
- '🤖 Dependencies'
change-template: '- $TITLE (#$NUMBER)'
change-title-escapes: '\<*_&'
exclude-contributors:
- dependabot
- dependabot[bot]
version-resolver:
major:
labels:
- 'major'
minor:
labels:
- 'minor'
- '❗ Breaking Change'
- '✏️ Feature'
patch:
labels:
- 'patch'
- '📒 Documentation'
- '☢️ Bug'
- '🤖 Dependencies'
- '🧹 Updates'
default: patch
template: |
$CHANGES

## 🐳 Docker Images
- `ghcr.io/$OWNER/$REPOSITORY:latest`
- `ghcr.io/$OWNER/$REPOSITORY:$RESOLVED_VERSION`

**Full Changelog**: https://github.com/$OWNER/$REPOSITORY/compare/$PREVIOUS_TAG...$RESOLVED_VERSION

Thanks to $CONTRIBUTORS for making this release possible.

autolabeler:
- label: '📒 Documentation'
files:
- '*.md'
title:
- '/(docs|doc:|\[doc\]|typos|comment|documentation)/i'
- label: '☢️ Bug'
title:
- '/(fix|bug|missing|correct)/i'
- label: '🧹 Updates'
title:
- '/(improve|update|refactor|deprecated|remove|unused|test)/i'
- label: '🤖 Dependencies'
title:
- '/(bump|dependencies)/i'
- label: '✏️ Feature'
title:
- '/(feature|feat|create|implement)/i'
30 changes: 30 additions & 0 deletions .github/workflows/black.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
name: Black Formatter
on: [push, pull_request]

concurrency:
group: black-${{ github.event_name == 'pull_request' && format('{0}-{1}', github.workflow, github.event.pull_request.number) || github.workflow_ref }}
cancel-in-progress: true

jobs:
linter_name:
name: Black
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Check files using the black formatter
uses: rickstaa/action-black@v1
id: action_black
with:
black_args: "."
- name: Create Pull Request
if: steps.action_black.outputs.is_formatted == 'true'
uses: peter-evans/create-pull-request@v6
with:
token: ${{ secrets.GITHUB_TOKEN }}
title: "Format Python code with psf/black push"
commit-message: ":art: Format Python code with psf/black"
body: |
There appear to be some python formatting errors in ${{ github.sha }}. This pull request
uses the [psf/black](https://github.com/psf/black) formatter to fix these issues.
base: ${{ github.head_ref }} # Creates pull request onto pull request or commit branch
branch: black
8 changes: 5 additions & 3 deletions .github/workflows/model-check.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,10 @@ on:
schedule:
- cron: '0 12 */7 * *'

concurrency:
group: LLM-Checker-${{ github.event_name == 'pull_request' && format('{0}-{1}', github.workflow, github.event.pull_request.number) || github.workflow_ref }}
cancel-in-progress: true

jobs:
model-health-check:
runs-on: ubuntu-latest
Expand All @@ -30,9 +34,7 @@ jobs:
working-directory: ./
run: |
pip install --upgrade pip
pip install pytest
pip install huggingface_hub
pip install pathlib
pip install -r requirements.txt
- name: Run model health check
working-directory: ./
run: |
Expand Down
11 changes: 11 additions & 0 deletions .github/workflows/ruff.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
name: Ruff
on: [push, pull_request]
jobs:
ruff:
name: Ruff
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: chartboost/ruff-action@v1
with:
args: 'check --ignore F401 --fix'
14 changes: 7 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,22 +17,22 @@ Then replace the models.json file located in /usr/src/app/api/src/serge/data/ of

| Directory | Description | Models |
|:------------:|:--------------------------------:|:---------:|
| **All** | All models | AlBfred 40B-1023; Asclepius 13B; BioMistral 7B; Code 13B, 33B; CodeLLaMA 7B, 7B-Instruct, 7B-Python, 13B, 13B-Instruct, 13B-Python, 34B, 34B-Instruct, 34B-Python? 70B, 70B-Instruct, 70B-Python; Falcon 7B, 7B-Instruct, 40B, 40B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; Finance Chat, LLM, LLM-13B; LLaMA 2 7B, 7B-Chat, 7B-Coder, 13B, 13B-Chat, 70B, 70B-Chat, 70B-OASST; LLamA Pro 8B, 8B-Instruct; Med42 70B; Medalpaca 13B; Medicine: Chat, LLM, LLM 13B; Meditron 7B, 7B-Chat, 70B; Mistral 7B-v0.1, 7B-Instruct-v0.2, 7B-OpenOrca; MistralLite 7B; Mixtral 8x7B-v0.1, 8x7B-Dolphin-2.7, 8x7B-Instruct-v0.1, SlimOrca 8x7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Notux 8x7b-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; OpenLLaMA 3B-v2, 7B-v2, 13B-v2; Orca 2 7B, 13B; Phi 2 2.7B; Python Code 13B, 33B; PsyMedRP 13B-v1, 20B-v1; SlimOrca 13B; Sqlcoder 2 15B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0, 1.1B Chat Medical; Vicuna 7B-v1.5, 13B-v1.5, 33B-v1.3, 33B-Coder; Vigogne 2 7B-Chat, 7B-Instruct, 13B-Instruct, 70B-Chat; Wizard Coder-33B-v1.1; LM-7B-v1.0, LM-13B-v1.2, LM-70B-v1.0, Math-7B-v1.1, 13B-v1.0, 70B-v1.0; Zephyr 3B, 7B-Alpha, 7B-Beta |
| **All** | All models | Alfred 40B-1023; Asclepius 13B; BioMistral 7B; Code 13B, 33B; CodeLLaMA 7B, 7B-Instruct, 7B-Python, 13B, 13B-Instruct, 13B-Python, 34B, 34B-Instruct, 34B-Python? 70B, 70B-Instruct, 70B-Python; Falcon 7B, 7B-Instruct, 40B, 40B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; Finance Chat, LLM, LLM-13B; Gorilla 7B-Falcon, 7B, OpenFunctions V1, OpenFunctions V2; LLaMA 2 7B, 7B-Chat, 7B-Coder, 13B, 13B-Chat, 70B, 70B-Chat, 70B-OASST; LLaMA 3 11B-Instruct-v0.1, 13B-Instruct-v0.1, 16B-Instruct-v0.1; LLamA Pro 8B, 8B-Instruct; Med42 70B; Medalpaca 13B; Medicine: Chat, LLM, LLM 13B; Meditron 7B, 7B-Chat, 70B; Meta-Llama 3 8B, 8B-Instruct, 70B, 70B-Instruct; Mistral 7B-v0.1, 7B-Instruct-v0.2, 7B-OpenOrca; MistralLite 7B; Mixtral 8x7B-v0.1, 8x7B-Dolphin-2.7, 8x7B-Instruct-v0.1, SlimOrca 8x7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Notux 8x7b-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; Open Code Interpreter CL-7B, CL-13B, CL-70B, DS-6.7B, DS-33B; OpenLLaMA 3B-v2, 7B-v2, 13B-v2; Orca 2 7B, 13B; Phi 2 2.7B; Python Code 13B, 33B; PsyMedRP 13B-v1, 20B-v1; SlimOrca 13B; Sqlcoder 2 15B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0, 1.1B Chat Medical; Vicuna 7B-v1.5, 13B-v1.5, 33B-v1.3, 33B-Coder; Vigogne 2 7B-Chat, 7B-Instruct, 13B-Instruct, 70B-Chat; Wizard Coder-33B-v1.1, LM-7B-v1.0, LM-2-7B, LM-13B-v1.2, LM-70B-v1.0, Math-7B-v1.1, 13B-v1.0, 70B-v1.0; Zephyr 3B, 7B-Alpha, 7B-Beta |

| Directory | Description | Models |
|:------------:|:--------------------------------:|:---------:|
| **Generic** | Models trained for english language | CodeLLaMA 7B, 7B-Instruct, 7B-Python, 13B, 13B-Instruct, 13B-Python, 34B, 34B-Instruct, 34B-Python, 70B, 70B-Instruct; Falcon 7B, 7B-Instruct, 40B, 40B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; LLaMA 2 7B, 7B-Chat, 7B-Coder, 13B, 13B-Chat, 70B, 70B-Chat; LLamA Pro 8B, 8B-Instruct; Mistral 7B-v0.1, 7B-Instruct-v0.2, 7B-OpenOrca; MistralLite 7B; Mixtral 8x7B-v0.1, 8x7B-Dolphin-2.7, 8x7B-Instruct-v0.1, SlimOrca 8x7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Notux 8x7b-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; OpenLLaMA 3B-v2, 7B-v2, 13B-v2; Orca 2 7B, 13B; Phi 2 2.7B; SlimOrca 13B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0; Vicuna 7B-v1.5, 13B-v1.5, 33B-v1.3, 33B-Coder; Vigogne 2 7B-Chat, 7B-Instruct, 13B-Instruct, 70B-Chat; WizardL LM-7B-v1.0, LM-13B-v1.2, LM-70B-v1.0; Zephyr 3B, 7B-Alpha, 7B-Beta |
| **Generic** | Models trained for english language | CodeLLaMA 7B, 7B-Instruct, 7B-Python, 13B, 13B-Instruct, 13B-Python, 34B, 34B-Instruct, 34B-Python, 70B, 70B-Instruct; Falcon 7B, 7B-Instruct, 40B, 40B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; Gorilla 7B-Falcon, 7B, OpenFunctions V1, OpenFunctions V2; LLaMA 2 7B, 7B-Chat, 7B-Coder, 13B, 13B-Chat, 70B, 70B-Chat; LLaMA 3 11B-Instruct-v0.1, 13B-Instruct-v0.1, 16B-Instruct-v0.1; LLamA Pro 8B, 8B-Instruct; Meta-Llama 3 8B, 8B-Instruct, 70B, 70B-Instruct; Mistral 7B-v0.1, 7B-Instruct-v0.2, 7B-OpenOrca; MistralLite 7B; Mixtral 8x7B-v0.1, 8x7B-Dolphin-2.7, 8x7B-Instruct-v0.1, SlimOrca 8x7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Notux 8x7b-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; OpenLLaMA 3B-v2, 7B-v2, 13B-v2; Orca 2 7B, 13B; Phi 2 2.7B; SlimOrca 13B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0; Vicuna 7B-v1.5, 13B-v1.5, 33B-v1.3, 33B-Coder; Vigogne 2 7B-Chat, 7B-Instruct, 13B-Instruct, 70B-Chat; WizardL LM-7B-v1.0, LM-2-7B, LM-13B-v1.2, LM-70B-v1.0; Zephyr 3B, 7B-Alpha, 7B-Beta |

| Directory | Description | Models |
|:------------:|:--------------------------------:|:---------:|
| **Tiny** | Models <= 7B params | CodeLLaMA 7B, 7B-Instruct; Falcon 7B, 7B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; LLaMA 2 7B, 7B-Chat; Meditron 7B, 7B-Chat; Mistral 7B-v0.1, 7B-Instruct-v0.2; MistralLite 7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; OpenLLaMA 3B-v2, 7B-v2; Orca 2 7B; Phi 2 2.7B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0; Vicuna 7B-v1.5; Vigogne 2 7B-Chat, 7B-Instruct; Wizard LM-7B-v1.0; Zephyr 3B, 7B-Alpha, 7B-Beta |
| **Small** | Models with 7B and <= 13B params | CodeLLaMA 13B, 13B-Instruct; LLaMA 2 13B, 13B-Chat; LLamA Pro 8B, 8B-Instruct; OpenLLaMA 13B-v2; Orca 2 13B; SlimOrca 13B; Vicuna 13B-v1.5; Vigogne 2 13B-Instruct; WizardLM 13B-v1.2 |
| **Medium** | Models with > 13B abd <=40B params | Alfred 40B-1023; CodeLLaMA 34B, 34B-Instruct; Falcon 40B, 40B-Instruct |
| **Large** | Models with >40B and <=80B params | CodeLLaMA 70B, 70B-Instruct; LLaMA 2 70B, 70B-Chat; Mixtral 8x7B-v0.1, 8x7B-Instruct-v0.1; Notux 8x7b-v1, Wizard LM-70B-v1.0 |
| **Tiny** | Models <= 7B params | CodeLLaMA 7B, 7B-Instruct; Falcon 7B, 7B-Instruct; Gemma 2B, 2B-Instruct, 7B, 7B-Instruct; Gorilla 7B-Falcon, 7B, OpenFunctions V1, OpenFunctions V2; LLaMA 2 7B, 7B-Chat; Meditron 7B, 7B-Chat; Mistral 7B-v0.1, 7B-Instruct-v0.2; MistralLite 7B; Neural-Chat 7B-v3.3; Notus 7B-v1; Nous-Hermes-2 Mistral-7B-DPO, Mixtral-8x7B-DPO, Mistral-8x7B-SFT; OpenChat 7B-v3.5-0106; OpenLLaMA 3B-v2, 7B-v2; Orca 2 7B; Phi 2 2.7B; Starling LM 7B-Alpha; Tinyllama 1.1B Chat v1.0; Vicuna 7B-v1.5; Vigogne 2 7B-Chat, 7B-Instruct; Wizard LM-7B-v1.0, LM-2-7B; Zephyr 3B, 7B-Alpha, 7B-Beta |
| **Small** | Models with 7B and <= 13B params | CodeLLaMA 13B, 13B-Instruct; LLaMA 2 13B, 13B-Chat; LLaMA 3 11B-Instruct-v0.1, 13B-Instruct-v0.1; LLamA Pro 8B, 8B-Instruct; Meta-Llama 3 8B, 8B-Instruct; OpenLLaMA 13B-v2; Orca 2 13B; SlimOrca 13B; Vicuna 13B-v1.5; Vigogne 2 13B-Instruct; WizardLM 13B-v1.2 |
| **Medium** | Models with > 13B abd <=40B params | Alfred 40B-1023; CodeLLaMA 34B, 34B-Instruct; Falcon 40B, 40B-Instruct; LLaMA 3 16B-Instruct-v0.1 |
| **Large** | Models with >40B and <=80B params | CodeLLaMA 70B, 70B-Instruct; LLaMA 2 70B, 70B-Chat; Meta-Llama 3 70B, 70B-Instruct; Mixtral 8x7B-v0.1, 8x7B-Instruct-v0.1; Notux 8x7b-v1, Wizard LM-70B-v1.0 |

| Directory | Description | Models |
|:------------:|:--------------------------------:|:---------:|
| **Coder** | Models trained for coding | Code 13B, 33B; CodeLLaMA 7B-Python, 13B-Python, 34B-Python, 70B-Python; LLaMA 2 7B-Coder; Sqlcoder 2 15B; Vicuna 33B-Coder; Wizard Coder-33B-v1.1 |
| **Coder** | Models trained for coding | Code 13B, 33B; CodeLLaMA 7B-Python, 13B-Python, 34B-Python, 70B-Python; Open Code Interpreter CL-7B, CL-13B, CL-70B, DS-6.7B, DS-33B; LLaMA 2 7B-Coder; Sqlcoder 2 15B; Vicuna 33B-Coder; Wizard Coder-33B-v1.1 |
| **Finance** | Models trained for finance domain | Finance Chat, LLM, LLM-13B |
| **Math** | Models trained for math domain | Wizard Math-7B-v1.1, 13B-v1.0, 70B-v1.0 |
| **Medical** | Models trained for medical domain | Asclepius 13B; BioMistral 7B; Med42 70B; Medalpaca 13B; Medicine Chat, LLM, LLM 13B; Meditron 7B, 7B-Chat, 70B; PsyMedRP 13B-v1, 20B-v1; TinyLlama 1.1B Chat Medical |
Expand Down
Loading