Skip to content

v0.3.0

Compare
Choose a tag to compare
@XkunW XkunW released this 29 Aug 06:09
· 59 commits to main since this release
156dfa5
  • Added vec-inf CLI:

    • Install vec-inf via pip
    • launch command to launch models
    • status command to check inference server status
    • shutdown command to stop inference server
    • list command to see all available models
  • Upgraded vllm to 0.5.4

  • Added support for new model families:

    • Llama 3.1 (Including 405B)
    • Gemma 2
    • Phi 3
    • Mistral Large