Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

epic: Cortex TensorRT-LLM support #1152

Closed
3 of 7 tasks
dan-menlo opened this issue Sep 8, 2024 · 1 comment
Closed
3 of 7 tasks

epic: Cortex TensorRT-LLM support #1152

dan-menlo opened this issue Sep 8, 2024 · 1 comment
Labels
engine: tensorrt-llm wontfix This will not be worked on

Comments

@dan-menlo
Copy link
Contributor

dan-menlo commented Sep 8, 2024

Goal

  • cortex run model:tensorrt-llm
  • Support TensorRT-LLM model and inference params
  • Support advanced params (e.g. batching)

Enables

  • Jan supports TensorRT-LLM

Tasklist

@dan-menlo dan-menlo added this to Menlo Sep 8, 2024
@dan-menlo dan-menlo converted this from a draft issue Sep 8, 2024
@dan-menlo dan-menlo changed the title epic: cortex run model:tensorrt-llm epic: Cortex supports TensorRT-LLM Sep 8, 2024
@dan-menlo dan-menlo added engine: tensorrt-llm type: epic A major feature or initiative labels Sep 8, 2024
@freelerobot freelerobot added the category: engine management Related to engine abstraction label Sep 9, 2024
@dan-menlo dan-menlo changed the title epic: Cortex supports TensorRT-LLM epic: Cortex TensorRT-LLM support Sep 11, 2024
@dan-menlo dan-menlo moved this to Investigating in Menlo Oct 13, 2024
@dan-menlo
Copy link
Contributor Author

Deprecated due to TensorRT-LLM not supporting Desktop

@github-project-automation github-project-automation bot moved this from Investigating to Review + QA in Menlo Nov 28, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Nov 28, 2024
@gabrielle-ong gabrielle-ong added wontfix This will not be worked on and removed type: epic A major feature or initiative category: engine management Related to engine abstraction labels Nov 28, 2024
@gabrielle-ong gabrielle-ong moved this from Completed to Discontinued in Menlo Nov 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
engine: tensorrt-llm wontfix This will not be worked on
Projects
Archived in project
Development

No branches or pull requests

3 participants