
epic: Semantic naming for Cortex Engines? #1168

Closed
Tracked by #1072
dan-menlo opened this issue Sep 9, 2024 · 8 comments · Fixed by #1406
Assignees
Labels
category: engine management Related to engine abstraction P0: critical Mission critical
Milestone

Comments

@dan-menlo
Contributor

dan-menlo commented Sep 9, 2024

Goal / Summary of discussion

  • We should adopt a more semantic naming format for Cortex engines
  • We should follow how the upstream authors name them, i.e.
    • onnxruntime
    • llama-cpp
    • tensorrt-llm
| # | Name | Supported Formats | Version | Status |
|---|------|-------------------|---------|--------|
| 1 | onnxruntime | ONNX | 0.0.1 | Incompatible |
| 2 | llama-cpp | GGUF | 0.0.1 | Ready |
| 3 | tensorrt-llm | TensorRT Engines | 0.0.1 | Incompatible |

Out of scope - renaming repos?

For the Repos, I recommend we add an *-engine suffix to differentiate us from the upstream library:

| Old Repo name | New Repo name |
|---------------|---------------|
| janhq/cortex.tensorrt-llm | janhq/tensorrt-llm-engine |
| janhq/cortex.llamacpp | janhq/llamacpp-engine |
| janhq/cortex.onnx | janhq/onnxruntime-engine |

Tasklist

@dan-menlo dan-menlo added this to Menlo Sep 9, 2024
@dan-menlo dan-menlo converted this from a draft issue Sep 9, 2024
@dan-menlo dan-menlo moved this to Scheduled in Menlo Sep 9, 2024
@freelerobot freelerobot added the category: engine management Related to engine abstraction label Sep 10, 2024
@namchuai
Contributor

IMO, maybe keep them as llamacpp, onnx, and tensorrt

@dan-menlo
Contributor Author

dan-menlo commented Sep 15, 2024

@tikikun @nguyenhoangthuan99 Can I check: if we are using "engine names", should it be:

  • llamacpp (format: gguf)
  • onnxruntime (format: onnx)
  • tensorrt-llm? (format: tensorrt engines)

@freelerobot
Contributor

freelerobot commented Sep 23, 2024

The lack of a decision is causing issues here: #1283

Gentle bump @nguyenhoangthuan99 @vansangpfiev to come to an agreement on this.

Option 1:

llama-cpp
onnx-runtime
tensorrt-llm

Option 2:

llamacpp
onnxruntime
tensorrtllm

Current (inconsistent):

❯ cortex-nightly engines list
+---+--------------+-------------------+---------+--------------+
| # | Name         | Supported Formats | Version | Status       |
+---+--------------+-------------------+---------+--------------+
| 1 | ONNXRuntime  | ONNX              | 0.0.1   | Incompatible |
+---+--------------+-------------------+---------+--------------+
| 2 | llama.cpp    | GGUF              | 0.0.1   | Ready        |
+---+--------------+-------------------+---------+--------------+
| 3 | TensorRT-LLM | TensorRT Engines  | 0.0.1   | Incompatible |
+---+--------------+-------------------+---------+--------------+

@freelerobot freelerobot moved this from Scheduled to Planning in Menlo Sep 23, 2024
@freelerobot freelerobot added this to the v0.1.0 milestone Sep 23, 2024
@dan-menlo dan-menlo moved this from Planning to Triage in Menlo Sep 29, 2024
@nguyenhoangthuan99 nguyenhoangthuan99 moved this from Investigating to In Progress in Menlo Oct 1, 2024
@nguyenhoangthuan99 nguyenhoangthuan99 moved this from In Progress to Investigating in Menlo Oct 1, 2024
@gabrielle-ong
Contributor

@nguyenhoangthuan99 @vansangpfiev adding this as a priority to decide for consistency before the release

@nguyenhoangthuan99
Contributor

nguyenhoangthuan99 commented Oct 2, 2024

Related issue: #1283
To choose the best option for both user experience and development experience, let's analyze the two proposed options and the current state:

  • Option 1: llama-cpp, onnx-runtime, tensorrt-llm
  • Option 2: llamacpp, onnxruntime, tensorrtllm
  • Current: llama.cpp, ONNXRuntime, TensorRT-LLM

Analysis:

  1. Consistency: Option 2 is the most consistent, using all lowercase and no hyphens. This can be beneficial for both users and developers, as it reduces cognitive load and potential for errors in typing or remembering names.

  2. Readability: Option 1 is more readable due to the use of hyphens, which can help in quickly distinguishing between words. However, this advantage is minor.

  3. Familiarity: The current naming uses the official names of the projects (llama.cpp, ONNXRuntime, TensorRT-LLM). This might be more familiar to users who know these projects.

  4. Consistency with current implementation: Option 1 is closer to the current implementation, which might make the transition easier for existing users.

I prefer Option 1: update our engine naming convention to llama-cpp, onnx-runtime, and tensorrt-llm. This change brings consistency across our engine names while keeping them familiar. The new format improves readability and aligns with common practice in the open-source community, which benefits both user experience and code maintainability. A standardized scheme will also make it easier to integrate future engines.

The terminal output when listing engines:

| # | Name | Supported Formats | Version | Status |
|---|------|-------------------|---------|--------|
| 1 | onnx-runtime | ONNX | 0.0.1 | Incompatible |
| 2 | llama-cpp | GGUF | 0.0.1 | Ready |
| 3 | tensorrt-llm | TensorRT Engines | 0.0.1 | Incompatible |

cc @vansangpfiev @namchuai @dan-homebrew @0xSage

@dan-menlo
Contributor Author

dan-menlo commented Oct 3, 2024

@nguyenhoangthuan99 @vansangpfiev @namchuai I think we should go with how the upstream authors name them, i.e.

  • onnxruntime
  • llama-cpp
  • tensorrt-llm

So essentially how @nguyenhoangthuan99 names them, with just a change to onnxruntime.

| # | Name | Supported Formats | Version | Status |
|---|------|-------------------|---------|--------|
| 1 | onnxruntime | ONNX | 0.0.1 | Incompatible |
| 2 | llama-cpp | GGUF | 0.0.1 | Ready |
| 3 | tensorrt-llm | TensorRT Engines | 0.0.1 | Incompatible |
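The agreed convention above implies a fixed set of canonical engine identifiers, with several legacy spellings (llama.cpp, ONNXRuntime, TensorRT-LLM) still in circulation from earlier builds. A minimal sketch of how such a migration could normalize user input — a hypothetical helper, not part of the Cortex codebase:

```python
# Hypothetical helper: normalize legacy engine display names to the
# agreed canonical identifiers (onnxruntime, llama-cpp, tensorrt-llm).
CANONICAL_ENGINES = {"onnxruntime", "llama-cpp", "tensorrt-llm"}

# Legacy spellings seen in earlier cortex builds and in this thread.
LEGACY_ALIASES = {
    "llama.cpp": "llama-cpp",
    "llamacpp": "llama-cpp",
    "onnx": "onnxruntime",
    "onnx-runtime": "onnxruntime",
    "tensorrt": "tensorrt-llm",
    "tensorrtllm": "tensorrt-llm",
}

def canonical_engine_name(name: str) -> str:
    """Return the canonical identifier for a user-supplied engine name."""
    key = name.strip().lower()
    if key in CANONICAL_ENGINES:
        return key
    if key in LEGACY_ALIASES:
        return LEGACY_ALIASES[key]
    raise ValueError(f"unknown engine: {name!r}")
```

Accepting the old spellings during the transition avoids breaking existing scripts while the CLI and docs converge on the new names.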

For the Repos, I recommend we add an *-engine suffix to differentiate us from the upstream library:

| Old Repo name | New Repo name |
|---------------|---------------|
| janhq/cortex.tensorrt-llm | janhq/tensorrt-llm-engine |
| janhq/cortex.llamacpp | janhq/llamacpp-engine |
| janhq/cortex.onnx | janhq/onnxruntime-engine |

@dan-menlo dan-menlo moved this from Investigating to Scheduled in Menlo Oct 3, 2024
@vansangpfiev vansangpfiev mentioned this issue Oct 3, 2024
@gabrielle-ong gabrielle-ong moved this from Scheduled to In Progress in Menlo Oct 4, 2024
@gabrielle-ong
Contributor

gabrielle-ong commented Oct 5, 2024

Marking as done! Engines have been renamed for a consistent experience.
(Screenshot: renamed engines list)

Out of scope: Renaming Repos task - moving to a discussion for Sprint 22

@github-project-automation github-project-automation bot moved this from In Progress to Review + QA in Menlo Oct 5, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Oct 5, 2024
@hiento09
Contributor

Reopening this epic; we have not renamed the upstream engine repos yet.

@hiento09 hiento09 reopened this Oct 22, 2024
@dan-menlo dan-menlo changed the title decision: Semantic naming for Cortex Engines? epic: Semantic naming for Cortex Engines? Oct 22, 2024