DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
Updated Dec 12, 2024 - C
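Because this repository is listed under the guided-decoding topic, here is a minimal, hypothetical sketch of what guided (constrained) decoding means in general: at each generation step, candidate tokens are masked against a constraint before sampling, so the output is forced to follow a target structure. The names `toy_logits` and `allowed_next` and the hard-coded JSON template are illustrative assumptions only and are not part of the DashInfer API.

```python
# Minimal sketch of guided (constrained) decoding with a toy model.
# Assumption: a real engine would supply logits and a grammar/schema;
# here both are stand-ins for illustration.
import math
import random

VOCAB = ["{", "}", '"key"', ":", '"value"', "<eos>"]

def toy_logits(prefix):
    # Stand-in for a real model: uniform scores over the vocabulary.
    return [0.0 for _ in VOCAB]

def allowed_next(prefix):
    # A tiny "grammar": force the output to be the JSON object {"key":"value"}.
    target = ["{", '"key"', ":", '"value"', "}", "<eos>"]
    return {target[len(prefix)]} if len(prefix) < len(target) else {"<eos>"}

def guided_decode(max_steps=10):
    prefix = []
    for _ in range(max_steps):
        logits = toy_logits(prefix)
        allowed = allowed_next(prefix)
        # Mask disallowed tokens, then sample from the renormalized distribution.
        masked = [l if tok in allowed else -math.inf for tok, l in zip(VOCAB, logits)]
        weights = [math.exp(l) for l in masked]
        token = random.choices(VOCAB, weights=weights, k=1)[0]
        if token == "<eos>":
            break
        prefix.append(token)
    return "".join(prefix)

print(guided_decode())  # -> {"key":"value"}
```

In a production engine the mask would come from a regex, JSON schema, or grammar compiled into per-step allowed-token sets, but the core idea, zeroing out probability mass for tokens that would violate the constraint, is the same.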