Skip to content

Commit

Permalink
update the recipes link (#1505)
Browse files Browse the repository at this point in the history
Signed-off-by: chensuyue <[email protected]>
  • Loading branch information
chensuyue authored Dec 29, 2023
1 parent f19cc9d commit b8c7f1a
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/source/llm_recipes.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
LLM Quantization Models and Recipes
LLMs Quantization Recipes
---

Intel® Neural Compressor supported advanced large language models (LLMs) quantization technologies including SmoothQuant (SQ) and Weight-Only Quant (WOQ),
Expand All @@ -21,7 +21,7 @@ This document aims to publish the specific recipes we achieved for the popular L
| meta-llama/Llama-2-70b-hf ||||
| tiiuae/falcon-40b ||||

**Detail recipes can be found [HERE](https://github.com/intel/intel-extension-for-transformers/examples/huggingface/pytorch/text-generation/quantization/llm_quantization_recipes.md).**
**Detail recipes can be found [HERE](https://github.com/intel/intel-extension-for-transformers/blob/main/examples/huggingface/pytorch/text-generation/quantization/llm_quantization_recipes.md).**
> Notes:
> - This model list comes from [IPEX](https://intel.github.io/intel-extension-for-pytorch/cpu/latest/tutorials/llm.html).
> - WOQ INT4 recipes will be published soon.

0 comments on commit b8c7f1a

Please sign in to comment.