Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Yarn feature #97

Merged
merged 3 commits into from
Feb 22, 2024
Merged

Yarn feature #97

merged 3 commits into from
Feb 22, 2024

Conversation

xiguiw
Copy link
Contributor

@xiguiw xiguiw commented Jan 28, 2024

Type of Change

  1. Add YaRN rope scaling data structure and conversion read/write of the yarn parameters

How has this PR been tested?

convert data model and read/quant the mode correctly.

Dependency Change?

No.

neural_speed/models/llama/llama_yarn.cpp Outdated Show resolved Hide resolved
@xiguiw xiguiw force-pushed the yarn-feature branch 2 times, most recently from 45ee371 to 2df7c38 Compare January 31, 2024 02:25
@xiguiw xiguiw force-pushed the yarn-feature branch 2 times, most recently from 5a90dc4 to a57eb4f Compare February 7, 2024 07:37
@airMeng airMeng merged commit 8c846d6 into intel:main Feb 22, 2024
10 checks passed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants