RoPE: model-agnostic RoPE refactor #31999
Open: gante wants to merge 42 commits into huggingface:main from gante:rope_refactor
Commits on May 20, 2024
- cc9b82e: Add YaRN and Dynamic-YaRN RoPE Scaling Methods

  YaRN (Yet another RoPE extension method) combines the NTK-By-Parts Interpolation and Attention Scaling methods, improving upon existing RoPE interpolation methods for longer context window sizes.

  Fine-tuned models maintain their original performance across benchmarks while enabling efficient extrapolation and transfer learning for quicker convergence, especially in compute-limited environments.

  We implement YaRN and Dynamic-YaRN for the following list of models:
  - LLaMA
  - Falcon
  - GPT-NeoX
  - Olmo
  - Persimmon
  - Phi
  - StableLM
  - OpenLLaMA

  New unit tests are added to assert YaRN's correct behavior on both short and long sequence inputs.

  For more details, please refer to https://arxiv.org/abs/2309.00071.

  Co-authored-by: Miguel Almeida <[email protected]>
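The NTK-By-Parts half of YaRN described in this commit can be illustrated with a small, dependency-free sketch. It is not the code from this PR: the function names, the default values (`factor=4.0`, `original_max_pos=2048`, `beta_fast=32.0`, `beta_slow=1.0`) and the use of plain Python lists instead of tensors are assumptions made for illustration, loosely following the formulation in the paper linked above.

```python
import math


def correction_dim(num_rotations, dim, base, original_max_pos):
    """Index (in 0..dim/2) of the RoPE dimension that completes
    `num_rotations` full rotations over the original context length."""
    return dim * math.log(original_max_pos / (num_rotations * 2 * math.pi)) / (2 * math.log(base))


def yarn_inv_freq(dim, base=10000.0, factor=4.0, original_max_pos=2048,
                  beta_fast=32.0, beta_slow=1.0):
    """Hypothetical helper: blend unscaled and interpolated inverse frequencies.

    High-frequency dimensions (index <= low) are left untouched, low-frequency
    dimensions (index >= high) are divided by `factor`, and the dimensions in
    between are mixed with a linear ramp.
    """
    low = max(math.floor(correction_dim(beta_fast, dim, base, original_max_pos)), 0)
    high = min(math.ceil(correction_dim(beta_slow, dim, base, original_max_pos)), dim - 1)
    if low == high:
        high += 0.001  # avoid a zero-width ramp

    inv_freq = []
    for i in range(dim // 2):
        extrapolated = 1.0 / base ** (2 * i / dim)   # original RoPE frequency
        interpolated = extrapolated / factor          # position-interpolated frequency
        ramp = min(max((i - low) / (high - low), 0.0), 1.0)
        inv_freq.append(extrapolated * (1.0 - ramp) + interpolated * ramp)
    return inv_freq
```

Under these assumed defaults and `dim=128`, the first dimensions keep their original frequency while the last ones are divided by `factor`; the attention scaling half of YaRN is applied separately and is not covered by this sketch.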
Commits on Jun 12, 2024
- fc161dd
Commits on Jun 16, 2024
- 1044c7b
- 85552b3: Refactor YaRN implementation for LLaMA

  Iterate on YaRN implementation for LLaMA and remove diff from remaining models for increased PR modularity.

  This commit includes the following changes:
  - Merge 'yarn_rope_scaling' and 'rope_scaling' dictionaries
  - Remove unnecessary attributes ('extrapolation_factor' and 'finetuned') from YaRN classes
  - Inherit 'forward' method in YaRN classes from superclass
  - Rename 'yarn' method to 'compute_yarn_scaling'
  - Extend YaRN tests with further assertions
  - Fix style inconsistencies

  Co-authored-by: Miguel Monte e Freitas <[email protected]>
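To make the idea of a single merged 'rope_scaling' dictionary concrete, here is a rough sketch of what such a dictionary and a validation helper might look like. Everything in it is hypothetical: the key names (`type`, `factor`, `beta_fast`, and so on), the list of accepted types, and the `validate_rope_scaling` helper are assumptions for illustration and are not taken from this commit.

```python
# Assumed set of scaling types; the commit itself only names 'yarn'.
SUPPORTED_TYPES = ("linear", "dynamic", "yarn")

# Assumed YaRN-specific keys that only make sense when type == "yarn".
YARN_ONLY_KEYS = {
    "original_max_position_embeddings",
    "attention_factor",
    "beta_fast",
    "beta_slow",
}


def validate_rope_scaling(rope_scaling):
    """Hypothetical helper: return a list of problems found in one merged dict."""
    problems = []
    if rope_scaling is None:
        return problems  # no scaling configured

    rope_type = rope_scaling.get("type")
    if rope_type not in SUPPORTED_TYPES:
        problems.append(f"unknown type: {rope_type!r}")

    factor = rope_scaling.get("factor")
    if not isinstance(factor, float) or factor < 1.0:
        problems.append(f"factor must be a float >= 1.0, got {factor!r}")

    # YaRN-only keys on a non-YaRN config are reported rather than silently ignored.
    if rope_type != "yarn":
        stray = sorted(YARN_ONLY_KEYS & set(rope_scaling))
        if stray:
            problems.append(f"keys only valid for 'yarn': {stray}")
    return problems


# One dictionary carries both the generic and the YaRN-specific settings.
merged_config = {"type": "yarn", "factor": 4.0, "beta_fast": 32.0, "beta_slow": 1.0}
```

Keeping a single dictionary means a model config needs only one field for all scaling variants, at the cost of having to check which keys apply to which type.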
Commits on Jul 10, 2024
- d84baa9: Refactor Tensor Building Logic for YaRN

  - Comply with the tensor building logic introduced in huggingface#30743
  - Add a reference to the optimized Attention Factor equation
  - Remove Dynamic YaRN for a more agile deployment

  Co-authored-by: mig-mfreitas <[email protected]>
- fdea000
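For readers unfamiliar with the attention factor mentioned in d84baa9, the sketch below shows one way it can be computed and applied. It assumes the commit refers to the recommendation in the YaRN paper, 0.1 * ln(s) + 1 for a scaling factor s; the function names and the list-based cos/sin computation are illustrative assumptions, not code from this PR.

```python
import math


def yarn_attention_factor(factor):
    """Assumed form of the attention factor: 0.1 * ln(s) + 1, with no
    scaling applied when the context is not extended (s <= 1)."""
    if factor <= 1.0:
        return 1.0
    return 0.1 * math.log(factor) + 1.0


def scaled_cos_sin(position, inv_freq, attention_factor):
    """Hypothetical helper: rotary cos/sin for one position, pre-multiplied by
    the attention factor. Scaling both the query and key embeddings this way
    scales the attention logits by attention_factor ** 2."""
    cos = [math.cos(position * f) * attention_factor for f in inv_freq]
    sin = [math.sin(position * f) * attention_factor for f in inv_freq]
    return cos, sin
```

Folding the factor into the cos/sin values means the attention code itself does not need to change, which is one reason this placement is convenient.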
Commits on Jul 16, 2024
- a555034
- 472b168
- 26fd6e9
- 6ea2d3c
- 9df8a43
- 10dc891
Commits on Jul 19, 2024
- e446e64
- cc6af77
- 9914572
- 8befb00
- 20962d8
- 0595968
- 748a318
- 99305b4
- 2f10261
- c34ffff
- 6dae958
Commits on Jul 20, 2024
- 0bcd2c1
- 0ec8ddb
- 1e41bfc
- dffad0d
- 5eb821b
- 61eaf7c
- c720514
- d28add5
Commits on Jul 21, 2024
- 032f662
- 2cba857
- 205c740
- 5d19465
- 909b247
- 441eabb
- b10fee4
- 961e6ad
Commits on Jul 22, 2024
- 0604c44
- 556e140
- 9c5a40e