Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PCA center, GGUF import #34

Merged
merged 2 commits into from
May 25, 2024
Merged

PCA center, GGUF import #34

merged 2 commits into from
May 25, 2024

Conversation

vgel
Copy link
Owner

@vgel vgel commented May 24, 2024

Changes

  • Adds a new method argument to ControlVector.train. This defaults to the old method, method="pca_diff", but it can be switched to method="pca_center" for more stability. This seems to be necessary for training vectors on larger models, such as llama3-70b. Thanks to @r3ndd
  • Adds ControlVector.import_gguf as a counterpart to ControlVector.export_gguf. GGUF is now the preferred way to save control vectors, for better compatibility with llama.cpp (however, the old pickle method will continue to work)
  • Adds eq implementation for ControlVector
  • More thorough testing

Co-Authored-With: [email protected]

@lin72h
Copy link

lin72h commented May 25, 2024

very impressive, thanks for working on this feature

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants