gguf : general usability improvements #3409

cebtenzzre · 2023-09-29T22:55:13Z

avoid copy-pasted tensor names in MODEL_TENSOR_NAMES
accept str for path in SpecialVocab.__init__
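
For illustration, a minimal sketch of what accepting either `str` or a path object can look like. `VocabLoader`, its `config_file` helper, and the directory name are hypothetical stand-ins, not the actual `SpecialVocab` code from this PR.

```python
import os
from pathlib import Path


class VocabLoader:
    """Hypothetical stand-in; not the real gguf.SpecialVocab."""

    def __init__(self, path: "str | os.PathLike[str]") -> None:
        # Path() accepts both str and os.PathLike, so callers do not
        # need to wrap their argument themselves.
        self.path = Path(path)

    def config_file(self) -> Path:
        # Illustrative helper: build a file path under the model directory.
        return self.path / "tokenizer_config.json"


# Both call styles end up with the same normalized path.
from_str = VocabLoader("models/example")
from_path = VocabLoader(Path("models/example"))
assert from_str.path == from_path.path
```

Normalizing once in the constructor keeps the rest of the class free of `isinstance` checks.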

Resolves the discussion at #2842 (review)

cebtenzzre · 2023-09-30T23:58:09Z

@ggerganov I updated the PR to remove MODEL_TENSOR_NAMES, since it generally makes more sense to use MODEL_TENSORS and TENSOR_NAMES separately. But this is a breaking change. What do you think?

ggerganov

Hm, why is it a breaking change? If it just means that Python scripts have to switch to using gguf.TENSOR_NAMES, then I think it's fine.
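
A rough sketch of how a downstream script might tolerate both layouts during the switch. The two `SimpleNamespace` objects are stand-ins for the gguf module before and after this PR, with simplified string keys and hypothetical contents; `tensor_name` is an illustrative helper, not part of gguf.

```python
from types import SimpleNamespace

# Stand-ins for the module before and after the change (hypothetical data,
# plain strings instead of the enum keys the real module uses).
old_gguf = SimpleNamespace(
    MODEL_TENSOR_NAMES={"llama": {"TOKEN_EMBD": "token_embd"}},
)
new_gguf = SimpleNamespace(
    MODEL_TENSORS={"llama": ["TOKEN_EMBD"]},
    TENSOR_NAMES={"TOKEN_EMBD": "token_embd"},
)


def tensor_name(mod, arch, tensor):
    """Return the standard tensor name from either module layout."""
    names = getattr(mod, "TENSOR_NAMES", None)
    if names is not None:
        # New layout: names do not depend on the architecture.
        return names[tensor]
    # Old layout: names are nested under the architecture.
    return mod.MODEL_TENSOR_NAMES[arch][tensor]


assert tensor_name(old_gguf, "llama", "TOKEN_EMBD") == "token_embd"
assert tensor_name(new_gguf, "llama", "TOKEN_EMBD") == "token_embd"
```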

KerfuffleV2 · 2023-10-02T20:24:27Z

gguf-py/gguf/gguf.py

        for tensor, keys in self.mappings_cfg.items():
-            tensor_name = tensor_names.get(tensor)
-            if tensor_name is None:
+            if tensor not in MODEL_TENSORS[arch]:


It's kind of late to say anything, but I'm curious why you'd make these changes. There's no usability improvement from the user's perspective, and the lookup will now be about twice as slow, since the key has to be hashed and the dictionary searched twice instead of once with dict.get().
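
The two lookup styles being contrasted can be sketched roughly as follows. This is a simplified illustration on a single dict with hypothetical contents; the diff above tests membership in `MODEL_TENSORS[arch]` rather than in the name dict itself.

```python
# Hypothetical name table; the real one is keyed by enum members.
names = {"TOKEN_EMBD": "token_embd", "OUTPUT": "output"}


def lookup_get(key):
    # Single lookup: get() returns None when the key is missing.
    return names.get(key)


def lookup_in_then_index(key):
    # Two lookups for a present key: the membership test, then the indexing.
    if key not in names:
        return None
    return names[key]


# Both styles return the same results; only the number of lookups differs.
assert lookup_get("OUTPUT") == lookup_in_then_index("OUTPUT")
assert lookup_get("MISSING") is None
assert lookup_in_then_index("MISSING") is None
```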

cebtenzzre · 2023-10-02 (edited)

I think the code makes more sense this way: the standard name of a tensor never depends on the model it comes from, so we should represent that fact by using separate data structures. You shouldn't have to know which model architecture you are working with to find the name of a standard tensor, either. I don't think this part of the code is performance-critical, so I didn't bother optimizing it.
The old layout has confused at least some developers, which is what I mean by usability. See #3417 (comment)
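
A minimal sketch of the separation described here, assuming simplified enums and a small made-up subset of tensors. `Arch`, `Tensor`, and `names_for` are hypothetical names chosen for the example and do not mirror the real gguf definitions.

```python
from enum import IntEnum, auto


class Arch(IntEnum):
    LLAMA = auto()
    FALCON = auto()


class Tensor(IntEnum):
    TOKEN_EMBD = auto()
    OUTPUT_NORM = auto()
    ATTN_QKV = auto()


# One architecture-independent table of standard names (illustrative subset).
TENSOR_NAMES = {
    Tensor.TOKEN_EMBD: "token_embd",
    Tensor.OUTPUT_NORM: "output_norm",
    Tensor.ATTN_QKV: "blk.{bid}.attn_qkv",
}

# A separate table listing which tensors each architecture uses.
MODEL_TENSORS = {
    Arch.LLAMA: [Tensor.TOKEN_EMBD, Tensor.OUTPUT_NORM],
    Arch.FALCON: [Tensor.TOKEN_EMBD, Tensor.OUTPUT_NORM, Tensor.ATTN_QKV],
}


def names_for(arch):
    """Derive the per-architecture name mapping on demand."""
    return {t: TENSOR_NAMES[t] for t in MODEL_TENSORS[arch]}


# A standard name can be found without knowing the architecture.
assert TENSOR_NAMES[Tensor.TOKEN_EMBD] == "token_embd"
# The combined view is still available when it is actually needed.
assert Tensor.ATTN_QKV not in names_for(Arch.LLAMA)
```

With this split, each name is written once, which is what removes the copy-pasted entries that a nested per-architecture table needs.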

…example * 'master' of github.com:ggerganov/llama.cpp: (24 commits)
convert : fix Baichuan2 models by using vocab size in config.json (ggerganov#3299)
readme : add project status link
ggml : fix build after ggerganov#3329
llm : add Refact model (ggerganov#3329)
sync : ggml (conv 1d + 2d updates, UB fixes) (ggerganov#3468)
finetune : readme fix typo (ggerganov#3465)
ggml : add RISC-V Vector Support for K-Quants and improved the existing intrinsics (ggerganov#3453)
main : consistent prefix/suffix coloring (ggerganov#3425)
llama : fix session saving/loading (ggerganov#3400)
llama : expose model's rope_freq_scale in the API (ggerganov#3418)
metal : alibi for arbitrary number of heads (ggerganov#3426)
cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (ggerganov#3273)
Work on the BPE tokenizer (ggerganov#3252)
convert : fix vocab size when not defined in hparams (ggerganov#3421)
cmake : increase minimum version for add_link_options (ggerganov#3444)
CLBlast: Add broadcast support for matrix multiplication (ggerganov#3402)
gguf : add BERT, MPT, and GPT-J arch info (ggerganov#3408)
gguf : general usability improvements (ggerganov#3409)
cmake : make CUDA flags more similar to the Makefile (ggerganov#3420)
finetune : fix ggerganov#3404 (ggerganov#3437)
...

cebtenzzre added 3 commits September 29, 2023 18:44

gguf : avoid copy-pasted tensor names

ea90d2a

gguf : accept str for path in SpecialVocab.__init__

7fa5cbf

gguf : clean up SpecialVocab

84f7cea

ggerganov approved these changes Sep 30, 2023

cebtenzzre added 2 commits September 30, 2023 19:47

gguf : fix typos

fd5e226

gguf : eliminate MODEL_TENSOR_NAMES

3724ad6

cebtenzzre mentioned this pull request Oct 1, 2023

MPT support in llama.cpp #3417

Merged

ggerganov approved these changes Oct 2, 2023

cebtenzzre added 2 commits October 2, 2023 14:52

gguf : bump version

64607e4

gguf : fix typos

bd890ec

cebtenzzre merged commit 0fe3210 into ggerganov:master Oct 2, 2023
9 checks passed

KerfuffleV2 reviewed Oct 2, 2023

FNsi mentioned this pull request Oct 5, 2023

convert : update Falcon script for new HF config #3448

Merged

yusiwen pushed a commit to yusiwen/llama.cpp that referenced this pull request Oct 7, 2023

gguf : general usability improvements (ggerganov#3409)

0099b74

cebtenzzre mentioned this pull request Oct 7, 2023

py: fix 'gguf' has no attribute 'TENSOR_NAMES' #3496 #3526

Closed
