System Info
Running on an NVIDIA H100.
Who can help?
No response
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
1. git -C /workspace clone https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3
2. python tensorrt_llm/examples/llama/convert_checkpoint.py --model_dir /workspace/Mistral-7B-Instruct-v0.3 --output_dir /workspace/trt_ckpt/mistral3/fp16 --dtype bfloat16
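Before running step 2, it can be worth confirming that step 1 actually fetched the weights: cloning a Hugging Face repo without git-lfs installed leaves small text pointer files in place of the multi-gigabyte `*.safetensors` shards, which also makes conversion fail. A minimal stdlib-only sketch (the path comes from step 1; the 1 MB threshold is an arbitrary assumption):

```python
# Flag *.safetensors files that look like git-lfs pointer stubs rather than
# real weight shards. Real shards are gigabytes; LFS pointers are a few
# hundred bytes of text.
import glob
import os


def undersized_weight_files(model_dir, min_bytes=1_000_000):
    """Return the *.safetensors paths under model_dir smaller than min_bytes."""
    return [
        path
        for path in sorted(glob.glob(os.path.join(model_dir, "*.safetensors")))
        if os.path.getsize(path) < min_bytes
    ]


if undersized_weight_files("/workspace/Mistral-7B-Instruct-v0.3"):
    print("some weight files look like LFS pointers; re-clone with git-lfs")
```

An empty result does not guarantee the checkpoint is intact, but a non-empty one means the converter never had real weights to read.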
Expected behavior
0.11.0.dev2024060400
Total time of converting checkpoints: xx:xx:xx
Actual behavior
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.dev2024060400
0.11.0.dev2024060400
Traceback (most recent call last):
  File "/app/tensorrt_llm/examples/llama/convert_checkpoint.py", line 439, in <module>
    main()
  File "/app/tensorrt_llm/examples/llama/convert_checkpoint.py", line 431, in main
    convert_and_save_hf(args)
  File "/app/tensorrt_llm/examples/llama/convert_checkpoint.py", line 366, in convert_and_save_hf
    execute(args.workers, [convert_and_save_rank] * world_size, args)
  File "/app/tensorrt_llm/examples/llama/convert_checkpoint.py", line 390, in execute
    f(args, rank)
  File "/app/tensorrt_llm/examples/llama/convert_checkpoint.py", line 355, in convert_and_save_rank
    llama = LLaMAForCausalLM.from_hugging_face(
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/model.py", line 292, in from_hugging_face
    weights = load_weights_from_hf_safetensors(hf_model_dir, config)
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/convert.py", line 1577, in load_weights_from_hf_safetensors
    weights['transformer.vocab_embedding.weight'] = load(
  File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/convert.py", line 1555, in load
    res = safetensors_ptrs[ptr_idx].get_tensor(key)
safetensors_rust.SafetensorError: File does not contain tensor model.embed_tokens.weight
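The error says the file the loader opened lacks the tensor model.embed_tokens.weight. A quick way to check where (or whether) that tensor lives in the downloaded checkpoint is to look it up in the Hugging Face shard index. A minimal stdlib-only sketch (the /workspace path is taken from the reproduction steps; model.safetensors.index.json is assumed to be present, as in standard sharded HF checkpoints):

```python
# Look up which shard a tensor key is mapped to in a sharded Hugging Face
# checkpoint. The index file maps tensor names to *.safetensors shard files.
import json
import os


def find_tensor_shard(model_dir, key):
    """Return the shard file name that holds `key`, or None if unmapped."""
    index_path = os.path.join(model_dir, "model.safetensors.index.json")
    with open(index_path) as f:
        index = json.load(f)
    return index.get("weight_map", {}).get(key)
```

If the index does map the key but conversion still fails, the converter may be opening a different file in the directory; the Mistral v0.3 repositories also ship a consolidated.safetensors whose keys follow Mistral's native naming rather than the HF `model.*` scheme, which can trip up loaders that glob every `*.safetensors` file.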
Additional notes
Mistral 7B v0.3 requires transformers 4.42.0.dev0, while the version of transformers shipped with tensorrtllm_backend is 4.40.2. However, the command still fails even after upgrading transformers to 4.42.0.dev0.
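The version mismatch in the note can be checked mechanically. A rough sketch (it compares only the leading numeric release components, so it treats 4.42.0.dev0 and 4.42.0 as the same release, ignoring pre-release ordering):

```python
# Compare release numbers like "4.40.2" vs "4.42.0.dev0" by their leading
# numeric components only; suffixes such as ".dev0" are ignored.
def release_tuple(version):
    """'4.42.0.dev0' -> (4, 42, 0); stops at the first non-numeric part."""
    parts = []
    for piece in version.split("."):
        if piece.isdigit():
            parts.append(int(piece))
        else:
            break
    return tuple(parts)


def satisfies(installed, required):
    """True if `installed` is at least the release of `required`."""
    return release_tuple(installed) >= release_tuple(required)


print(satisfies("4.40.2", "4.42.0.dev0"))  # prints False: the shipped version is too old
```

This confirms the pinned 4.40.2 predates the required release, but as the note says, the conversion error persists even on 4.42.0.dev0, so the transformers version is not the root cause here.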