Reload big model with multiple state dict files #1644

natuan · 2023-06-30T04:56:58Z

This change fixes the issue that state dict of big LLMs cannot be reloaded into the model after arch recipe applied, the reason being we only check the single file "pytorch_model.bin". This results in e.g. wrong quantized models after ONNX export.
Qualification: manually tested by exporting a pruned quantized Falcon-7b model, verifying quantization parameters.

dbogunowicz

Great job spotting this Tuan!

src/sparseml/transformers/sparsification/trainer.py

bfineran · 2023-07-03T14:52:09Z

test failure unrelated, merging

Reload big model with multiple state dict files

80620f1

natuan requested review from bfineran, anmarques, dbogunowicz and a team June 30, 2023 04:56

dbogunowicz previously approved these changes Jun 30, 2023

View reviewed changes

rahul-tuli previously approved these changes Jun 30, 2023

View reviewed changes

src/sparseml/transformers/sparsification/trainer.py Show resolved Hide resolved

Add description for reload func

d19c550

natuan dismissed stale reviews from rahul-tuli and dbogunowicz via d19c550 June 30, 2023 16:33

Merge branch 'main' into load_truly_LLMs

44abc6b

bfineran approved these changes Jun 30, 2023

View reviewed changes

Merge branch 'main' into load_truly_LLMs

a952e94

dbogunowicz approved these changes Jul 3, 2023

View reviewed changes

bfineran merged commit b73a173 into main Jul 3, 2023

bfineran deleted the load_truly_LLMs branch July 3, 2023 14:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reload big model with multiple state dict files #1644

Reload big model with multiple state dict files #1644

natuan commented Jun 30, 2023

dbogunowicz left a comment

bfineran commented Jul 3, 2023

Reload big model with multiple state dict files #1644

Reload big model with multiple state dict files #1644

Conversation

natuan commented Jun 30, 2023

dbogunowicz left a comment

Choose a reason for hiding this comment

bfineran commented Jul 3, 2023