8 * A100 startup is painfully slow. Has any brave soul gotten it to start? #11

CarryChang · 2024-05-08T08:52:12Z

No description provided.

zwd003 · 2024-05-08T11:24:21Z

stack-heap-overflow · 2024-05-09T06:58:00Z

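The accelerate library used by the HuggingFace code miscalculates the GPU memory allocation for the model. The example code has now been updated, which should greatly shorten model loading time.

The model-loading code has been changed to: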

model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True, device_map="sequential", torch_dtype=torch.bfloat16, max_memory=max_memory, attn_implementation="eager")
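The call above passes a `max_memory` argument that is not defined anywhere in this thread. A minimal sketch of how it could be built is shown here, assuming eight GPUs and a per-GPU cap of `"75GB"` (an illustrative value for 80 GB A100 cards, not a number taken from the thread). The helper names `build_max_memory` and `load_model` are hypothetical and exist only for this example.

```python
def build_max_memory(num_gpus: int = 8, per_gpu: str = "75GB") -> dict:
    """Build a per-device memory cap: GPU index -> size string.

    The 8-GPU count matches the issue title; the "75GB" cap is an
    assumption chosen to leave some headroom on an 80 GB card.
    """
    return {gpu_index: per_gpu for gpu_index in range(num_gpus)}


def load_model(model_name: str, max_memory: dict):
    """Load the model with the arguments quoted in the comment above."""
    # Imported lazily so build_max_memory can be used without torch installed.
    import torch
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_name,
        trust_remote_code=True,
        device_map="sequential",   # fill GPU 0 first, then GPU 1, and so on
        torch_dtype=torch.bfloat16,
        max_memory=max_memory,     # per-GPU cap built by the helper above
        attn_implementation="eager",
    )
```

Usage would look like `model = load_model(model_name, build_max_memory())`, where `model_name` is whatever checkpoint the example code already uses. With `device_map="sequential"`, the caps in `max_memory` determine how many layers are placed on each GPU before moving to the next, so the per-GPU value may need tuning for a given setup.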

soloice closed this as completed May 10, 2024
