QLoRA Llama 3.1 405B with `qlora_sharded_model_loading: true` #1931

v-dicicco · 2024-09-27T09:36:11Z

Hello,
To run QLoRA on the 405B model while keeping CPU RAM requirements low during model loading, should we enable `qlora_sharded_model_loading` in the config? I've seen this commit, but I'm not sure, since I don't see this flag in the config or in the docs. Is this feature expected to work?

Thanks!
