Hello,
To run QLoRA on the 405B model with lower CPU RAM requirements while loading the model, should we enable `qlora_sharded_model_loading` in the config? I've seen this commit, but I'm not sure, since I don't see this flag in the config or in the docs. Is this feature expected to work?
Thanks!