Pull requests: huggingface/accelerate
support for wrapped schedulefree optimizer when using deepspeed
#3266
opened Nov 26, 2024 by
winglian
5 tasks
Replaced set/check breakpoint with set/check trigger in the troubleshooting documentation
#3259
opened Nov 25, 2024 by
relh
1 of 5 tasks
Select the DeepSpeedCPUOptimizer based on the original optimizer class.
#3255
opened Nov 24, 2024 by
eljandoubi
Revert default behavior of get_state_dict_from_offload
#3253
opened Nov 23, 2024 by
kylesayrs
Fix: Resolve #3060, preload_module_classes is lost for nested modules
#3248
opened Nov 22, 2024 by
wejoncy
Fix: get_balanced_memory when using multiple GPUs with small models or quantized models with a large vocabulary
#3244
opened Nov 18, 2024 by
MekkCyber
1 of 5 tasks
Ensure explicit output dtype for pad_across_processes
#3219
opened Nov 5, 2024 by
mariusarvinte
4 of 5 tasks
Create _preprare_fsdp to pre-prepare FSDP model training
#3213
opened Nov 3, 2024 by
eljandoubi
Give example on how to handle gradient accumulation with cross-entropy
#3193
opened Oct 24, 2024 by
ylacombe
Distributed inference example for llava_next
#3179
opened Oct 20, 2024 by
VladOS95-cyber
3 of 5 tasks
feat: support tensor parallel using PyTorch 2.0 & Data loader
#3173
opened Oct 16, 2024 by
kmehant
1 of 5 tasks
add support for custom function for reducing the batch size
wip (work in progress)
#3071
opened Sep 3, 2024 by
winglian
5 tasks
Some adjustments for supporting DeepSpeed-Ulysses
wip (work in progress)
#2877
opened Jun 20, 2024 by
zeyugao
1 of 5 tasks