Pull requests: axolotl-ai-cloud/axolotl

Pull requests list

bump liger to 0.5.3
#2353 opened Feb 21, 2025 by winglian
[WIP] GRPO <-> Liger
#2350 opened Feb 19, 2025 by SalmanMohammadi Draft
Patch lora kernels post model load
#2345 opened Feb 18, 2025 by djsaunde Draft
Relaxed recursive transformers
#2276 opened Jan 22, 2025 by winglian Draft
Enable flex attention support
#2255 opened Jan 13, 2025 by bursteratom Draft
[KD] add uld and jsd
#2253 opened Jan 11, 2025 by kashif
feat: add deepseek_v3 sample packing
#2230 opened Jan 2, 2025 by NanoCode012
convert-diff-transformer CLI command / codepath
#2197 opened Dec 17, 2024 by djsaunde
6 of 7 tasks
perform flakey patched tests in individual runner (label: hold; don't merge this yet)
#2185 opened Dec 13, 2024 by winglian
rebased hymba multipack support
#2178 opened Dec 11, 2024 by bursteratom
Multimodal integration - pixtral/llava/qwen2-vl (label: scheduled_release; this PR is slated for the upcoming release)
#2170 opened Dec 10, 2024 by bursteratom
Fix: RL base feature parity
#2133 opened Dec 6, 2024 by NanoCode012
5 tasks done
refactor(optimizer): use optimizer_cls_and_kwargs for custom optim
#2012 opened Nov 4, 2024 by NanoCode012
3 of 6 tasks
add soap optimizer support
#1978 opened Oct 17, 2024 by winglian
shampoo optim support
#1919 opened Sep 18, 2024 by winglian
multipack support for phi moe
#1870 opened Aug 26, 2024 by winglian
semi-weekly 8bit lora zero3 check (label: hold; don't merge this yet)
#1852 opened Aug 22, 2024 by winglian
add q-galore optimizer
#1752 opened Jul 14, 2024 by winglian
Implements SPPO Alignment Algoritm
#1735 opened Jul 11, 2024 by kaykyr
1 of 3 tasks