Pull requests: karpathy/llm.c

Pull requests list

Update README.md
#789 opened Dec 14, 2024 by joeyabdalla
Mapping "py" gpt2 functionalities to match "c"
#783 opened Oct 31, 2024 by omarswelam
Verify vocab is padded before reshaping
#782 opened Oct 23, 2024 by austinleedavis
FP32 FlashAttention
#781 opened Oct 20, 2024 by ssiu
Activation Checkpointing for Llama3 branch
#773 opened Oct 2, 2024 by ademeure
-pm -> -pi: typo in error_usage
#765 opened Sep 22, 2024 by thundergolfer
Micro optimization for softmax_forward_kernel5
#762 opened Sep 20, 2024 by insop
FP8 with Tensor Reorg
#760 opened Sep 19, 2024 by ademeure Draft
Update download_starter_pack.sh
#758 opened Sep 18, 2024 by dongrixinyu
Add SwiGLU support - llama3 feature branch
#755 opened Sep 13, 2024 by gordicaleksa
add llama 3 support to llm.c
#754 opened Sep 13, 2024 by karpathy Draft
Adamw thread coarsening kernel
#753 opened Sep 3, 2024 by saladpalad
Fix sizing typo in train_gpt2_fp32.cu
#748 opened Aug 25, 2024 by gajanan-choudhary
log with LINE and FILE for better addressing.
#746 opened Aug 22, 2024 by NEWPLAN
check libnccl instead of nccl to be more reliable
#742 opened Aug 14, 2024 by dengl11
[WIP] initial curand implementation for model init
#741 opened Aug 13, 2024 by ngc92
multi-threaded model initialization
#737 opened Aug 12, 2024 by ngc92
Add external KV to LLaMA 3
#734 opened Aug 10, 2024 by gordicaleksa
Add SwiGLU support
#718 opened Jul 29, 2024 by gordicaleksa