Automated PR: Downstream develop rebase new changes #71
Merged
Cemberk merged 1841 commits into main from tmp-main-20241114 on Nov 14, 2024
Commits
This pull request is big! We're only showing the most recent 250 commits
Commits on Jul 23, 2024
Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs (huggingface#31629)
Commits on Jul 24, 2024
adds: extra_repr() to MambaRMSNorm to include hidden size / size of weights in the layer (huggingface#32171)
fix: default value reflects the runtime environment variables rather than the ones present at import time. (huggingface#32153)
Commits on Jul 25, 2024
Allow a specific microphone to be used by the ffmpeg audio pipeline utility functions. Default to using the currently active microphone on Mac (huggingface#31846)
Commits on Jul 26, 2024
Add check for `target_sizes is None` in `post_process_image_guided_detection` for owlv2 (huggingface#31934)
[tests] fix `static` cache implementation is not compatible with `attn_implementation==flash_attention_2` (huggingface#32039)
Commits on Jul 27, 2024
Commits on Jul 29, 2024
Commits on Jul 30, 2024
fixes to properly shard FSDP across cpu and meta for cpu_efficient_loading for prequantized 4bit (huggingface#32276)
fixes huggingface#32329 : The Torch code is correct - to get an average of 10% o… (huggingface#32335)
Commits on Jul 31, 2024
>3-5x faster torch.compile forward compilation for autoregressive decoder models (huggingface#32227)
Commits on Aug 1, 2024
Check device map for saving tokenizer config on TPU (fix for issue huggingface#31971) (huggingface#32043)
Yell at the user if zero-3 init wasn't performed, but expected to have been done (huggingface#32299)
Commits on Aug 3, 2024
fix: (issue huggingface#32124) Exception raised when running `transformers/examples/flax/language-modeling/t5_tokenizer_model.py`. (huggingface#32157)
MixtralFlashAttention2: put "plus 1" inside parentheses when calculating rotary_seq_len, allowing None position_ids input. (huggingface#31500)
Commits on Aug 5, 2024
Bump keras from 2.8.0 to 2.13.1 in /examples/research_projects/decision_transformer (huggingface#32393)
Commits on Aug 6, 2024
Revert "fixes to properly shard FSDP across cpu and meta for cpu_effcient_loading for prequantized 4bit (huggingface#32276)" (huggingface#32477)
Commits on Aug 7, 2024
Commits on Aug 8, 2024
Commits on Aug 9, 2024
Commits on Aug 12, 2024
Fix `.push_to_hub(..., create_pr=True, revision="my-branch")` when creating PR on not-owned repo (huggingface#32094)
Bump aiohttp from 3.9.4 to 3.10.2 in /examples/research_projects/decision_transformer (huggingface#32569)
Bump tensorflow from 2.11.1 to 2.12.1 in /examples/research_projects/decision_transformer (huggingface#32341)
fix: Updated the `is_torch_mps_available()` function to include `min_version` argument (huggingface#32545)
Commits on Aug 13, 2024
Commits on Aug 14, 2024
Commits on Aug 16, 2024
Reduce the error log when using core models that need their weights renamed, and provide a step forward (huggingface#32656)
Commits on Aug 17, 2024
Commits on Aug 19, 2024
Commits on Aug 20, 2024
Commits on Aug 22, 2024
fix: (issue huggingface#32689) `AttributeError` raised when using `Trainer` with `eval_on_start=True` in Jupyter Notebook. (huggingface#32849)
🌐 [i18n-KO] Translated `knowledge_distillation_for_image_classification.md` to Korean (huggingface#32334)