-
Notifications
You must be signed in to change notification settings - Fork 33.6k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix Moonshine training-loss double-shift (train against labels, not labels[..., 1:])
#46784
opened Jun 20, 2026 by
Incheonkirin
Contributor
Loading…
Fix left-padding token selection in
BioGptForSequenceClassification
#46782
opened Jun 20, 2026 by
Sunt-ing
Contributor
Loading…
3 of 4 tasks
Fix QuantizedCache crash on hybrid (SSM + attention) models
#46781
opened Jun 20, 2026 by
Sunt-ing
Contributor
Loading…
3 of 6 tasks
[RT-DETRv2] Fix MPS crash in build_2d_sinusoidal_position_embedding (float64 → dtype)
#46780
opened Jun 20, 2026 by
Sahith59
Loading…
feat(examples): add minimal Tensor Parallelism training example with …
#46779
opened Jun 20, 2026 by
RithwikSharma
Loading…
6 tasks
Raise clear ValueError for empty conversation in apply_chat_template
#46778
opened Jun 19, 2026 by
punyamodi
Contributor
Loading…
Refine imports in timesformer __init__.py
#46777
opened Jun 19, 2026 by
Charansripadi
Loading…
4 of 6 tasks
Fix padding mask skipped for batch size 1 in linear-attention models
#46773
opened Jun 19, 2026 by
Sunt-ing
Contributor
Loading…
1 task done
[WIP] Support custom kernels for processing ops
#46771
opened Jun 19, 2026 by
molbap
Collaborator
Loading…
Add use_temp_cache_if_readonly decorator to testing_utils
#46768
opened Jun 19, 2026 by
ydshieh
Collaborator
Loading…
[
peft] Support key_mapping with PEFT models
#46766
opened Jun 19, 2026 by
tomaarsen
Member
Loading…
3 of 6 tasks
Add native masked MSE loss for Sapiens2ForPoseEstimation
#46764
opened Jun 19, 2026 by
Sainava
Loading…
5 of 6 tasks
Round the ue8m0 FP8 scale before quantizing so dequant matches the stored inverse
#46763
opened Jun 19, 2026 by
Incheonkirin
Contributor
Loading…
[Whisper] Add Unpack[TransformersKwargs] to forward() and set use_cache=False when labels provided
#46761
opened Jun 19, 2026 by
Sahith59
Loading…
Fix q_offset tensor causing wrong flex attention mask shape
#46757
opened Jun 18, 2026 by
abderahmane-ai
Loading…
fix: raise
ValueError for empty conversation in apply_chat_template
#46753
opened Jun 18, 2026 by
sharmax-vikas
Loading…
2 tasks done
[Offloading] Support full disk offloading
#46749
opened Jun 18, 2026 by
kylesayrs
Contributor
Loading…
Fix offloaded cache device mismatch on hybrid models
#46748
opened Jun 18, 2026 by
Sunt-ing
Contributor
Loading…
3 of 6 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.