
Pull requests: huggingface/trl

Pull requests list

Packing with flash attn kwargs
#3526 opened Jun 2, 2025 by thepowerfuldeez
Enable Numba for FFD packing algorithm
#3524 opened Jun 2, 2025 by thepowerfuldeez
5 tasks
[Data] Added add_generation_prompt=False to chat template
#3522 opened Jun 2, 2025 by kashif
5 tasks
💆🏻‍♀️ RLOOV2
#3519 opened May 31, 2025 by shirinyamani
5 tasks
Push KTAE impl
#3518 opened May 30, 2025 by SamComber
5 tasks
🎀 New defaults: bf16=True
#3515 opened May 30, 2025 by qgallouedec
5 tasks
🎀 New defaults: logging_steps=10
#3514 opened May 30, 2025 by qgallouedec
5 tasks
intuit
#3513 opened May 29, 2025 by shirinyamani
5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry
3 of 5 tasks
Rearrange DPOTrainer
#3501 opened May 27, 2025 by DaizeDong
2 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching
[GKD] Use vllm for the teacher model
#3475 opened May 21, 2025 by kashif Draft
5 tasks
Add support for CB with native transformers
#3471 opened May 20, 2025 by ArthurZucker
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464
1 of 5 tasks
add support for image inputs in GRPO
#3460 opened May 16, 2025 by hellopahe
[SFT] add warning if dataset's input_ids exceed max_length
#3449 opened May 15, 2025 by HERIUN
1 of 5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
🛠️ quantization support for vllm generation
#3428 opened May 8, 2025 by shirinyamani
5 tasks
Reintroducing step method in ppo_trainer
#3410 opened May 3, 2025 by jskaf34
2 of 5 tasks