Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove issue labeller
#6052 opened Jun 13, 2026 by qgallouedec Member Loading…
chore: update tests_transformers_branch.yml
#6051 opened Jun 13, 2026 by hf-security-analysis Bot Loading…
chore: update pr_style_bot.yml
#6050 opened Jun 13, 2026 by hf-security-analysis Bot Loading…
chore: update docker-build.yml
#6048 opened Jun 13, 2026 by hf-security-analysis Bot Loading…
chore: update clear_cache.yml
#6047 opened Jun 13, 2026 by hf-security-analysis Bot Loading…
Remove redundant .contiguous() calls
#6045 opened Jun 13, 2026 by qgallouedec Member Loading…
fix: preserve OnlineDPO vLLM completion ids
#6038 opened Jun 13, 2026 by he-yufeng Loading…
4 of 8 tasks
fix: load image-text policy for async grpo
#6032 opened Jun 12, 2026 by he-yufeng Loading…
5 of 8 tasks
fix: pass AsyncGRPO environment rewards
#6031 opened Jun 12, 2026 by he-yufeng Loading…
5 of 8 tasks
Remove silently-ignored W&B/Hub fields from GOLD and Distillation configs
#6023 opened Jun 11, 2026 by DaoyuanLi2816 Contributor Loading…
3 of 4 tasks
Align AsyncGRPO clip-ratio metrics with GRPOTrainer
#6021 opened Jun 11, 2026 by qgallouedec Member Loading…
Align epsilon help/docstring wording
#6014 opened Jun 11, 2026 by qgallouedec Member Loading…
Align async GRPO loss variable names with GRPOTrainer
#6013 opened Jun 11, 2026 by qgallouedec Member Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.