Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Experimental] Add SDFT trainer, config, docs, and tests
#4941 opened Jan 31, 2026 by Shekswess Loading…
4 of 5 tasks
Update RewardFunc type to use RewardCallable protocol
#4938 opened Jan 31, 2026 by amit9oct Loading…
2 of 5 tasks
Update wordle.py example with masking of env tokens
#4895 opened Jan 26, 2026 by sergiopaniego Loading…
5 tasks
Expose generation index to tool callables in GRPOTrainer
#4894 opened Jan 25, 2026 by lukehinds Loading…
4 tasks done
Upgrade GitHub Actions to latest versions
#4893 opened Jan 24, 2026 by salmanmkc Loading…
[GRPO] feat: Geometric Sequence Masking
#4891 opened Jan 24, 2026 by LeonEricsson Loading…
5 tasks
Fix grpo tool calling
#4890 opened Jan 23, 2026 by akshayballal95 Loading…
2 tasks done
fix(vLLM): Add tool calling support to VLLMClient.chat()
#4889 opened Jan 23, 2026 by kansalaman Loading…
1 of 2 tasks
NeMo-Gym Integration
#4848 opened Jan 17, 2026 by cmunley1 Loading…
make dpo compatible with fsdp2
#4838 opened Jan 16, 2026 by flutist Loading…
4 of 5 tasks
feat: Support log_completion for swanlab backend
#4826 opened Jan 14, 2026 by ZiyiTsang Loading…
2 of 5 tasks
forward_masked_logits in SFTTrainer
#4794 opened Jan 8, 2026 by qgallouedec Draft
5 tasks
make dpo compatible with qwen3vl
#4773 opened Jan 4, 2026 by flutist Loading…
ProTip! Updated in the last three days: updated:>2026-01-29.