Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
huggingface
/
trl
Public
generated from
fastai/nbdev_template
Notifications
You must be signed in to change notification settings
Fork
2.5k
Star
17.2k
Code
Issues
555
Pull requests
96
Discussions
Actions
Projects
0
Security
0
Insights
Additional navigation options
Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights
Actions: huggingface/trl
Actions
All workflows
All workflows
Actions
Loading...
Loading
Sorry, something went wrong.
Uh oh!
There was an error while loading.
Please reload this page
.
Showing runs from all workflows
31,912 workflow runs
31,912 workflow runs
Event
Filter by Event
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching events.
Status
Filter by Status
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching statuses.
Branch
Filter by Branch
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching branches.
Actor
Filter by Actor
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching users.
Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies
#426:
Scheduled
25m 55s
main
main
25m 55s
View workflow file
Cleanup Cache
Cleanup Cache
#1043:
Scheduled
15s
main
main
15s
View workflow file
[Experimental] Add SDFT trainer, config, docs, and tests
Build PR Documentation
#13340:
Pull request
#4941
opened by
Shekswess
Action required
Shekswess:feature/sdft-trainer
Shekswess:feature/sdft-trainer
Action required
View #4941
View workflow file
[Experimental] Add SDFT trainer, config, docs, and tests
Tests (experimental)
#691:
Pull request
#4941
opened by
Shekswess
Action required
Shekswess:feature/sdft-trainer
Shekswess:feature/sdft-trainer
Action required
View #4941
View workflow file
SDFT: Self-Distillation Fine-Tuning Trainer
Hugging Face Issue Labeler
#931:
Issue
#4940
opened by
Shekswess
38s
38s
View workflow file
NeMo-Gym Integration
Build PR Documentation
#13339:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
NeMo-Gym Integration
Tests
#14308:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
NeMo-Gym Integration
Build PR Documentation
#13338:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
NeMo-Gym Integration
Tests
#14307:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
NeMo-Gym Integration
Build PR Documentation
#13337:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
NeMo-Gym Integration
Tests
#14306:
Pull request
#4848
synchronize by
cmunley1
Action required
cmunley1:cmunley1/nemo_gym_on_policy
cmunley1:cmunley1/nemo_gym_on_policy
Action required
View #4848
View workflow file
Copilot code review
Copilot code review
#11:
by
Copilot
AI
2m 50s
refs/pull/4938/head
refs/pull/4938/head
2m 50s
GRPO Reward Function documentation and usage mismatch
Hugging Face Issue Labeler
#930:
Issue
#4939
opened by
amit9oct
42s
42s
View workflow file
Update RewardFunc type to use RewardCallable protocol
Build PR Documentation
#13336:
Pull request
#4938
opened by
amit9oct
Action required
amit9oct:patch-1
amit9oct:patch-1
Action required
View #4938
View workflow file
Update RewardFunc type to use RewardCallable protocol
Tests
#14305:
Pull request
#4938
opened by
amit9oct
Action required
amit9oct:patch-1
amit9oct:patch-1
Action required
View #4938
View workflow file
Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies
#425:
Scheduled
25m 14s
main
main
25m 14s
View workflow file
Cleanup Cache
Cleanup Cache
#1042:
Scheduled
17s
main
main
17s
View workflow file
documentation for modifying chat templates for assistant-only loss
Build PR Documentation
#13335:
Pull request
#4937
synchronize by
jiosephlee
Action required
jiosephlee:chat_template_docs
jiosephlee:chat_template_docs
Action required
View #4937
View workflow file
documentation for modifying chat templates for assistant-only loss
Build PR Documentation
#13334:
Pull request
#4937
opened by
jiosephlee
Action required
jiosephlee:chat_template_docs
jiosephlee:chat_template_docs
Action required
View #4937
View workflow file
Upload PR Documentation
Upload PR Documentation
#9764:
completed by
qgallouedec
33s
33s
View workflow file
Refactor DPO
Build PR Documentation
#13333:
Pull request
#3906
synchronize by
qgallouedec
2m 19s
refactor-dpo
refactor-dpo
2m 19s
View #3906
View workflow file
Refactor DPO
Tests
#14304:
Pull request
#3906
synchronize by
qgallouedec
29m 47s
refactor-dpo
refactor-dpo
29m 47s
View #3906
View workflow file
Automatic Dependency Submission (Python)
Automatic Dependency Submission
#2245:
by
github-advanced-security
bot
2m 36s
refactor-dpo
refactor-dpo
2m 36s
View #3906
Fix ipo normalization
Secret Leaks
#7268:
Commit
dd1e074
pushed by
qgallouedec
18s
refactor-dpo
refactor-dpo
18s
View #3906
View workflow file
Upload PR Documentation
Upload PR Documentation
#9763:
completed by
LeonEricsson
28s
28s
View workflow file
Previous
1
2
3
4
5
…
1276
1277
Next
You can’t perform that action at this time.