Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

SFTTrainer to add support for IterableDataset
#1761 opened Jun 21, 2024 by helloworld1 Loading…
Issue #1751 Fix
#1754 opened Jun 18, 2024 by yash-srivastava19 Loading…
Fix GPT2 sentiment notebook reward
#1738 opened Jun 14, 2024 by cemiu Loading…
add Efficient Exact Optimization (EXO)
#1735 opened Jun 14, 2024 by haozheji Loading…
allow ref model use ds stage3 only
#1730 opened Jun 13, 2024 by gromzhu Loading…
rloo trainer with trainer callbacks
#1729 opened Jun 13, 2024 by mnoukhov Draft
Adding SimPO to TRL
#1725 opened Jun 11, 2024 by yumeng5 Loading…
Visual DPO
#1647 opened May 17, 2024 by qgallouedec Draft
Prototype Dataset Processor
#1646 opened May 16, 2024 by vwxyzjn Loading…
[DRAFT] Vllm integration
#1628 opened May 7, 2024 by vwxyzjn Draft
[WIP] Unify Policy Trainers
#1586 opened Apr 25, 2024 by lapp0 Draft
4 tasks
A pull request for POVIDTrainer
#1573 opened Apr 23, 2024 by gzcch Loading…
ProTip! Filter pull requests by the default branch with base:main.