Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

huggingface / trl Public

generated from fastai/nbdev_template

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 8.6k

Code
Issues 51
Pull requests 18
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: huggingface/trl

Labels 15 Milestones 0

Labels 15 Milestones 0

New pull request New

Clear current search query, filters, and sorts

18 Open 787 Closed

18 Open 787 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[WIP] Unify Policy Trainers

#1586 opened Apr 25, 2024 by lapp0 • Draft

4 tasks

16

[WIP] Add WinRateCallback

#1598 opened Apr 29, 2024 by lewtun • Draft

2 of 5 tasks

2

Adds Online DPO

#1605 opened Apr 30, 2024 by edbeeching • Draft

2

[DRAFT] Vllm integration

#1628 opened May 7, 2024 by vwxyzjn • Draft

1

Prototype Dataset Processor

#1646 opened May 16, 2024 by vwxyzjn

Loading…

4

Adding SimPO to TRL

#1725 opened Jun 11, 2024 by yumeng5

Loading…

6

rloo and ppov2 trainer with trainer callbacks

#1729 opened Jun 13, 2024 by mnoukhov

Loading…

3

allow ref model use ds stage3 only

#1730 opened Jun 13, 2024 by gromzhu

Loading…

10

Fix GPT2 sentiment notebook reward

#1738 opened Jun 14, 2024 by cemiu

Loading…

4

Issue #1751 Fix

#1754 opened Jun 18, 2024 by yash-srivastava19

Loading…

6

Add ppov2 sentiment example (as a replacement to imdb example)

#1759 opened Jun 21, 2024 by vwxyzjn

Loading…

1

1

SFTTrainer to add support for IterableDataset

#1761 opened Jun 21, 2024 by helloworld1

Loading…

1

[Code Improvement] Support concatnate forward in reward trainer

#1769 opened Jun 24, 2024 by 1485840691

Loading…

4

Add SRPO algorithm.

#1772 opened Jun 25, 2024 by frasermince • Draft

1

fix model to save in ppov2

#1776 opened Jun 26, 2024 by mnoukhov

Loading…

1

#1779 opened Jun 27, 2024 by kashif • Draft

Fix start index under batched_forward_pass

#1782 opened Jun 27, 2024 by mertsayar8

Loading…

2

[SFT] add model_init_kwargs to training_args

#1787 opened Jun 28, 2024 by kashif

Loading…

1

ProTip! What’s not been updated in a month: updated:<2024-05-28.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.