TextRL

TextRL

voidful

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

566 Stars
61 Forks
566 Watchers
Python Language
mit License
100 SrcLog Score
Cost to Build
$17.8K
Market Value
$50.6K

Growth over time

3 data points  ·  2023-02-01 → 2026-04-01
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about TextRL

Question copied to clipboard

What is the voidful/TextRL GitHub project? Description: "Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone TextRL

Clone via HTTPS

git clone https://github.com/voidful/TextRL.git

Clone via SSH

[email protected]:voidful/TextRL.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the TextRL issue tracker:

Open GitHub Issues