TextRL

TextRL

voidful

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

459 Stars
56 Forks
459 Watchers
Python Language
mit License
Cost to Build
$19.8K
Market Value
$43.8K

Growth over time

2 data points  ·  2023-02-15 → 2023-07-07
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about TextRL

Question copied to clipboard

What is the voidful/TextRL GitHub project? Description: "Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone TextRL

Clone via HTTPS

git clone https://github.com/voidful/TextRL.git

Clone via SSH

[email protected]:voidful/TextRL.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the TextRL issue tracker:

Open GitHub Issues