TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face's transformers library (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

nlp

566 Stars

61 Forks

566 Watchers

Python Language

MIT License

100 SrcLog Score

Cost to Build

$17.8K

Market Value

$50.6K

Growth over time

[Chart: Stars, Forks, and Watchers over time; 3 data points, 2023-02-01 → 2026-04-01]

How to clone TextRL

Clone via HTTPS

git clone https://github.com/voidful/TextRL.git

Clone via SSH

git clone git@github.com:voidful/TextRL.git

Download ZIP

Download master.zip
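
For scripted setups, the clone step above can be wrapped in a few lines of Python. This is only a minimal sketch: the helper names `clone_command` and `clone` and the default destination folder `TextRL` are illustrative assumptions, not part of the project. The `--depth 1` option is a standard git flag that fetches only the latest commit.

```python
import subprocess
from pathlib import Path

# Repository URLs for voidful/TextRL (HTTPS and SSH forms).
REPO_HTTPS = "https://github.com/voidful/TextRL.git"
REPO_SSH = "git@github.com:voidful/TextRL.git"


def clone_command(url, dest="TextRL", shallow=True):
    """Build the argument list for `git clone` (hypothetical helper).

    shallow=True adds `--depth 1` so only the most recent commit is fetched.
    """
    cmd = ["git", "clone"]
    if shallow:
        cmd += ["--depth", "1"]
    cmd += [url, dest]
    return cmd


def clone(url=REPO_HTTPS, dest="TextRL", shallow=True):
    """Clone the repository unless `dest` already exists (hypothetical helper).

    Returns the destination path. Raises CalledProcessError if git fails.
    """
    target = Path(dest)
    if target.exists():
        # Assume an existing folder is a previous checkout and leave it alone.
        return target
    subprocess.run(clone_command(url, dest, shallow), check=True)
    return target
```

Calling `clone()` uses the HTTPS URL; passing `url=REPO_SSH` switches to SSH, which requires an SSH key registered with GitHub.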

Found an issue?

Report bugs or request features on the TextRL issue tracker: https://github.com/voidful/TextRL/issues

Similar to TextRL

lectures, spaCy, HanLP, compromise, gensim, stanford-tensorflow-tutorials, nltk, awesome-nlp, TextBlob, ailearning, CoreNLP, ansj_seg, rasa, tensorflow_cookbook, allennlp, flashtext, TagUI, franc, mycroft-core, practical-pytorch, text_classification, nlp_tasks, DeepPavlov, snips-nlu, Awesome-pytorch-list, kcws, Awesome-Chinese-NLP, sentiment, prose, DeepLearn