AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.

agent

View on GitHub Website

393 Stars

10 Forks

393 Watchers

apache-2.0 License

100 SrcLog Score

Cost to Build

$80.2K

Market Value

$264.1K

How is this calculated?

Growth over time

3 data points · 2026-04-11 → 2026-04-26

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about AI-Can-Learn-Scientific-Taste

Question copied to clipboard

What is the tongjingqi/AI-Can-Learn-Scientific-Taste GitHub project? Description: "We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.". Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone AI-Can-Learn-Scientific-Taste

Clone via HTTPS

git clone https://github.com/tongjingqi/AI-Can-Learn-Scientific-Taste.git

Clone via SSH

[email protected]:tongjingqi/AI-Can-Learn-Scientific-Taste.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the AI-Can-Learn-Scientific-Taste issue tracker:

Open GitHub Issues

Similar to AI-Can-Learn-Scientific-Taste

netdata huginn pinpoint amon scouter dd-agent egjs merlin PyGame-Learning-Environment inspectIT goappmonitor mario-ai zappr covertutils Recaf deep-trading-agent trezor-agent PKURemote jvm-profiler stackimpact-go sematext-agent-docker opennars docker-inbound-agent xnumon fusioninventory-agent outis pi-web-agent logdna-agent sarl perfmon-agent