Reinforcement Learning from Human Feedback
133 points
1 month ago
| 3 comments
| rlhfbook.com
| HN
https://arxiv.org/abs/2504.12501
verdverm
1 month ago
[-]
Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
reply
leggerss
1 month ago
[-]
You could say he's also learning from human feedback
reply
dang
1 month ago
[-]
Related. Others?

RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)

reply
klelatti
1 month ago
[-]
Web version with links, etc:

https://rlhfbook.com/

reply
dang
1 month ago
[-]
Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
reply