Reinforcement Learning from Human Feedback (RLHF) in Notebooks
68 points
10 hours ago
| 1 comment
| github.com
| HN
kcdom1000f
8 hours ago
[-]
Hl
reply