Reinforcement learning towards broadly and persistently beneficial models
2 points
1 day ago
| 0 comments
| alignment.openai.com
| HN
No one has commented on this post.