> One well-documented but underappreciated flaw of artificial intelligence models is that they tend to favor work produced by artificial intelligence. A 2025 paper in The Proceedings of the National Academy of Sciences found that several large language models had a low opinion of text written by humans, creating a “potentially consequential form of implicit ‘anti-human’ bias.”
Great. Now AI is judging humans' work (instead of the opposite).
Who will eval the AI evals??