Show HN: AA-Briefcase: a frontier knowledge work evaluation
11 points
2 hours ago
| 2 comments
| artificialanalysis.ai
| HN
mrdbourke
2 hours ago
[-]
the example submissions are really good comparisons, comparing Fable 5's submission to Opus 4 is fairly stark
reply
brenton_on_news
1 hour ago
[-]
GLM hanging with the frontier big dogs
reply