FilterHN

Show HN: AA-Briefcase: a frontier knowledge work evaluation

11 points

by declanjackson

2 hours ago

| 2 comments

| artificialanalysis.ai

| HN

2 hours ago

[-]

the example submissions are really good comparisons, comparing Fable 5's submission to Opus 4 is fairly stark

brenton_on_news

1 hour ago

[-]

GLM hanging with the frontier big dogs