Many SWE-bench-Passing PRs would not be merged
37 points
1 hour ago
| 1 comment
| metr.org
| HN
love2read
22 minutes ago
[-]
Seems to fail to mention that clearly documented AI-generated PR's (especially autonomously created ones) tend to have a much higher bar of acceptance, hinging on the reviewer's relationship with AI.

With this consideration, I submit that All of the SWE-bench-Passing PRs over a certain line count threshold would not be merged (if clearly noted as autonomous AI contributions).

reply