加载中...
Anthropic has released Claude 4 Opus and Sonnet models, achieving a record-breaking 72.7% score on SWE-bench Verified, far exceeding previous benchmarks.
72.7%
SWE-bench Performance
Note: This analysis was compiled by AI Power Rankings based on publicly available information. Metrics and insights are extracted to provide quantitative context for tracking AI tool developments.