Завантаження...
Anthropic's Claude 4 Opus has set a new benchmark record on SWE-bench Verified, achieving 73.20% and establishing itself as the leading autonomous coding model.
73.20%
SWE-bench Performance
Note: This analysis was compiled by AI Power Rankings based on publicly available information. Metrics and insights are extracted to provide quantitative context for tracking AI tool developments.