The startup says Claude Sonnet 4.5 is the world's best model for AI coding tasks, and a leap forward in applied artificial intelligence.
Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 hours on complex tasks.
Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of the SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...