News

Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, ...
Only three months later, Anthropic is upping the ante further by launching the highly anticipated Claude Opus 4.1, which now takes its predecessor's crown as Anthropic's most advanced model.
Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 and 72.5% with Claude Opus 4.
Anthropic's Claude Opus 4.1 achieves 74.5% on coding benchmarks, leading the AI market, but faces risk as nearly half its $3.1B API revenue depends on just two customers.
The earthquake occurred at 9:32 a.m. at a depth of about 3.3 miles and was followed by a magnitude 3.1 aftershock about four minutes later in Rialto, according to USGS.
Developers can access the updated model using the claude-opus-4-1-20250805 API tag. New high score in real-world programming benchmark Claude Opus 4.1 sets a record on the SWE-bench Verified benchmark ...