Anthropic Releases Claude Opus 4.5 for Advanced Coding

November 28, 2025

Anthropic’s new flagship model, Claude Opus 4.5, excels in professional coding workflows and agent orchestration. It scores over 80% on SWE-bench Verified, the highest score on this benchmark of real-world GitHub issues. The result points to strong end-to-end software engineering capability with minimal human intervention.

Benchmark Leadership and Coding Excellence

Opus 4.5 surpasses competitors on SWE-bench Verified, Terminal-Bench, and OSWorld, with strengths in planning, tool use, bug finding, and producing diffs that pass tests. Better multi-step planning and follow-through improve dependency analysis, multi-file refactors, and test authoring.

Advanced Agent and Computer Use Capabilities

Multi-agent coordination for parallel workstreams (fix tests while researching PRs)
Zoom action for detailed screen inspection of UI elements and fine text
Browser automation via Claude for Chrome (summarize tabs, extract data)
Excel integration for data cleaning, formulas, pivot tables, and outlier detection
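
The idea of parallel workstreams can be illustrated with a small concurrency sketch. The code below does not call any Anthropic API; `run_workstream` and `coordinate` are hypothetical stand-ins, and each "agent" simply walks through a list of step names. It only shows the coordination pattern: start several workstreams together and collect their results in a fixed order.

```python
import asyncio


async def run_workstream(name: str, steps: list[str]) -> dict:
    """Hypothetical stand-in for one agent working through its steps."""
    completed = []
    for step in steps:
        # Yield control, as a real agent would while waiting on a tool or model call.
        await asyncio.sleep(0)
        completed.append(step)
    return {"workstream": name, "completed": completed}


async def coordinate(workstreams: dict[str, list[str]]) -> list[dict]:
    """Run all workstreams concurrently; results come back in input order."""
    tasks = [run_workstream(name, steps) for name, steps in workstreams.items()]
    return await asyncio.gather(*tasks)


# Example workstreams taken from the article: fix tests while researching PRs.
results = asyncio.run(
    coordinate(
        {
            "fix-tests": ["reproduce failure", "patch code", "rerun tests"],
            "research-prs": ["list open PRs", "summarize changes"],
        }
    )
)

for result in results:
    print(result["workstream"], "finished", len(result["completed"]), "steps")
```

In a real setup each workstream would wrap model and tool calls; the orchestrating code stays the same shape.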

Persistent Memory for Long-Running Projects

A 200k-token context window, combined with automatic preservation of thinking blocks, maintains continuity across extended sessions. Users can return to codebases, specs, or audits days later with full context awareness, reducing the need to re-establish context each time.

Practical Workflow Improvements

Multi-session desktop app for concurrent tasks
Effort parameter control (low/medium/high) balances speed, cost, and capability
67% price reduction vs previous Opus models
Production-ready integrations with GitHub Copilot, Amazon Bedrock, Vertex AI
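
To make the 67% figure concrete, here is a minimal arithmetic sketch. The baseline prices and token counts are placeholder values chosen for illustration, not figures from this article; the function names are hypothetical as well. Only the 67% reduction comes from the text above.

```python
def reduced_price(baseline: float, reduction_pct: float = 67.0) -> float:
    """Price after applying a percentage reduction to a baseline price."""
    return baseline * (1 - reduction_pct / 100)


def job_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_mtok: float,
    output_price_per_mtok: float,
) -> float:
    """Cost of one job, given prices quoted per million tokens."""
    return (
        input_tokens / 1_000_000 * input_price_per_mtok
        + output_tokens / 1_000_000 * output_price_per_mtok
    )


# Placeholder baseline prices (per million tokens); substitute real pricing.
BASELINE_INPUT = 15.0
BASELINE_OUTPUT = 75.0

# Placeholder workload: 2M input tokens and 0.5M output tokens.
old_cost = job_cost(2_000_000, 500_000, BASELINE_INPUT, BASELINE_OUTPUT)
new_cost = job_cost(
    2_000_000,
    500_000,
    reduced_price(BASELINE_INPUT),
    reduced_price(BASELINE_OUTPUT),
)

print(f"before: {old_cost:.2f}  after: {new_cost:.2f}")
```

Because the same percentage applies to both input and output prices, the total cost of any workload drops by that same percentage, whatever the input/output mix.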

Impact for Engineering Teams

Engineering teams can start with a pilot on internal repositories, measure suggestion acceptance rates, and then scale up to CI-assisted patching. Business users benefit from parallel browser and spreadsheet automation, which reduces context switching in ops, support, and analysis workflows.
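
A pilot needs a simple, agreed-upon metric. The sketch below shows one way to compute a suggestion acceptance rate, overall and per repository. The `Suggestion` record and its fields are assumptions made for this example; a real pilot would populate them from review tooling or commit history.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Suggestion:
    """Hypothetical record of one model-suggested change during a pilot."""
    repo: str
    accepted: bool


def acceptance_rate(suggestions: list[Suggestion]) -> float:
    """Fraction of suggestions that reviewers accepted (0.0 if there are none)."""
    if not suggestions:
        return 0.0
    return sum(s.accepted for s in suggestions) / len(suggestions)


def acceptance_by_repo(suggestions: list[Suggestion]) -> dict[str, float]:
    """Acceptance rate broken down by repository."""
    grouped: dict[str, list[Suggestion]] = defaultdict(list)
    for s in suggestions:
        grouped[s.repo].append(s)
    return {repo: acceptance_rate(items) for repo, items in grouped.items()}


# Made-up pilot data for illustration only.
pilot = [
    Suggestion("billing-service", True),
    Suggestion("billing-service", False),
    Suggestion("billing-service", True),
    Suggestion("internal-tools", True),
]

print(f"overall: {acceptance_rate(pilot):.0%}")
for repo, rate in acceptance_by_repo(pilot).items():
    print(f"{repo}: {rate:.0%}")
```

Tracking the rate per repository helps a team decide where CI-assisted patching is ready to scale and where suggestions still need closer review.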
