Update1 Apr 2026

Claude API Gets 300K Output Tokens and Free Code Execution with Web Search

Anthropic raised the max output to 300K tokens on the Batch API for Opus 4.6 and Sonnet 4.6 — perfect for large code generation. Plus, code execution is now free when paired with web search or fetch.

Anthropic quietly shipped two significant API upgrades that expand what developers can build with Claude. The max_tokens cap on the Message Batches API has been raised to 300,000 tokens for Claude Opus 4.6 and Sonnet 4.6, enabled via the output-300k-2026-03-24 beta header. This is a game-changer for use cases requiring long-form output — large code generation tasks, structured data extraction, and comprehensive documentation can now be produced in a single API call.

The second upgrade is equally impactful: API code execution is now free when used alongside web search or web fetch tools. Sandboxed code execution allows Claude to run Python code during a conversation to process data, generate visualisations, or validate its own outputs. By making this free when combined with web tools, Anthropic is encouraging developers to build agentic workflows where Claude can search the web, fetch content, and immediately process it with code — all within a single API call at no additional cost.

Both features reflect Anthropic's strategy of removing friction from agentic development. The 300K output limit addresses a long-standing complaint from developers building code generation pipelines: previous output limits forced artificial chunking of large codebases. The free code execution removes a cost barrier that discouraged developers from letting Claude verify its own work — a pattern the leaked source code revealed is central to Claude Code's internal architecture.

For context engineers building production systems, these updates mean more ambitious single-pass code generation, cheaper agentic pipelines that combine search and computation, and fewer workarounds for output length limitations. The Batch API's 300K limit is particularly relevant for teams using Claude to generate or refactor entire modules.

Read original source

Join the Conversation

Discuss this with developers building with AI tools every day in the COR community.

Join Discord

Update

Apple Intelligence Finally Approved for China — Alibaba's Qwen to Power AI Features Across iOS, iPadOS, macOS and visionOS After 22-Month Regulatory Wait

Update

ASML Smashes Q2 Estimates on AI Chip Demand — Raises Full-Year Guidance to €43–45 Billion as Stock Surges 75% Year-to-Date and EUV Capacity Is Fully Booked Through 2027

Update

China's Anthropomorphic AI Rules Take Effect Today — ByteDance Doubao and Alibaba Qwen Forced to Disable Humanlike Agent Features as Beijing Draws Line on Emotional AI

Claude API Gets 300K Output Tokens and Free Code Execution with Web Search

Join the Conversation

Related Posts

Apple Intelligence Finally Approved for China — Alibaba's Qwen to Power AI Features Across iOS, iPadOS, macOS and visionOS After 22-Month Regulatory Wait

ASML Smashes Q2 Estimates on AI Chip Demand — Raises Full-Year Guidance to €43–45 Billion as Stock Surges 75% Year-to-Date and EUV Capacity Is Fully Booked Through 2027

China's Anthropomorphic AI Rules Take Effect Today — ByteDance Doubao and Alibaba Qwen Forced to Disable Humanlike Agent Features as Beijing Draws Line on Emotional AI

Claude API Gets 300K Output Tokens and Free Code Execution with Web Search

Join the Conversation

Related Posts

Apple Intelligence Finally Approved for China — Alibaba's Qwen to Power AI Features Across iOS, iPadOS, macOS and visionOS After 22-Month Regulatory Wait

ASML Smashes Q2 Estimates on AI Chip Demand — Raises Full-Year Guidance to €43–45 Billion as Stock Surges 75% Year-to-Date and EUV Capacity Is Fully Booked Through 2027

China's Anthropomorphic AI Rules Take Effect Today — ByteDance Doubao and Alibaba Qwen Forced to Disable Humanlike Agent Features as Beijing Draws Line on Emotional AI