Azure

Claude Sonnet 4.6 in Microsoft Foundry: 1M Context

3 min read

Summary

Microsoft Foundry now offers Claude Sonnet 4.6, bringing a 1 million token context window in beta, up to 128K output, and adaptive reasoning controls designed for enterprise AI workloads. This matters because it gives organizations a managed Azure-based option for handling large codebases, complex document analysis, and agent workflows while balancing quality, latency, and cost more efficiently than premium-tier models.

Audio Summary

0:00--:--
Need help with Azure?Talk to an Expert

Introduction: why this matters

Claude Sonnet 4.6 is now available in Microsoft Foundry, giving IT and engineering teams another “frontier-class” model option that targets large-scale enterprise workloads—without always needing the premium cost profile of top-tier models. For organizations building developer copilots, knowledge-work assistants, and automation agents, Sonnet 4.6 aims to balance intelligence, throughput, and governance-friendly deployment in a managed environment.

What’s new in Claude Sonnet 4.6 (in Foundry)

Massive context + large outputs

  • 1 million token context window (beta), matching the extended context tier called out for Claude Opus 4.6.
  • Up to 128K maximum output, enabling longer generated artifacts (reports, code changes, multi-step plans) in a single response.
  • Practical upside: less “context chopping” for large codebases, long financial models, multi-document review, and long-running multi-turn agent workflows.

Adaptive thinking and “effort” controls

  • Sonnet 4.6 introduces adaptive thinking, allowing the model to apply deeper reasoning only when needed.
  • Effort parameters provide a way to tune the quality–latency–cost tradeoff for different tasks (e.g., quick triage vs. high-assurance analysis).

Developer-focused upgrade (vs. Sonnet 4.5)

Microsoft highlights Sonnet 4.6 as a direct upgrade to Sonnet 4.5 with minimal prompting changes required for most workflows. Improvements emphasized include:

  • Stronger reasoning across broader code context
  • Better understanding of complex codebases
  • More reliable performance across iterative development cycles (feature build, refactor, debug, refine)

Improved knowledge work at scale

For enterprise content and analysis workflows, Sonnet 4.6 targets fewer revision cycles for:

  • Report drafting and refinement
  • Summarizing large document sets
  • Structured business documentation
  • Presentation and narrative generation

“Computer use” for browser automation

Sonnet 4.6 is described as Anthropic’s most capable computer use model yet, scoring 72.5% on OSWorld Verified. Key enterprise implications:

  • Automates browser-based tasks without depending on an API (useful for legacy and SaaS tools with limited integration).
  • Improved UI interaction precision (e.g., difficult click targets).
  • Can move across web apps and complete multi-step tasks (e.g., read context, respond to a message, create a calendar event).

Impact for IT admins and platform teams

  • Architecture choices: Sonnet 4.6 can serve as a cost-efficient default model for high-volume chat, document workflows, or as a sub-agent in multi-model pipelines.
  • Automation reach: Browser-based automation expands what’s possible where APIs don’t exist, but increases the need for guardrails, auditing, and least-privilege access.
  • Operational governance: Deploying via Microsoft Foundry positions teams to align model usage with enterprise expectations for governance, compliance, and operational tooling.

Action items / next steps

  1. Pilot Sonnet 4.6 in Foundry for one high-context scenario (e.g., repo-wide code assistant, policy/document analysis) to validate the 1M context benefits.
  2. Define “effort” profiles (low/medium/high) mapped to task classes to control cost and latency.
  3. Evaluate computer-use automations in a sandbox: start with non-destructive workflows (read-only validation, QA UI checks) before enabling transactional actions.
  4. Attend Model Mondays (Feb 23) for guidance from Anthropic on Opus 4.6 and Sonnet 4.6 architectures and enterprise deployment patterns.

Need help with Azure?

Our experts can help you implement and optimize your Microsoft solutions.

Talk to an Expert

Stay updated on Microsoft technologies

Azure AI FoundryClaude Sonnet 4.6generative AIagentsenterprise automation

Related Posts

Azure

Microsoft The Shift Podcast on Agentic AI Challenges

Microsoft has launched a new season of The Shift podcast focused on agentic AI, with eight weekly episodes exploring how AI agents use data, coordinate with each other, and depend on platforms like Postgres, Microsoft Fabric, and OneLake. The series matters because it highlights that deploying agents in enterprises is not just about models—it requires rethinking architecture, governance, security, and IT workflows across the full Azure and data stack.

Azure

Azure Agentic AI for Regulated Industry Modernization

Microsoft says Azure combined with agentic AI can help regulated industries modernize legacy systems faster by automating workload assessment, migration, and ongoing operations while maintaining compliance. The update matters because it positions cloud migration as more than a cost-saving exercise: for sectors like healthcare and other highly regulated industries, it is increasingly essential for resilience, governance, and readiness to deploy AI at scale.

Azure

Fireworks AI on Microsoft Foundry for Azure Inference

Microsoft has launched a public preview of Fireworks AI on Microsoft Foundry, bringing high-throughput, low-latency open-model inference to Azure through a single managed endpoint. It matters because enterprises can now access models like DeepSeek V3.2, gpt-oss-120b, Kimi K2.5, and MiniMax M2.5 with Azure’s governance, serverless or provisioned deployment options, and bring-your-own-weights support—making it easier to move open-model AI from experimentation into production.

Azure

Azure Copilot Migration Agent for App Modernization

Microsoft has introduced new public preview modernization agents in Azure Copilot and GitHub Copilot to help organizations automate migration and application transformation across discovery, assessment, planning, deployment, and code upgrades. The announcement matters because it aims to turn complex, fragmented modernization work into a coordinated AI-assisted workflow, helping enterprises move legacy infrastructure and applications to Azure faster and with clearer cost, dependency, and prioritization insights.

Azure

Azure IaaS Resource Center for Resilient Infrastructure

Microsoft has introduced the Azure IaaS Resource Center, a centralized hub for infrastructure teams to find design guidance, demos, architecture resources, and best practices for compute, storage, and networking. The launch matters because it reinforces Azure IaaS as a unified platform for building resilient, high-performance, and cost-optimized infrastructure, helping organizations better support everything from traditional business apps to AI workloads.

Azure

Microsoft Foundry ROI Study Shows 327% Enterprise AI Gains

A Forrester Total Economic Impact study commissioned around Microsoft Foundry found that a modeled enterprise could achieve 327% ROI over three years, break even in about six months, and realize $49.5 million in benefits from productivity and infrastructure savings. The results matter because they highlight how much enterprise AI costs are driven by developer time and fragmented tooling, suggesting that a unified platform like Foundry can help IT teams accelerate AI delivery while improving governance and efficiency.