Azure AI Infrastructure Expands With NVIDIA at GTC

Summary

Microsoft announced major Azure AI updates at NVIDIA GTC, including expanded Microsoft Foundry capabilities, new Azure infrastructure for inference-heavy AI workloads, and deeper support for Physical AI. The changes matter because they help organizations build and run production-grade AI agents, prepare for next-generation NVIDIA systems, and extend AI into regulated and real-world operational environments.

Introduction

Microsoft used NVIDIA GTC to outline a broader Azure AI strategy focused on production-ready agents, inference-optimized infrastructure, and Physical AI. For IT leaders and platform teams, the announcement signals faster paths from AI experimentation to governed enterprise deployment.

What’s new in Microsoft Foundry

Microsoft expanded Microsoft Foundry, its enterprise AI platform for building, deploying, and operating agents at scale.

Key updates include:

  • Foundry Agent Service is now generally available for building AI agents that can reason, plan, and act across tools, data, and workflows.
  • Observability in the Foundry Control Plane is also generally available, giving teams better visibility into agent behavior and operations.
  • Voice Live API integration for Foundry Agent Service is now in public preview, enabling voice-first and multimodal real-time agent experiences.
  • A refreshed Microsoft Foundry portal is now generally available.
  • NVIDIA Nemotron models are now available in Microsoft Foundry, expanding the model catalog for enterprise AI workloads.

Microsoft also highlighted additional integrations with security and governance partners such as Palo Alto Networks Prisma AIRS and Zenity.

Azure AI infrastructure gets a major boost

Microsoft is also expanding Azure infrastructure to support inference-heavy, reasoning-based AI workloads.

Highlights include:

  • Microsoft says Azure is the first hyperscale cloud to power on NVIDIA Vera Rubin NVL72 systems, which are currently running in Microsoft labs.
  • Vera Rubin NVL72 is expected to roll out to Azure datacenters over the coming months.
  • Microsoft says it has already deployed hundreds of thousands of liquid-cooled Grace Blackwell GPUs globally in under a year.
  • Azure Local now has initial support for the NVIDIA Vera Rubin platform, helping customers in sovereign and regulated environments run advanced AI closer to where data resides.

Physical AI and digital twins on Azure

Microsoft and NVIDIA are also deepening their work on Physical AI, which extends AI into real-world operational environments.

New capabilities include:

  • A public Azure Physical AI Toolchain GitHub repository integrated with NVIDIA Physical AI Data Factory and core Azure services.
  • Deeper integration between Microsoft Fabric and NVIDIA Omniverse libraries.
  • Support for workflows that connect live operational data, digital twins, simulation, and AI-driven actions.

This is especially relevant for manufacturing, energy, and industrial operations where organizations want AI to move beyond dashboards into real-time decision support and automation.

Why this matters for IT admins

For Azure administrators and enterprise architects, these announcements point to three priorities:

  • Better tooling for moving AI agents from pilot to production
  • More infrastructure options for high-performance inference and reasoning workloads
  • Stronger support for regulated, sovereign, and edge scenarios with Azure-consistent governance

Next steps

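Organizations evaluating Azure AI should review the Microsoft Foundry features that are now generally available (Foundry Agent Service, observability in the Foundry Control Plane, and the refreshed portal), assess whether the NVIDIA Nemotron models fit their workloads, and watch for Azure availability of Vera Rubin NVL72. Teams in industrial sectors should also explore the new Physical AI Toolchain and the Fabric-Omniverse integration for digital twin and simulation use cases.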
