By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
    New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life
    March 18, 2026
    Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
    Semiconductor Capex Risk Grows as India Expands Fabs
    March 18, 2026
    A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
    Google Gemini Workspace Features: Powerful AI Upgrades
    March 18, 2026
    A dark street in Havana, Cuba, entirely without power during a nationwide electrical grid collapse, illuminated only by faint flashlights and headlights.
    Cuba Blackout: Nationwide Grid Collapses Amid U.S. Blockade
    March 18, 2026
    A digital artificial intelligence network mapped over a flooded city street, representing AI flood forecasting technology.
    Google Transforms AI Flood Forecasting Using 5 Million News Articles
    March 18, 2026
  • Technology
    TechnologyShow More
    Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
    Semiconductor Capex Risk Grows as India Expands Fabs
    March 18, 2026
    A glowing smartphone screen showing an artificial intelligence chat interface on a dark desk, representing AI chatbot safety concerns.
    AI Chatbot Safety Concerns Mount Amid Lawsuits and Violence
    March 18, 2026
    A modern corporate glass building at dusk with a blue artificial intelligence hologram glowing above it.
    Meta Shares Jump as Zuckerberg Weighs Major Layoffs to Offset AI Spending
    March 18, 2026
    A professional news-style image showing an iPhone, a thin laptop, and a large desktop display arranged on a clean studio desk.
    Apple 2026 Roadmap Adds iPhone 17e, M5 MacBook Air
    March 17, 2026
    A leather-bound encyclopedia and dictionary resting on a wooden desk in front of a glowing digital screen displaying AI data networks, representing the legal clash between traditional publishers and artificial intelligence.
    Encyclopedia Britannica and Merriam-Webster Sue OpenAI Over AI Training Data
    March 17, 2026
  • AI
    AIShow More
    A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
    Google Gemini Workspace Features: Powerful AI Upgrades
    March 18, 2026
    A modern corporate boardroom featuring a glowing holographic interface representing enterprise AI agents managing data and workflows.
    Enterprise AI Agents: Microsoft & Nvidia Lead the Race
    March 18, 2026
    A high-tech conference stage featuring a large illuminated screen displaying glowing artificial intelligence and autonomous vehicle graphics.
    Nvidia GTC 2026: AI Revenue and Robotaxi Expansion
    March 18, 2026
    A sleek Nvidia graphics card with green LED lighting on a dark high-tech desk in front of blurred gaming monitors.
    Nvidia DLSS 5: AI-Powered Photorealism for PC Games
    March 17, 2026
    Diverse tech professionals collaborating on artificial intelligence projects in a modern, brightly lit startup accelerator workspace.
    Google and Accel AI Startups Join 2026 Atoms Cohort
    March 17, 2026
  • Science
    ScienceShow More
    Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
    New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life
    March 18, 2026
    A digital artificial intelligence network mapped over a flooded city street, representing AI flood forecasting technology.
    Google Transforms AI Flood Forecasting Using 5 Million News Articles
    March 18, 2026
    A bright fireball meteor soaring over a suburban neighborhood during the day, leaving a glowing, fiery trail in the clear blue sky above residential rooftops.
    Ohio Meteor Boom: Daylight Fireball Triggers Massive Shock Wave
    March 18, 2026
    A microscopic 3D rendering of glowing intelectin-2 proteins reinforcing a mucus barrier and neutralizing harmful bacteria in the human gut.
    MIT Scientists Discover Gut Protein That Kills Bacteria
    March 17, 2026
    A glowing microscopic antibody illuminating a cluster of tumor cells in a dark medical laboratory environment.
    Scientists Unveil Cancer Flashlight for Tumor Detection
    March 17, 2026
  • World
    WorldShow More
    A dark street in Havana, Cuba, entirely without power during a nationwide electrical grid collapse, illuminated only by faint flashlights and headlights.
    Cuba Blackout: Nationwide Grid Collapses Amid U.S. Blockade
    March 18, 2026
    Nighttime rescue operations underway at the destroyed Omid Addiction Treatment Hospital in Kabul following a devastating airstrike, with first responders searching the rubble using flashlights.
    Pakistan Airstrike on Kabul Hospital Leaves Hundreds Dead Amid Escalating Tensions
    March 18, 2026
    A large commercial oil tanker anchored near an illuminated coastal energy hub at dusk.
    Strait of Hormuz Crisis: Oil Spikes & US Diesel Tops $5
    March 18, 2026
    Rugged, dusty mountain terrain in Somalia under dawn lighting, representing the remote locations of recent military operations.
    U.S. Airstrikes in Somalia Double Amid Major Offensives Against ISIS and Al-Shabaab
    March 17, 2026
    A Ugandan political opposition leader in a suit and red beret speaks passionately into a microphone in a dimly lit, undisclosed room.
    Ugandan Opposition Leader Bobi Wine Flees Into Exile Following Disputed Election
    March 17, 2026
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Microsoft Maia 200 AI chip boosts Azure inference
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

Microsoft Maia 200 AI chip boosts Azure inference

Rakesh Paul
Last updated: 27/01/2026
Rakesh Paul
Share
8 Min Read
Hyper-realistic image of a modern data center with glowing server racks and a Maia 200 AI accelerator card highlighted in the foreground.

Microsoft has unveiled its Maia 200 AI accelerator, a custom chip built to make running large AI models faster and more cost-efficient across its Azure cloud. The company is positioning Maia 200 as a breakthrough for AI inference, the stage where trained models answer user prompts and power real-world applications like chatbots and productivity tools.

Contents
A custom AI chip focused on inferencePerformance numbers and comparisonsMemory, networking and system designRole in Azure and Microsoft AI servicesDeveloper tools and future roadmap

Maia 200 is designed specifically for large-scale inference rather than training, and Microsoft says it delivers better performance per dollar than the hardware it currently uses in its data centers. It will power services such as Microsoft 365 Copilot, Azure AI Foundry, and models from the Microsoft Superintelligence team, including OpenAI’s latest GPT‑5.2 family.

A custom AI chip focused on inference

Microsoft describes Maia 200 as a “breakthrough inference accelerator” tuned for the heavy workloads of modern reasoning and language models. Unlike some rival chips that are built to handle both training and inference, Microsoft and industry analysts say this design is optimized for the production side of AI, where efficiency and throughput matter most.

The chip is fabricated on Taiwan Semiconductor Manufacturing Company’s 3‑nanometer process and contains more than 140 billion transistors. According to Microsoft, Maia 200 is its most performant first‑party silicon yet and the most efficient inference system it has deployed, delivering 30% better performance per dollar than the latest generation of hardware in its fleet.

Performance numbers and comparisons

Each Maia 200 accelerator can deliver more than 10 petaFLOPS of compute at 4‑bit precision (FP4) and over 5 petaFLOPS at 8‑bit precision (FP8), within a 750‑watt system‑on‑chip power envelope. Microsoft says this is enough for a single chip to run today’s largest AI models while still leaving room for even bigger models in the future.

Microsoft also draws direct comparisons with rival cloud providers’ in‑house AI chips. The company claims Maia 200 delivers three times the FP4 performance of Amazon’s third‑generation Trainium and FP8 performance above Google’s seventh‑generation TPU. Analysts note that Maia 200 uses a more advanced 3‑nanometer manufacturing node than the 5‑nanometer or 7‑nanometer processes used in these competing chips, and say it shows Microsoft is closing earlier gaps in custom silicon.

Microsoft and external commentators emphasize that customers will still need to validate real‑world performance and pricing within Azure before shifting workloads from other vendors, including Nvidia. One analyst also points out that enterprises will want to see how much of Microsoft’s own infrastructure savings from Maia 200 are eventually reflected in cloud subscription costs.

Memory, networking and system design

Maia 200’s architecture centers on feeding data to AI models as efficiently as possible, not just pushing raw compute power. Each chip includes 216GB of high‑bandwidth HBM3e memory delivering 7 TB/s of bandwidth, along with 272MB of on‑chip SRAM and specialized data‑movement engines to keep large models highly utilized.

Microsoft redesigned the memory subsystem around low‑precision data types, a dedicated direct memory access engine, on‑die SRAM, and a custom network‑on‑chip fabric to move data quickly and increase token throughput during inference. At the system level, each accelerator exposes 2.8 TB/s of bidirectional scale‑up bandwidth and connects into a two‑tier network built on standard Ethernet rather than proprietary fabrics.

Clusters can scale up to 6,144 Maia 200 accelerators, using the same communication protocols within trays, racks, and across the data center. Within each tray, four accelerators are directly linked to keep high‑bandwidth communication local, which Microsoft says helps reduce power use and total cost of ownership for dense inference deployments.

Role in Azure and Microsoft AI services

Maia 200 is already deployed in Microsoft’s US Central data center region near Des Moines, Iowa, and is rolling out next to the US West 3 region near Phoenix, Arizona, with more regions planned later. The chip is integrated with Azure as part of a heterogeneous AI infrastructure that also includes other types of accelerators.

Microsoft says Maia 200 will run multiple models, including OpenAI’s GPT‑5.2 family, and will support workloads for Azure AI Foundry and Microsoft 365 Copilot. The Microsoft Superintelligence team plans to use the chip for reinforcement learning and synthetic data generation, which are key for improving future in‑house AI models and speeding up the creation of domain‑specific training data.

Industry analysts argue that Microsoft’s long experience with enterprise IT gives it an advantage in embedding Maia‑based inference services directly into the broader Azure platform. Commentators also stress that Microsoft’s strategy is to complement, not outright replace, other vendors like Nvidia and AMD, while offering customers more options for high‑throughput, memory‑intensive AI inference.

Developer tools and future roadmap

To encourage early adoption, Microsoft is offering a preview of the Maia software development kit. The SDK supports popular AI frameworks, including PyTorch, and includes a Triton compiler, an optimized kernel library, access to Maia’s low‑level NPL programming language, a simulator, and a cost calculator for tuning workloads.

Microsoft says its silicon program used a sophisticated pre‑silicon environment to model large language model workloads and validate networking and cooling systems, including a second‑generation closed‑loop liquid cooling unit. According to the company, this approach allowed AI models to run on Maia 200 within days of first silicon and cut the time from first chip to data center deployment by more than half compared with earlier infrastructure programs.

The company describes Maia as a multi‑generation accelerator family and says it is already designing future versions while it deploys Maia 200 across its global infrastructure. As these chips scale out, Microsoft expects continued improvements in performance per dollar and per watt for its most important AI workloads in Azure.

TAGGED: AI chips, AI inference, Azure, cloud computing, data centers, GPT-5.2, Maia 200, Microsoft
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

ChatGPT outage disrupts users worldwide in Feb. 2025

January 28, 2026

Sun Unleashes Powerful X8.1 Flare as Giant Sunspot Region AR4366 Erupts

February 4, 2026

Entangled atomic clouds boost precision quantum sensing

January 27, 2026

Rubin Observatory Issues 800,000 Nightly Real-Time Alerts

February 27, 2026

US-Azerbaijan strategic partnership signed by JD Vance

February 11, 2026

Apple MacBook Neo: Budget Laptop Leads New Device Lineup

March 8, 2026

Related News

Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
News

New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life

Nisha Pradhan Nisha Pradhan March 18, 2026
Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
News

Semiconductor Capex Risk Grows as India Expands Fabs

Rakesh Paul Rakesh Paul March 18, 2026
A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
News

Google Gemini Workspace Features: Powerful AI Upgrades

Sameer Katoch Sameer Katoch March 18, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist