By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    A glowing quantum clock fragmenting into light particles against a dark cosmic background with swirling entangled atoms and spacetime waves, representing quantum physics breakthroughs in time and the universe.
    Quantum Physics Breakthroughs Reshaping How We Understand Time and the Universe
    May 3, 2026
    A sleek and modern stage at a corporate technology launch event with glowing digital displays.
    OpenAI GPT-5.5 Launch Party and the Goblin Problem
    May 3, 2026
    A glowing digital medical tablet displaying artificial intelligence graphics in a modern hospital emergency room.
    AI Outperforms Doctors in Harvard Trial of Emergency Triage Diagnoses
    May 3, 2026
    A glowing antimatter atom passing through a hexagonal graphene sheet and splitting into a quantum wave interference pattern in a high-tech laboratory setting.
    Scientists Observe Positronium Wave Behavior in Lab
    May 1, 2026
    Hyper-realistic news-style image of a modern AI data center with server racks and a digital display labeled DeepSeek V4, shown in cool blue lighting.
    DeepSeek V4 launch puts Huawei AI chips in spotlight
    May 1, 2026
  • Technology
    TechnologyShow More
    A glowing digital medical tablet displaying artificial intelligence graphics in a modern hospital emergency room.
    AI Outperforms Doctors in Harvard Trial of Emergency Triage Diagnoses
    May 3, 2026
    A modern smartphone displaying an app storefront positioned next to a wooden judge's gavel on a desk, representing the legal battle over digital marketplace policies.
    Apple Loses Bid to Pause App Store Fee Changes
    May 1, 2026
    A business professional using an AI assistant on a laptop in a modern office with a data center visible in the background.
    Microsoft Copilot Tops 20 Million Paid Enterprise Seats
    May 1, 2026
    A brightly lit modern semiconductor cleanroom featuring advanced silicon wafers and glowing blue server racks.
    Samsung Q1 Profit Surges Eightfold as AI Boom Fuels Record Chip Earnings
    April 30, 2026
    A person holding a smartphone displaying the Amazon Shopping app's AI audio chat interface in a modern living room.
    Amazon AI Audio Shopping Chat Enhanced With Real-Time Q&A
    April 29, 2026
  • AI
    AIShow More
    A sleek and modern stage at a corporate technology launch event with glowing digital displays.
    OpenAI GPT-5.5 Launch Party and the Goblin Problem
    May 3, 2026
    Hyper-realistic news-style image of a modern AI data center with server racks and a digital display labeled DeepSeek V4, shown in cool blue lighting.
    DeepSeek V4 launch puts Huawei AI chips in spotlight
    May 1, 2026
    News-style image of Elon Musk seated in a courtroom during a legal dispute involving OpenAI.
    Elon Musk OpenAI Trial Puts Nonprofit Mission on Trial
    May 1, 2026
    News-style image showing LG Electronics and Nvidia branding in a modern tech setting with AI server racks and a service robot.
    Nvidia-LG Talks Highlight Wider AI Expansion Strategy
    April 30, 2026
    A dramatic courtroom setting featuring an abstract artificial intelligence hologram on a wooden table, representing the high-stakes tech trial.
    Elon Musk vs Sam Altman OpenAI Trial Over AI Future
    April 29, 2026
  • Science
    ScienceShow More
    A glowing quantum clock fragmenting into light particles against a dark cosmic background with swirling entangled atoms and spacetime waves, representing quantum physics breakthroughs in time and the universe.
    Quantum Physics Breakthroughs Reshaping How We Understand Time and the Universe
    May 3, 2026
    A glowing antimatter atom passing through a hexagonal graphene sheet and splitting into a quantum wave interference pattern in a high-tech laboratory setting.
    Scientists Observe Positronium Wave Behavior in Lab
    May 1, 2026
    The NASA Curiosity rover is using its robotic arm to drill into a red sandstone rock on the dusty surface of Mars.
    Mars Organic Molecules: Curiosity Rover Makes Historic Find
    May 1, 2026
    Aerial view of the Pacific Ocean off a forested coastline with a glowing geological fault line beneath the water representing the Cascadia subduction zone.
    Earth Tearing Apart Under the Cascadia Subduction Zone
    May 1, 2026
    A young adult female patient and a doctor are looking at medical charts in a modern clinical office setting.
    Rising Cancer Rates in Young Adults: Is Obesity to Blame?
    April 29, 2026
  • World
    WorldShow More
    Allu Arjun Commitment to Ethical Brand Partnerships
    Exploring Allu Arjun’s Commitment to Ethical Brand Partnerships
    December 18, 2023
    Orry aka Orhan Awatramani
    Orhan Awatramani ‘Orry’ Biography, Lifestyle and Rise to Fame
    December 8, 2023
    Alia Bhatt Latest Deepake Video Victim
    Alia Bhatt becomes latest victim of Deepfake Videos, Obscene Video goes Viral
    November 28, 2023
    Napoleon Movie Review
    Napoleon Movie Review: A Historical Epic by Ridley Scott Reviewed
    November 25, 2023
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Amazon and Cerebras Partner to Accelerate AI Inference
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
AI

Amazon and Cerebras Partner to Accelerate AI Inference

Sameer Katoch
Last updated: 15/03/2026
Sameer Katoch
Share
6 Min Read
A glowing, massive AI processor integrated into a modern cloud computing data center rack.

Amazon Web Services has reached a major agreement with semiconductor startup Cerebras Systems to integrate new hardware into its cloud infrastructure. The collaboration focuses on accelerating AI inference, which is the process where trained artificial intelligence models respond to user requests. By combining computing power from both companies, Amazon aims to speed up operations for chatbots, coding assistants, and other interactive tools.

The new cloud computing service is scheduled to launch in the second half of 2026. While the specific financial terms of the agreement remain undisclosed, the companies have been laying the groundwork for this integration for several years. The partnership marks a significant milestone, as Amazon Web Services becomes the first major cloud provider, or hyperscaler, to officially commit to offering Cerebras technology to its vast network of customers.

A Divided Approach to Faster Processing

To achieve faster response times, the two companies are utilizing a method known as inference disaggregation. Instead of relying on a single piece of hardware to manage the entire workload, Amazon and Cerebras will split the computation into two distinct stages. Company leadership describes this workflow as a divide-and-conquer strategy designed to overcome traditional processing delays.

When a user submits a prompt, the artificial intelligence model must first understand the request. This initial stage is called the prefill phase, where human words are converted into data tokens that the computer can process. Under the new system architecture, Amazon’s proprietary Trainium3 chips will exclusively handle these highly parallel prefill calculations.

Once the prefill phase is complete, the workload moves to the decode stage. During this second step, the artificial intelligence actually generates and delivers the requested answer token by token. Cerebras’ massive Wafer Scale Engine processors, which are optimized for rapid token generation, will take over to complete the decode phase. By dedicating specialized chips to different parts of the AI inference process, the companies expect to drastically reduce latency for tasks that require immediate, iterative feedback.

Integrating Hardware Inside the Cloud

As part of the hardware arrangement, massive processors from the startup will be physically installed inside Amazon Web Services data centers. The third-party processors will be directly linked to Amazon’s custom Trainium3 hardware using the cloud provider’s proprietary networking technology. This deep physical and digital integration ensures that the divided workload can move seamlessly between the two different types of silicon without unnecessary communication delays.

Nafea Bshara, a vice president at Amazon Web Services, noted that the integrated chip solution is particularly valuable for customers working in scenarios where time is money. He also indicated that the cloud provider plans to deploy as many of the startup’s chips as necessary to meet overall market demand.

For Cerebras, gaining a footprint within the world’s largest cloud computing platform offers immense visibility. Chief Executive Officer Andrew Feldman emphasized the vast reach of the cloud provider, noting that the customer base ranges from individual independent developers to massive global financial institutions. By embedding their hardware directly into this existing ecosystem, the startup hopes to make accessing its specialized computing power as simple as a single click for users around the world.

Challenging the Market Leader

The partnership arrives as technology companies scramble to build enough infrastructure to support the surging demand for artificial intelligence capabilities. Cerebras, which is currently valued at $23.1 billion, is positioning its technology as a unique alternative to traditional hardware. The company is actively preparing for an initial public offering and seeks to capture a larger share of the enterprise market.

Unlike the flagship processors sold by market leader Nvidia, the startup has engineered a fundamentally different architecture. The company relies on exceptionally large chips that can process massive volumes of data simultaneously, eliminating the need for the expensive high-bandwidth memory that typical graphics processing units require. In addition to this new cloud partnership, the startup recently secured a $10 billion contract to supply hardware to OpenAI, the creator of ChatGPT.

While Amazon remains a major purchaser of Nvidia hardware, it continues to invest heavily in developing its own custom silicon to improve data center efficiency and offer distinct services. By bringing a new, highly capitalized hardware partner into its data centers, the cloud giant is expanding the options available to artificial intelligence developers. The collaboration ultimately gives enterprise customers a new, highly specialized avenue for running complex models at high speeds.

TAGGED: AI inference, Amazon Web Services, Artificial Intelligence, Cerebras Systems, cloud computing, data centers, Nvidia, Trainium3
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Sameer Katoch
As the Founder of VellaTimes and an avid traveler, I'm passionate about the daily news events happening globally. With over five years of experience in the writing field, I am committed to delivering top-notch news that satisfies your daily news intake.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

MacBook Air M5, MacBook Pro M5: prices and specs 2026

March 4, 2026

Meta Superintelligence Push Accelerates With AI Chip Deal

March 6, 2026

Killer Whale Cannibalism: Severed Fins in Russia Spark Scientific Debate

March 8, 2026

Gravitational Constant Mystery Deepens After NIST Study

April 27, 2026

Cosmic Hum May Solve the Hubble Tension Expansion Mystery

March 7, 2026

Massive X1.4 Solar Flare Erupts Before Artemis II Launch

March 31, 2026

Related News

A glowing quantum clock fragmenting into light particles against a dark cosmic background with swirling entangled atoms and spacetime waves, representing quantum physics breakthroughs in time and the universe.
News

Quantum Physics Breakthroughs Reshaping How We Understand Time and the Universe

Nisha Pradhan Nisha Pradhan May 3, 2026
A sleek and modern stage at a corporate technology launch event with glowing digital displays.
News

OpenAI GPT-5.5 Launch Party and the Goblin Problem

Sameer Katoch Sameer Katoch May 3, 2026
A glowing digital medical tablet displaying artificial intelligence graphics in a modern hospital emergency room.
News

AI Outperforms Doctors in Harvard Trial of Emergency Triage Diagnoses

Rakesh Paul Rakesh Paul May 3, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist