By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    A glowing antimatter atom passing through a hexagonal graphene sheet and splitting into a quantum wave interference pattern in a high-tech laboratory setting.
    Scientists Observe Positronium Wave Behavior in Lab
    May 1, 2026
    Hyper-realistic news-style image of a modern AI data center with server racks and a digital display labeled DeepSeek V4, shown in cool blue lighting.
    DeepSeek V4 launch puts Huawei AI chips in spotlight
    May 1, 2026
    A modern smartphone displaying an app storefront positioned next to a wooden judge's gavel on a desk, representing the legal battle over digital marketplace policies.
    Apple Loses Bid to Pause App Store Fee Changes
    May 1, 2026
    The NASA Curiosity rover is using its robotic arm to drill into a red sandstone rock on the dusty surface of Mars.
    Mars Organic Molecules: Curiosity Rover Makes Historic Find
    May 1, 2026
    News-style image of Elon Musk seated in a courtroom during a legal dispute involving OpenAI.
    Elon Musk OpenAI Trial Puts Nonprofit Mission on Trial
    May 1, 2026
  • Technology
    TechnologyShow More
    A modern smartphone displaying an app storefront positioned next to a wooden judge's gavel on a desk, representing the legal battle over digital marketplace policies.
    Apple Loses Bid to Pause App Store Fee Changes
    May 1, 2026
    A business professional using an AI assistant on a laptop in a modern office with a data center visible in the background.
    Microsoft Copilot Tops 20 Million Paid Enterprise Seats
    May 1, 2026
    A brightly lit modern semiconductor cleanroom featuring advanced silicon wafers and glowing blue server racks.
    Samsung Q1 Profit Surges Eightfold as AI Boom Fuels Record Chip Earnings
    April 30, 2026
    A person holding a smartphone displaying the Amazon Shopping app's AI audio chat interface in a modern living room.
    Amazon AI Audio Shopping Chat Enhanced With Real-Time Q&A
    April 29, 2026
    Hyper-realistic news-style image of a modern cloud operations room with Amazon Bedrock and OpenAI-themed screens and a coding dashboard in view.
    OpenAI Models and Codex Debut on Amazon Bedrock in Preview
    April 29, 2026
  • AI
    AIShow More
    Hyper-realistic news-style image of a modern AI data center with server racks and a digital display labeled DeepSeek V4, shown in cool blue lighting.
    DeepSeek V4 launch puts Huawei AI chips in spotlight
    May 1, 2026
    News-style image of Elon Musk seated in a courtroom during a legal dispute involving OpenAI.
    Elon Musk OpenAI Trial Puts Nonprofit Mission on Trial
    May 1, 2026
    News-style image showing LG Electronics and Nvidia branding in a modern tech setting with AI server racks and a service robot.
    Nvidia-LG Talks Highlight Wider AI Expansion Strategy
    April 30, 2026
    A dramatic courtroom setting featuring an abstract artificial intelligence hologram on a wooden table, representing the high-stakes tech trial.
    Elon Musk vs Sam Altman OpenAI Trial Over AI Future
    April 29, 2026
    A glowing artificial intelligence processor chip with V4 etched on the surface, mounted on an illuminated circuit board.
    DeepSeek V4 Launches on Huawei Chips for AI Innovation
    April 29, 2026
  • Science
    ScienceShow More
    A glowing antimatter atom passing through a hexagonal graphene sheet and splitting into a quantum wave interference pattern in a high-tech laboratory setting.
    Scientists Observe Positronium Wave Behavior in Lab
    May 1, 2026
    The NASA Curiosity rover is using its robotic arm to drill into a red sandstone rock on the dusty surface of Mars.
    Mars Organic Molecules: Curiosity Rover Makes Historic Find
    May 1, 2026
    Aerial view of the Pacific Ocean off a forested coastline with a glowing geological fault line beneath the water representing the Cascadia subduction zone.
    Earth Tearing Apart Under the Cascadia Subduction Zone
    May 1, 2026
    A young adult female patient and a doctor are looking at medical charts in a modern clinical office setting.
    Rising Cancer Rates in Young Adults: Is Obesity to Blame?
    April 29, 2026
    Microscopic view of engineered immune cells glowing in a modern, high-tech medical laboratory setting.
    CAR-T Cell Therapy Eradicates Severe Autoimmune Diseases
    April 29, 2026
  • World
    WorldShow More
    Allu Arjun Commitment to Ethical Brand Partnerships
    Exploring Allu Arjun’s Commitment to Ethical Brand Partnerships
    December 18, 2023
    Orry aka Orhan Awatramani
    Orhan Awatramani ‘Orry’ Biography, Lifestyle and Rise to Fame
    December 8, 2023
    Alia Bhatt Latest Deepake Video Victim
    Alia Bhatt becomes latest victim of Deepfake Videos, Obscene Video goes Viral
    November 28, 2023
    Napoleon Movie Review
    Napoleon Movie Review: A Historical Epic by Ridley Scott Reviewed
    November 25, 2023
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: AWS and Cerebras Deal Accelerates Cloud AI Inference
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

AWS and Cerebras Deal Accelerates Cloud AI Inference

Rakesh Paul
Last updated: 14/03/2026
Rakesh Paul
Share
6 Min Read
A highly detailed, glowing Cerebras WSE-3 artificial intelligence chip installed inside a modern Amazon Web Services data center server rack, illuminated by dramatic blue and orange lighting.

Amazon Web Services (AWS) and artificial intelligence startup Cerebras Systems have struck a major partnership to bring ultra-fast AI inference to the cloud. Announced on Friday, March 13, 2026, the agreement integrates Cerebras’s high-performance WSE-3 chips into AWS data centers to accelerate generative AI and large language model workloads.

Contents
A Disaggregated Architecture for AI ProcessingInside the Cerebras CS-3 HardwareExecutive Perspectives on the CollaborationFuture Rollouts and Cloud Expansion

This collaboration aims to solve critical speed bottlenecks in AI inference by deploying Cerebras CS-3 systems alongside AWS Trainium-powered servers on the Amazon Bedrock platform. The integration creates a powerful, highly optimized cloud computing environment designed for developers and enterprises that require real-time responses for demanding applications, such as interactive chatbots and coding assistants.

A Disaggregated Architecture for AI Processing

To achieve these performance gains, AWS and Cerebras are introducing a novel “disaggregated architecture” for AI inference workloads. Inference is the process where previously trained AI models take user requests and generate responses. Traditionally, this process is handled by a single computing system, but the new partnership splits the workload into two distinct phases using different hardware optimized for each specific task.

The first phase, known as the “prefill” stage, involves processing a user’s prompt and converting natural language into tokens that the AI system can understand. Under the new architecture, Amazon’s proprietary Trainium custom AI chips will manage this prefill phase.

The second phase is the “decode” stage, where the AI system actually generates and delivers the desired response to the user. The massive Cerebras chips will be solely responsible for this decoding process. The two systems will be interconnected using AWS’s Elastic Fabric Adapter, a high-speed networking technology that allows the hardware to communicate seamlessly.

Cerebras CEO Andrew Feldman referred to this strategy as a “divide and conquer” approach, separating prompt processing from token generation to maximize efficiency across the computing pipeline.

Inside the Cerebras CS-3 Hardware

The hardware driving the decode phase represents a significant departure from standard AI processors. The Cerebras WSE-3 is an extraordinarily powerful chip featuring 900,000 cores and 44 gigabytes of on-chip SRAM. Unlike the primary chips produced by competitors, the Cerebras architecture does not depend on costly external high-bandwidth memory.

Instead, the chips store all model weights on-chip, delivering exceptional speed. These chips are housed within the Cerebras CS-3 appliance, a water-cooled system roughly the size of a mini-fridge. The CS-3 combines the massive chip with external networking equipment and other vital components necessary for data center integration.

By bringing the CS-3 to AWS data centers, the partnership allows organizations to access this unique hardware without the friction of traditional procurement. Customers can leverage the performance benefits of the WSE-3 chip entirely through the cloud.

Executive Perspectives on the Collaboration

Leaders from both companies emphasize that this deal will democratize access to top-tier AI hardware. Feldman noted that every type of customer, from solo developers to the world’s largest financial institutions, utilizes AWS. He stated that this partnership will streamline access to Cerebras hardware, making the powerful technology available with just a simple click.

David Brown, Vice President of Compute and Machine Learning Services at AWS, highlighted the importance of speed in the current AI landscape. He explained that inference is where artificial intelligence delivers actual value to customers, but processing speed remains a critical bottleneck for demanding, real-time workloads.

However, the companies recognize that a disaggregated approach is not a one-size-fits-all solution. James Wang, Director of Product Marketing at Cerebras, explained that the disaggregated architecture is ideal for large, stable workloads. Because most customers run a mix of tasks with varying prefill and decode ratios, the traditional aggregated computing approach remains ideal for many scenarios. Consequently, the companies expect most customers will want access to both architectures depending on their specific needs.

Future Rollouts and Cloud Expansion

While the financial terms of the agreement between Amazon and the $23.1 billion chip startup were not disclosed, the deployment is expected to roll out rapidly. Later this year, AWS plans to make leading open-source large language models, as well as its proprietary Amazon Nova models, available to run on the Cerebras hardware through the Amazon Bedrock service.

The partnership arrives during a period of massive infrastructure expansion for Amazon. In its fourth-quarter earnings report, Amazon announced plans for $200 billion in capital expenditures for 2026, the vast majority of which is dedicated to expanding AWS capacity. Furthermore, the Cerebras deal coincides with a reported 11-part, $37 billion bond sale by Amazon, which is specifically aimed at funding its ongoing artificial intelligence infrastructure buildout.

Ultimately, this alliance between a dominant cloud provider and an innovative chipmaker aims to set a new standard for efficient and scalable AI inference, driving significant advancements across the tech industry.

TAGGED: AI inference, Amazon Bedrock, AWS, Cerebras Systems, cloud computing, Generative AI, machine learning, WSE-3 chip
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

Human Evolution Accelerated After Farming, Massive Ancient DNA Study Finds

April 20, 2026

Amazon smartphone comeback takes shape around Alexa

March 24, 2026

NVIDIA’s $4 Billion Bet on Photonics for AI Data Centers

March 5, 2026

Amazon AI Audio Shopping Chat Enhanced With Real-Time Q&A

April 29, 2026

Chrome Auto Browse arrives with Gemini AI in Google Chrome

January 30, 2026

UK Physics Funding Cuts Spark Alarm Across Research Sector

March 12, 2026

Related News

A glowing antimatter atom passing through a hexagonal graphene sheet and splitting into a quantum wave interference pattern in a high-tech laboratory setting.
News

Scientists Observe Positronium Wave Behavior in Lab

Nisha Pradhan Nisha Pradhan May 1, 2026
Hyper-realistic news-style image of a modern AI data center with server racks and a digital display labeled DeepSeek V4, shown in cool blue lighting.
News

DeepSeek V4 launch puts Huawei AI chips in spotlight

Sameer Katoch Sameer Katoch May 1, 2026
A modern smartphone displaying an app storefront positioned next to a wooden judge's gavel on a desk, representing the legal battle over digital marketplace policies.
News

Apple Loses Bid to Pause App Store Fee Changes

Rakesh Paul Rakesh Paul May 1, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist