VellaTimes
© 2022 VellaTimes • All Rights Reserved.
News

AWS and Cerebras Deal Accelerates Cloud AI Inference

Rakesh Paul
Last updated: 14/03/2026
A highly detailed, glowing Cerebras WSE-3 artificial intelligence chip installed inside a modern Amazon Web Services data center server rack, illuminated by dramatic blue and orange lighting.

Amazon Web Services (AWS) and artificial intelligence startup Cerebras Systems have struck a major partnership to bring ultra-fast AI inference to the cloud. Announced on Friday, March 13, 2026, the agreement integrates Cerebras’s high-performance WSE-3 chips into AWS data centers to accelerate generative AI and large language model workloads.

Contents
  • A Disaggregated Architecture for AI Processing
  • Inside the Cerebras CS-3 Hardware
  • Executive Perspectives on the Collaboration
  • Future Rollouts and Cloud Expansion

The collaboration aims to remove speed bottlenecks in AI inference by deploying Cerebras CS-3 systems alongside AWS Trainium-powered servers on the Amazon Bedrock platform. The combined setup targets developers and enterprises that need real-time responses for demanding applications such as interactive chatbots and coding assistants.

A Disaggregated Architecture for AI Processing

To achieve these performance gains, AWS and Cerebras are introducing a “disaggregated architecture” for AI inference workloads. Inference is the process in which a previously trained AI model takes a user request and generates a response. Traditionally, a single computing system handles the whole process, but the new partnership splits the workload into two distinct phases and runs each on hardware optimized for that task.

The first phase, known as the “prefill” stage, involves processing the user’s prompt. The model reads all of the prompt’s tokens in a single pass and builds the internal state it needs before it can begin generating a response. Under the new architecture, Amazon’s custom Trainium AI chips will manage this prefill phase.

The second phase is the “decode” stage, in which the model generates the response one token at a time. The wafer-scale Cerebras chips will handle this stage exclusively. The two systems will be connected through AWS’s Elastic Fabric Adapter, a high-speed network interface that lets the two sets of hardware exchange data with low latency.
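
The two-phase split described above can be illustrated with a toy sketch. It is not AWS’s or Cerebras’s implementation: the names `KVCache`, `toy_state`, `prefill`, `decode`, and `serve` are hypothetical, and the arithmetic is only a stand-in for a real model. What it shows is the shape of the handoff, where one step processes the whole prompt and produces a cache, and a second step generates tokens one at a time from that cache.

```python
from dataclasses import dataclass, field


@dataclass
class KVCache:
    """Hypothetical per-request state handed from the prefill step to the decode step."""
    entries: list = field(default_factory=list)


def toy_state(token: int) -> int:
    # Stand-in for the per-token state a real model would compute.
    return (token * 31 + 7) % 101


def prefill(prompt_tokens: list[int]) -> KVCache:
    """Phase 1: process every prompt token up front and build the cache."""
    return KVCache(entries=[toy_state(t) for t in prompt_tokens])


def decode(cache: KVCache, max_new_tokens: int) -> list[int]:
    """Phase 2: generate one token per step, reading the whole cache each time."""
    output = []
    for _ in range(max_new_tokens):
        next_token = sum(cache.entries) % 50  # toy rule that depends on all prior state
        output.append(next_token)
        cache.entries.append(toy_state(next_token))  # cache grows by one entry per token
    return output


def serve(prompt_tokens: list[int], max_new_tokens: int) -> list[int]:
    cache = prefill(prompt_tokens)  # would run on the prefill hardware
    # In a disaggregated setup, the cache would cross the network at this point.
    return decode(cache, max_new_tokens)  # would run on the decode hardware
```

Calling `serve([1, 2, 3], 4)` returns four generated tokens. The point of the sketch is that `prefill` does its work in one batch over the prompt, while `decode` is a sequential loop that rereads its state at every step, which is why the two phases can benefit from different hardware.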

Cerebras CEO Andrew Feldman referred to this strategy as a “divide and conquer” approach, separating prompt processing from token generation to maximize efficiency across the computing pipeline.

Inside the Cerebras CS-3 Hardware

The hardware driving the decode phase represents a significant departure from standard AI processors. The Cerebras WSE-3 is a wafer-scale chip with 900,000 cores and 44 gigabytes of on-chip SRAM. Unlike most competing AI accelerators, the Cerebras architecture does not depend on costly external high-bandwidth memory.

Instead, the chips keep model weights in on-chip memory, which is the source of their speed advantage. The chips are housed within the Cerebras CS-3 appliance, a water-cooled system roughly the size of a mini-fridge. The CS-3 combines the chip with the networking equipment and other components needed for data center integration.
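
As a rough back-of-the-envelope check on what 44 gigabytes of on-chip memory can hold, the sketch below estimates how many chips would be needed just to fit a model’s weights. The function name and the example model sizes are hypothetical, decimal gigabytes are assumed, and the estimate ignores everything other than weights (such as per-request state), so real deployments may differ.

```python
import math

# 44 GB of on-chip SRAM per WSE-3, as stated in the article (decimal GB assumed).
WSE3_SRAM_BYTES = 44 * 10**9


def min_chips_for_weights(num_params: int, bytes_per_param: float) -> int:
    """Lower bound on chips needed to hold the weights alone in on-chip SRAM."""
    weight_bytes = num_params * bytes_per_param
    return max(1, math.ceil(weight_bytes / WSE3_SRAM_BYTES))


# Hypothetical model sizes, for illustration only.
small = min_chips_for_weights(8 * 10**9, 2)        # 8B parameters at 16 bits each
large = min_chips_for_weights(70 * 10**9, 2)       # 70B parameters at 16 bits each
large_8bit = min_chips_for_weights(70 * 10**9, 1)  # 70B parameters at 8 bits each
```

Under these assumptions, a 16 GB set of weights fits on one chip, while 140 GB of weights would have to be spread across at least four.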

By bringing the CS-3 to AWS data centers, the partnership allows organizations to access this unique hardware without the friction of traditional procurement. Customers can leverage the performance benefits of the WSE-3 chip entirely through the cloud.

Executive Perspectives on the Collaboration

Leaders from both companies emphasize that this deal will democratize access to top-tier AI hardware. Feldman noted that every type of customer, from solo developers to the world’s largest financial institutions, utilizes AWS. He stated that this partnership will streamline access to Cerebras hardware, making the powerful technology available with just a simple click.

David Brown, Vice President of Compute and Machine Learning Services at AWS, highlighted the importance of speed in the current AI landscape. He explained that inference is where artificial intelligence delivers actual value to customers, but processing speed remains a critical bottleneck for demanding, real-time workloads.

However, the companies recognize that a disaggregated approach is not a one-size-fits-all solution. James Wang, Director of Product Marketing at Cerebras, explained that the disaggregated architecture is ideal for large, stable workloads. Because most customers run a mix of tasks with varying prefill and decode ratios, the traditional aggregated computing approach remains ideal for many scenarios. Consequently, the companies expect most customers will want access to both architectures depending on their specific needs.
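
A simple calculation suggests why the workload mix matters. The sketch below assumes a fixed pool of prefill capacity and a fixed pool of decode capacity, each measured in tokens per second, and asks how busy each pool stays for a given request shape. The function name and all numbers are hypothetical and chosen for illustration; they are not figures from AWS or Cerebras.

```python
def pool_utilization(prefill_capacity: float, decode_capacity: float,
                     prompt_tokens: int, output_tokens: int):
    """Return (requests_per_sec, prefill_utilization, decode_utilization)."""
    # Each pool can sustain at most capacity / tokens-per-request requests per second.
    prefill_limit = prefill_capacity / prompt_tokens
    decode_limit = decode_capacity / output_tokens
    # The slower pool sets the overall request rate.
    rate = min(prefill_limit, decode_limit)
    return rate, rate / prefill_limit, rate / decode_limit


# Pools sized for long prompts and short answers (a 10:1 token ratio).
matched = pool_utilization(100_000, 10_000, prompt_tokens=1_000, output_tokens=100)

# Same pools, but requests now have short prompts and long answers.
mismatched = pool_utilization(100_000, 10_000, prompt_tokens=200, output_tokens=400)
```

In the matched case both pools run at full utilization. In the mismatched case the decode pool is the bottleneck and the prefill pool sits almost entirely idle, which is consistent with the point that a fixed split suits stable workloads better than mixed ones.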

Future Rollouts and Cloud Expansion

While the financial terms of the agreement between Amazon and the $23.1 billion chip startup were not disclosed, the deployment is expected to roll out rapidly. Later this year, AWS plans to make leading open-source large language models, as well as its proprietary Amazon Nova models, available to run on the Cerebras hardware through the Amazon Bedrock service.

The partnership arrives during a period of massive infrastructure expansion for Amazon. In its fourth-quarter earnings report, Amazon announced plans for $200 billion in capital expenditures for 2026, the vast majority of which is dedicated to expanding AWS capacity. Furthermore, the Cerebras deal coincides with a reported 11-part, $37 billion bond sale by Amazon, which is specifically aimed at funding its ongoing artificial intelligence infrastructure buildout.

Ultimately, this alliance between a dominant cloud provider and an innovative chipmaker aims to set a new standard for efficient and scalable AI inference, driving significant advancements across the tech industry.

TAGGED: AI inference, Amazon Bedrock, AWS, Cerebras Systems, cloud computing, Generative AI, machine learning, WSE-3 chip
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.

About Us

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.
