AWS Integrates Cerebras AI Chips to Supercharge Cloud Inference

Sameer Katoch
Last updated: 14/03/2026

Amazon Web Services (AWS) has partnered with artificial intelligence chipmaker Cerebras Systems to deploy the world’s largest AI processor in its cloud data centers. The multi-year agreement, announced on Friday, will make Cerebras’ Wafer-Scale Engine 3 (WSE-3) chips available to developers via the Amazon Bedrock managed service in the coming months. By combining Cerebras hardware with Amazon’s custom Trainium processors, the companies expect to increase the speed at which AI models generate output by a factor of five.

The collaboration introduces a disaggregated architecture designed to tackle the distinct computational challenges of AI inference. Inference, the stage where trained models generate responses to user prompts, is divided into two main phases known as prefill and decode.

During the prefill stage, a user’s prompt is broken down into tokens and the model processes all of them at once, which is a computationally intensive and naturally parallel process. The decode phase follows, generating the model’s response sequentially, one token at a time. Decoding is less demanding on raw computation but requires massive memory bandwidth to constantly move data between logic circuits and memory.
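The difference between the two phases can be sketched with a toy example. This is an illustration only, not how any production model works: the `ToyCache` class, the arithmetic inside `prefill` and `decode_step`, and all other names are hypothetical stand-ins. What it shows is the shape of the work: prefill handles every prompt token in one pass, while decode must loop, re-reading the accumulated state for each new token.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCache:
    """Hypothetical stand-in for the state a model keeps between steps."""
    entries: list = field(default_factory=list)


def prefill(prompt_tokens):
    # All prompt tokens are known up front, so they can be processed
    # together in one pass (here, a single list comprehension).
    return ToyCache(entries=[t * 2 for t in prompt_tokens])


def decode_step(cache):
    # Each new token depends on everything seen so far, so the whole
    # cache is read again on every step. This repeated read is why
    # decode leans on memory bandwidth rather than raw compute.
    next_token = sum(cache.entries) % 1000
    cache.entries.append(next_token * 2)
    return next_token


def generate(prompt_tokens, max_new_tokens):
    cache = prefill(prompt_tokens)      # one parallel pass over the prompt
    output = []
    for _ in range(max_new_tokens):     # strictly sequential loop
        output.append(decode_step(cache))
    return output


if __name__ == "__main__":
    print(generate([1, 2, 3], max_new_tokens=3))
```

The loop in `generate` cannot be parallelized the way `prefill` can, because each call to `decode_step` needs the token produced by the previous call.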

Traditionally, a single chip handles both phases of this process. However, the AWS and Cerebras partnership splits the workload between specialized hardware. Amazon’s proprietary Trainium chips will handle the prefill stage, while the Cerebras WSE-3 processors will take over the decode phase. The two systems will be linked using Amazon’s Elastic Fabric Adapter (EFA), a custom network device that bypasses the host server’s operating system to accelerate connections and prevent network congestion.
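As a rough sketch of what splitting the workload means for a serving system, the example below routes one request through two separate worker types with an explicit handoff between them. Everything here is assumed for illustration: `PrefillWorker`, `DecodeWorker`, `Handoff`, and `serve` are hypothetical names, the token arithmetic is a placeholder, and the real AWS and Cerebras software interfaces are not described in the article.

```python
from dataclasses import dataclass


@dataclass
class Handoff:
    """Hypothetical state passed from the prefill pool to the decode pool."""
    request_id: str
    kv_state: list


class PrefillWorker:
    """Stands in for a compute-optimized device (Trainium in the article)."""

    def run(self, request_id, prompt_tokens):
        # Placeholder computation over the full prompt in one pass.
        return Handoff(request_id, [t + 1 for t in prompt_tokens])


class DecodeWorker:
    """Stands in for a bandwidth-optimized device (WSE-3 in the article)."""

    def run(self, handoff, max_new_tokens):
        state = list(handoff.kv_state)
        output = []
        for _ in range(max_new_tokens):
            token = len(state)          # placeholder for a model step
            state.append(token)
            output.append(token)
        return output


def serve(request_id, prompt_tokens, max_new_tokens):
    handoff = PrefillWorker().run(request_id, prompt_tokens)
    # In the system the article describes, this handoff would travel
    # over the network link (EFA) between the two kinds of hardware.
    return DecodeWorker().run(handoff, max_new_tokens)


if __name__ == "__main__":
    print(serve("req-1", [5, 6, 7], max_new_tokens=2))
```

The design point is that the handoff object is the only thing the two stages share, so each pool of hardware can be sized and scaled independently of the other.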

David Brown, Vice President of Compute and Machine Learning Services at AWS, highlighted that speed remains a critical bottleneck for demanding workloads like real-time coding assistance and interactive applications. By separating the workload across Trainium and Cerebras systems, each chip can perform the specific tasks it handles best. This approach is expected to deliver inference speeds an order of magnitude faster than current cloud offerings.

Cerebras Systems Founder and Chief Executive Officer Andrew Feldman stated that the disaggregated inference solution will bring blisteringly fast AI performance to a global customer base within their existing AWS environments.

The Massive Scale of the WSE-3 Processor

Cerebras has gained industry attention for its unconventional approach to semiconductor manufacturing. While traditional methods involve cutting a silicon wafer into numerous smaller chips, Cerebras uses an entire wafer to build a single massive processor.

The WSE-3 chip features approximately four trillion transistors and 900,000 AI-optimized cores. It also includes 44 gigabytes of on-chip memory. Cerebras packages this processor within a water-cooled system known as the CS-3, an appliance roughly the size of a mini-fridge that houses the WSE-3 alongside external memory and networking equipment.

This massive scale provides the WSE-3 with 27 petabytes per second of internal memory bandwidth. According to the company, this bandwidth is more than 200 times greater than what is offered by Nvidia’s NVLink interconnect technology. The immense data movement capabilities make the WSE-3 highly optimized for the demanding memory requirements of the decode phase in AI inference.
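To see why bandwidth matters so much for decode, a back-of-the-envelope estimate helps. The sketch below uses only the two figures quoted in this article (27 petabytes per second and 44 gigabytes) and makes several simplifying assumptions that are not from the source: decimal units, a hypothetical model whose data exactly fills on-chip memory, one full read of that data per generated token, and no limits from compute or networking. It yields an upper bound, not a performance prediction.

```python
PB = 10**15  # bytes in a petabyte (decimal units assumed)
GB = 10**9   # bytes in a gigabyte (decimal units assumed)

WSE3_BANDWIDTH_BYTES_PER_S = 27 * PB  # figure quoted in the article
WSE3_ON_CHIP_MEMORY_BYTES = 44 * GB   # figure quoted in the article


def max_decode_tokens_per_second(bandwidth_bytes_per_s, bytes_read_per_token):
    """Upper bound on decode rate if each token needs one full read."""
    return bandwidth_bytes_per_s / bytes_read_per_token


def implied_comparison_bandwidth(bandwidth_bytes_per_s, ratio):
    """Bandwidth of the comparison point, given a claimed speed ratio."""
    return bandwidth_bytes_per_s / ratio


if __name__ == "__main__":
    # Hypothetical case: the data read per token fills on-chip memory.
    bound = max_decode_tokens_per_second(
        WSE3_BANDWIDTH_BYTES_PER_S, WSE3_ON_CHIP_MEMORY_BYTES
    )
    print(f"decode upper bound: {bound:,.0f} tokens/s")

    # "More than 200 times" implies the comparison figure is below this.
    ceiling = implied_comparison_bandwidth(WSE3_BANDWIDTH_BYTES_PER_S, 200)
    print(f"comparison bandwidth below: {ceiling / 10**12:.0f} TB/s")
```

Under these assumptions the bound works out to roughly 614,000 tokens per second for a single stream, and the "more than 200 times" claim implies a comparison figure below 135 terabytes per second. Real throughput depends on many factors this estimate ignores.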

Through Amazon Bedrock, customers will be able to utilize this hardware without managing the physical infrastructure directly. The service will support popular open-source large language models as well as Amazon’s proprietary generative AI systems, including the Nova model family.

Rising Competition in AI Hardware

The AWS and Cerebras partnership underscores the intensifying battle for dominance in the AI hardware market. Currently, Nvidia and its graphics processing unit (GPU) accelerators hold a commanding market share. The explosive adoption of generative AI has led to surging demand for these chips, prompting major cloud providers to seek alternative architectures and develop custom silicon.

Google relies on its proprietary Tensor Processing Units (TPUs) to power AI models across its ecosystem. Microsoft recently introduced its Maia AI accelerator and Cobalt central processing units. Similarly, Meta Platforms has deployed its custom Meta Training and Inference Accelerator (MTIA) chips for workloads on Facebook and Instagram.

For Cerebras, the AWS collaboration follows significant business momentum. The startup recently secured a computing infrastructure deal with OpenAI, agreeing to supply 750 megawatts of computing capacity through 2028. This agreement, reportedly worth over $10 billion, arrived between two funding rounds that raised more than $2 billion for Cerebras. The company is reportedly preparing for an initial public offering as soon as the second quarter, and these high-profile cloud partnerships could bolster investor confidence ahead of the listing.

TAGGED: AI chips, Amazon Bedrock, AWS, Cerebras Systems, cloud computing, Generative AI, machine learning, WSE-3
By Sameer Katoch
As the Founder of VellaTimes and an avid traveler, I'm passionate about the daily news events happening globally. With over five years of experience in the writing field, I am committed to delivering top-notch news that satisfies your daily news intake.