VellaTimes
© 2022 VellaTimes • All Rights Reserved.

AWS and Cerebras Partner to Deliver Faster AI Inference with Giant Chips

Rakesh Paul
Last updated: 16/03/2026
A glowing giant computer chip displayed on a server rack inside a modern, brightly lit cloud data center.

Amazon Web Services (AWS) is partnering with hardware startup Cerebras Systems to pair Amazon’s custom Trainium processors with Cerebras’ wafer-scale chips. The collaboration aims to accelerate artificial intelligence (AI) inference workloads for cloud customers and to challenge Nvidia’s dominance in the AI infrastructure market.

The combined hardware service will be deployed inside AWS data centers and offered through Amazon Bedrock. Reports conflict on the launch timeline: Bloomberg reports that the service is expected to roll out in the second half of 2026, while press statements from AWS and Cerebras indicate that the integration will launch in the next couple of months. Financial terms were not disclosed. AWS Vice President Nafea Bshara said the two companies have been working toward the partnership for several years and that AWS intends to install as many Cerebras chips as demand dictates.

Tackling the Speed Bottleneck

According to AWS, inference is the phase where AI delivers tangible value to end users. Speed, however, remains a bottleneck for demanding workloads such as real-time coding assistance and interactive AI applications. As reasoning models come to represent the majority of AI inference, systems must generate significantly more tokens per request while they “think” through complex problems, which increases the need for faster inference across the industry.

AI companies including OpenAI, Cognition, and Mistral already use Cerebras hardware for their most demanding workloads. Cerebras has demonstrated that it can run models from OpenAI, Cognition, and Meta at speeds of up to 3,000 tokens per second. That speed matters most for tasks like agentic coding, where a developer’s productivity is directly constrained by inference speed.
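
To make the throughput figure concrete, the following rough sketch estimates how long one response takes to stream at different decode speeds. Only the 3,000 tokens-per-second figure comes from the reporting above. The 50 tokens-per-second baseline and the 6,000-token response length are illustrative assumptions, and the calculation ignores prompt processing, queuing, and network latency.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to stream a response, counting only token generation."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Hypothetical length of a reasoning-style response (assumption, not from the article).
RESPONSE_TOKENS = 6_000

# 3,000 tok/s is the peak figure cited above; 50 tok/s is an assumed baseline.
SPEEDS = {"assumed baseline": 50.0, "cited Cerebras peak": 3_000.0}


def compare(output_tokens: int = RESPONSE_TOKENS) -> dict[str, float]:
    """Return the estimated streaming time in seconds for each speed."""
    return {name: generation_seconds(output_tokens, speed) for name, speed in SPEEDS.items()}


if __name__ == "__main__":
    for name, seconds in compare().items():
        print(f"{name}: {seconds:.1f} s")
```

Under these assumptions, the same response that takes two minutes at the baseline speed streams in two seconds at the cited peak, which is the gap that matters when a developer is waiting on each agent step.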

The Disaggregated Inference Strategy

To reach these speeds, the companies are deploying a strategy called disaggregated inference. Instead of running the entire inference pipeline on a single type of processor, such as a graphics processing unit (GPU), the workload is split into two stages that each run on specialized hardware. The two systems are connected inside AWS infrastructure by Amazon’s high-bandwidth, low-latency Elastic Fabric Adapter (EFA) networking.

The first stage, “prefill,” processes the user’s prompt, already split into tokens, and builds the intermediate state the model needs before it can start generating a response. Amazon’s custom Trainium 3 chips, which feature dense compute cores designed for scalable performance, will handle this compute-intensive phase.

The second stage, “decode,” is a memory-intensive process in which the model generates its response one token at a time. Cerebras’ CS-3 system, built around the company’s Wafer Scale Engine chip, will handle this stage. The wafer-scale chip is designed to hold all of a model’s weights on-chip in static random-access memory (SRAM), a design that, according to the companies, gives the CS-3 thousands of times more memory bandwidth than the fastest GPUs on the market.
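
The two-stage split described above can be sketched with a toy example. This is a minimal illustration only, not the AWS or Cerebras implementation: the function names, the `KVCache` class, and the arithmetic standing in for a model are all hypothetical. It assumes, as is common in disaggregated serving, that the state handed from prefill to decode is a key-value cache.

```python
from dataclasses import dataclass, field


@dataclass
class KVCache:
    """Toy stand-in for the per-token state that prefill hands to decode."""
    entries: list[int] = field(default_factory=list)


def prefill(prompt_tokens: list[int]) -> KVCache:
    """Stage 1 (compute-bound): process the whole prompt in a single pass."""
    # A real model would run attention over all prompt tokens in parallel;
    # here each prompt token simply contributes one toy cache entry.
    return KVCache(entries=[token * 2 for token in prompt_tokens])


def decode(cache: KVCache, max_new_tokens: int) -> list[int]:
    """Stage 2 (memory-bound): emit one token per step, rereading the cache each time."""
    output: list[int] = []
    for _ in range(max_new_tokens):
        next_token = sum(cache.entries) % 100  # toy "model" that reads every cached entry
        output.append(next_token)
        cache.entries.append(next_token * 2)   # the cache grows with each generated token
    return output


def disaggregated_generate(prompt_tokens: list[int], max_new_tokens: int) -> list[int]:
    cache = prefill(prompt_tokens)        # would run on the prefill hardware
    # In a real deployment the cache would cross the network between systems here.
    return decode(cache, max_new_tokens)  # would run on the decode hardware


if __name__ == "__main__":
    print(disaggregated_generate([1, 2, 3], 3))  # prints [12, 36, 8]
```

The point of the sketch is the shape of the work: prefill touches every prompt token once and can be parallelized, while decode is a sequential loop that rereads the accumulated state at every step, which is why the two stages favor different hardware.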

Industry Impact and Future Outlook

David Brown, Vice President of Compute and Machine Learning Services at AWS, said that splitting the inference workload lets each system focus on what it does best. He said the dual-chip approach will deliver inference an order of magnitude faster, with significantly higher performance, than currently available options. Cerebras CEO Andrew Feldman described the hybrid architecture as a “divide and conquer” strategy that will bring the fastest possible inference to a global enterprise customer base.

The hybrid design also targets cost efficiency: it aims to deliver five times more high-speed token capacity within the same hardware footprint. Later this year, AWS plans to offer leading open-source large language models (LLMs) and its proprietary Amazon Nova models running on the Cerebras hardware.

For Cerebras, a startup preparing for an initial public offering, securing AWS as a customer is a major milestone: AWS is the first major hyperscaler to commit to using Cerebras technology. Amazon remains a significant customer of market leader Nvidia, but it continues to expand its own silicon roadmap. As inference workloads grow, cloud providers are increasingly experimenting with heterogeneous hardware architectures to reduce their dependence on Nvidia’s firmly established CUDA software ecosystem and mature tooling.

TAGGED: AI inference, Amazon Bedrock, AWS, Cerebras Systems, cloud computing, Generative AI, machine learning, Trainium 3
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.