By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    Close-up of a silver espresso machine extracting a fresh shot of coffee into a glass cup in a softly lit cafe setting.
    Espresso Extraction Science: The Finer Grind Flaw
    May 18, 2026
    A smartphone resting on a wooden desk displaying an AI-powered Amazon search bar in a modern home office setting.
    Amazon Alexa for Shopping Replaces Rufus AI Assistant
    May 18, 2026
    Wide news-style image showing an OpenAI office scene with screens displaying audio waveforms and voice technology graphics
    OpenAI acquires Weights.gg to boost voice AI tools
    May 18, 2026
    Federal agents standing outside a modern university biology laboratory building at dusk during an active investigation.
    US Arrests Chinese Scientists for Smuggling Biological Materials
    May 18, 2026
    A dramatically lit modern corporate courtroom with futuristic technology elements, representing a high-stakes artificial intelligence legal trial.
    Elon Musk OpenAI Lawsuit Exposes Clashes Over AI Safety
    May 18, 2026
  • Technology
    TechnologyShow More
    Wide news-style image showing an OpenAI office scene with screens displaying audio waveforms and voice technology graphics
    OpenAI acquires Weights.gg to boost voice AI tools
    May 18, 2026
    A polished silicon wafer rests on a surface inside a modern semiconductor manufacturing facility.
    Samsung Strike Threatens Global AI Chip Production
    May 18, 2026
    A glowing computer screen displaying the text GPT-5.5 Instant in a modern, high-tech office environment with soft blue and purple lighting.
    GPT-5.5 Instant: OpenAI’s New Default ChatGPT Model
    May 10, 2026
    Wide view of a modern AI data center with server racks, glowing fiber-optic cables, and semiconductor hardware in the foreground.
    AI Infrastructure Spending Drives Nvidia, AMD Shares
    May 10, 2026
    A glowing computer monitor displaying lines of code and digital network graphics in a modern tech office setting.
    Airbnb AI Coding: 60% of New Software Now Generated by AI
    May 9, 2026
  • AI
    AIShow More
    A smartphone resting on a wooden desk displaying an AI-powered Amazon search bar in a modern home office setting.
    Amazon Alexa for Shopping Replaces Rufus AI Assistant
    May 18, 2026
    A dramatically lit modern corporate courtroom with futuristic technology elements, representing a high-stakes artificial intelligence legal trial.
    Elon Musk OpenAI Lawsuit Exposes Clashes Over AI Safety
    May 18, 2026
    A high-tech global map visualization showing glowing digital connections across different continents, representing the worldwide adoption of artificial intelligence.
    Global AI Adoption in 2026: Trends and Growing Divide
    May 10, 2026
    A modern smartphone displaying an artificial intelligence chat interface used for online shopping and product comparison.
    Alibaba Qwen AI Taobao Integration Launches Agentic Shopping
    May 10, 2026
    A split-screen illustration showing a high-tech modern office using advanced AI tools contrasted against an older, dimly lit workspace.
    Global AI Adoption Surges But Rich-Poor Divide Widens
    May 9, 2026
  • Science
    ScienceShow More
    Close-up of a silver espresso machine extracting a fresh shot of coffee into a glass cup in a softly lit cafe setting.
    Espresso Extraction Science: The Finer Grind Flaw
    May 18, 2026
    Federal agents standing outside a modern university biology laboratory building at dusk during an active investigation.
    US Arrests Chinese Scientists for Smuggling Biological Materials
    May 18, 2026
    Header image of a quantum communication lab setup with fiber-optic equipment, a telecom quantum dot device, and interferometer components used for long-distance quantum key distribution.
    Quantum Key Distribution Reaches 120 km With Quantum Dots
    May 10, 2026
    Abstract geometric representation of glowing quantum paraparticles interacting within a three-dimensional mathematical grid in deep blue and gold tones.
    Quantum Paraparticles Exist: New Math Challenges Physics
    May 10, 2026
    A large expedition cruise ship is navigating rough ocean waters under a cloudy sky.
    Global Authorities Respond to Andes Hantavirus Outbreak on MV Hondius Cruise Ship
    May 9, 2026
  • World
    WorldShow More
    Allu Arjun Commitment to Ethical Brand Partnerships
    Exploring Allu Arjun’s Commitment to Ethical Brand Partnerships
    December 18, 2023
    Orry aka Orhan Awatramani
    Orhan Awatramani ‘Orry’ Biography, Lifestyle and Rise to Fame
    December 8, 2023
    Alia Bhatt Latest Deepake Video Victim
    Alia Bhatt becomes latest victim of Deepfake Videos, Obscene Video goes Viral
    November 28, 2023
    Napoleon Movie Review
    Napoleon Movie Review: A Historical Epic by Ridley Scott Reviewed
    November 25, 2023
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Google Launches Gemini 3.1 Flash Live AI Audio Model
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

Google Launches Gemini 3.1 Flash Live AI Audio Model

Sameer Katoch
Last updated: 27/03/2026
Sameer Katoch
Share
7 Min Read
A sleek smartphone and a glowing modern microphone on a desk with digital sound waves floating above, representing advanced AI audio technology and real-time voice processing.

Google and Cohere have officially introduced new artificial intelligence models optimized for audio and voice processing . Google released Gemini 3.1 Flash Live, an advanced audio-to-audio model designed for real-time dialogue and voice-first applications . Simultaneously, Cohere launched Cohere Transcribe, an AI algorithm built exclusively for highly accurate speech transcription . Both releases offer significant improvements in output quality, latency, and task execution over previous generations .

Contents
Enhancing Real-Time Dialogue and Multimodal FeaturesSetting New Benchmarks in Audio PerformanceGlobal Expansion and Security FeaturesCohere Transcribe Targets Speech Accuracy

Gemini 3.1 Flash Live is now rolling out across multiple Google platforms . Developers can access the preview version through the Gemini Live API in Google AI Studio . Enterprise clients can utilize the technology via Gemini Enterprise for Customer Experience to automate and manage customer service interactions . For everyday consumers, the model is available right now through Gemini Live and Search Live, bringing faster and more fluid voice interactions to mobile devices and Chromebooks .

Enhancing Real-Time Dialogue and Multimodal Features

Google built Gemini 3.1 Flash Live to handle the natural rhythm and speed of human speech . The model directly addresses common issues in voice AI, such as stuttering, hesitation, or user interruptions . It delivers much faster responses and can follow a conversation thread for twice as long as the previous model, keeping a user’s train of thought intact during lengthy brainstorming sessions .

The updated AI is significantly better at recognizing acoustic nuances, such as pitch and pace, compared to the 2.5 Flash Native Audio model . It can detect when a speaker is getting confused or frustrated and will dynamically adjust its tone and responses to match the situation . This makes it highly effective for enterprise customer support, where a voice agent could automatically process tasks like product return requests .

Furthermore, Gemini 3.1 Flash Live supports multimodal inputs . Users can combine speech with images to solve problems . For example, a customer dealing with a malfunctioning smart home appliance can upload a photo of the device and use voice commands to troubleshoot the issue . The model also features tool use capabilities, allowing it to retrieve relevant data from external sources, such as product documentation repositories, to assist users .

Setting New Benchmarks in Audio Performance

Google evaluated the new model’s tool use and reasoning capabilities through rigorous industry benchmarks . On ComplexFuncBench Audio, which measures multi-step function calling with various constraints, Gemini 3.1 Flash Live achieved a score of 90.8 percent . This represents a nearly 20 percent improvement over Google’s previous-generation model .

The AI also set a record on Scale AI’s Audio MultiChallenge, scoring 36.1 percent with its “thinking” feature enabled . This specific benchmark tests the model’s ability to follow complex instructions and perform long-horizon reasoning while navigating the interruptions and hesitations typical of real-world audio . Major companies, including Verizon, LiveKit, and The Home Depot, have already provided positive feedback after integrating the model into their workflows, highlighting its improved, natural conversation .

Global Expansion and Security Features

Because Gemini 3.1 Flash Live is inherently multilingual, Google is using this launch to expand Search Live globally . Users in more than 200 countries and territories can now engage in real-time, multimodal conversations with Search in their preferred languages .

To help prevent the spread of misinformation, Google has implemented a security feature called SynthID . All audio generated by Gemini 3.1 Flash Live includes an imperceptible SynthID watermark interwoven directly into the audio output . This ensures that AI-generated content can be reliably detected .

Cohere Transcribe Targets Speech Accuracy

Alongside Google’s release, Cohere introduced Cohere Transcribe, an AI model with a narrower focus built exclusively for transcription tasks . The company states that the algorithm is the most accurate in its category, achieving the top position on the Hugging Face Open ASR Leaderboard and demonstrating an average word error rate of just 5.42 percent .

Cohere Transcribe begins the transcript generation process by translating raw audio into mathematical representations that are easier to process . This task is performed by a Conformer algorithm, which combines a convolutional neural network—a type of AI often used for audio processing tasks—with a transformer model . A standalone transformer then uses those representations to generate the final text transcript . The model can output text in more than a dozen languages .

Despite its high accuracy, Cohere Transcribe operates efficiently . The model contains a total of 2 billion parameters across its components, requiring relatively little computing power to run . It is available under an open-source Apache 2.0 license, allowing companies to deploy the algorithm on their own infrastructure or through Cohere’s Model Vault managed inference service . Cohere also plans to integrate the transcription model into its North productivity platform, enabling workers to search business documents and automate repetitive tasks .

TAGGED: AI audio models, Cohere Transcribe, Gemini 3.1 Flash Live, Google AI, Google AI Studio, machine learning, voice AI
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Sameer Katoch
As the Founder of VellaTimes and an avid traveler, I'm passionate about the daily news events happening globally. With over five years of experience in the writing field, I am committed to delivering top-notch news that satisfies your daily news intake.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

AWS Upgrades Storage for the AI Era With Amazon S3 Files

April 11, 2026

Curiosity Rover Finds New Organic Molecules on Mars

April 25, 2026

Ireland Opens Data Protection Probe Into Grok Over Deepfake Images

February 17, 2026

AI Chatbot Safety Concerns Mount Amid Lawsuits and Violence

March 18, 2026

SpaceX’s $60B Cursor Deal and IPO Plans Reshape AI Race

April 24, 2026

Microsoft data center initiative targets power, water

January 15, 2026

Related News

Close-up of a silver espresso machine extracting a fresh shot of coffee into a glass cup in a softly lit cafe setting.
News

Espresso Extraction Science: The Finer Grind Flaw

Nisha Pradhan Nisha Pradhan May 18, 2026
A smartphone resting on a wooden desk displaying an AI-powered Amazon search bar in a modern home office setting.
News

Amazon Alexa for Shopping Replaces Rufus AI Assistant

Sameer Katoch Sameer Katoch May 18, 2026
Wide news-style image showing an OpenAI office scene with screens displaying audio waveforms and voice technology graphics
News

OpenAI acquires Weights.gg to boost voice AI tools

Rakesh Paul Rakesh Paul May 18, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist