By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    A laptop displaying the Wikipedia logo with a prohibited symbol overlay, set in a dimly lit newsroom, representing Wikipedia's ban on AI-generated article content.
    Wikipedia Bans AI-Generated Text: What the New Policy Means
    March 27, 2026
    A wooden judge's gavel on a desk with glowing social media icons floating in a blurred courtroom background, representing the legal battle over tech addiction.
    Jury Finds Meta and Google Liable in Landmark Social Media Addiction Trial
    March 27, 2026
    A glowing digital brain hovering over academic research papers, representing the use of artificial intelligence in the scientific peer review process.
    AI Peer Review Reaches New Milestones in Academic Publishing
    March 27, 2026
    A glowing artificial intelligence logo displayed in a modern data center with financial charts and digital screens in the background.
    OpenAI Strategy Shift: Sora Shut Down Amid Tech Updates
    March 27, 2026
    A modern smartphone resting on a desk displaying a glowing Siri interface, representing Apple's upcoming artificial intelligence integrations in iOS 27.
    Apple Plans to Open Siri to Rival AI Assistants in iOS 27
    March 27, 2026
  • Technology
    TechnologyShow More
    A laptop displaying the Wikipedia logo with a prohibited symbol overlay, set in a dimly lit newsroom, representing Wikipedia's ban on AI-generated article content.
    Wikipedia Bans AI-Generated Text: What the New Policy Means
    March 27, 2026
    A modern smartphone resting on a desk displaying a glowing Siri interface, representing Apple's upcoming artificial intelligence integrations in iOS 27.
    Apple Plans to Open Siri to Rival AI Assistants in iOS 27
    March 27, 2026
    Amazon, AWS, Bahrain, cloud computing, drone strikes, Middle East conflict, data center outage
    AWS Bahrain Disruption Deepens Amid Middle East Conflict
    March 26, 2026
    A glowing digital neural network being compressed into a microchip with blurred, red downward-trending stock market charts in the background.
    Google TurboQuant Slashes AI Memory, Rattles Stocks
    March 26, 2026
    A hyper-realistic wide shot of a solemn modern courtroom interior featuring a wooden judge's bench and a screen displaying a social media logo.
    Meta Ordered to Pay $375 Million in Historic New Mexico Child Safety Lawsuit
    March 25, 2026
  • AI
    AIShow More
    A glowing artificial intelligence logo displayed in a modern data center with financial charts and digital screens in the background.
    OpenAI Strategy Shift: Sora Shut Down Amid Tech Updates
    March 27, 2026
    A sleek smartphone on a reflective corporate boardroom table showing the digital Sora logo dissolving into computer code.
    OpenAI Shuts Down Sora App and Ends $1B Disney Deal
    March 26, 2026
    A modern tech office conference table featuring a glowing artificial intelligence hologram, representing Meta's internal AI integration.
    Meta AI For Work Initiative Led by CTO Andrew Bosworth
    March 25, 2026
    Nvidia CEO Jensen Huang wearing a black leather jacket on stage at a technology conference, presenting new AI hardware to a large audience in a dimly lit auditorium.
    Nvidia GTC 2026: CEO Projects $1 Trillion AI Revenue
    March 24, 2026
    A brightly lit, modern data center featuring rows of advanced server racks with glowing blue lights and liquid cooling systems.
    Amazon Trainium Chips Win Over OpenAI, Apple, and Anthropic
    March 24, 2026
  • Science
    ScienceShow More
    A glowing digital brain hovering over academic research papers, representing the use of artificial intelligence in the scientific peer review process.
    AI Peer Review Reaches New Milestones in Academic Publishing
    March 27, 2026
    A human engineer interacts with glowing holographic data nodes representing autonomous AI agents in a modern, high-tech corporate workspace.
    Autonomous AI Agents Redefine Tech and Research in 2026
    March 26, 2026
    A glowing anatomical model of a human liver next to a bottle of Vitamin B3 capsules on a laboratory desk, representing new medical treatments for liver health.
    Vitamin B3 for Fatty Liver: A New Treatment Breakthrough
    March 25, 2026
    A hyper-realistic depiction of the Sun emitting a massive and bright solar flare toward a distant Earth in deep space.
    Powerful Solar Flares and Geomagnetic Storms Strike Earth
    March 25, 2026
    A futuristic, glowing quantum battery prototype sitting on a modern laboratory testing bench illuminated by subtle blue lasers.
    Quantum Battery Prototype Promises Near-Instant Charging
    March 24, 2026
  • World
    WorldShow More
    A wooden judge's gavel on a desk with glowing social media icons floating in a blurred courtroom background, representing the legal battle over tech addiction.
    Jury Finds Meta and Google Liable in Landmark Social Media Addiction Trial
    March 27, 2026
    A grand federal courthouse in Manhattan with imposing stone pillars and American flags, surrounded by a crowd of news reporters and camera crews behind barricades.
    Nicolas Maduro US Court Hearing Focuses on Legal Fees
    March 27, 2026
    Wide view of an investment conference in Miami with Delcy Rodríguez appearing on a large screen as business attendees watch in a modern ballroom.
    Venezuela Oil Investment Push Takes Center Stage
    March 26, 2026
    The Danish parliament building at dusk, symbolizing the political uncertainty following the deadlocked Denmark election.
    Denmark Election: Deadlocked Vote Forces Coalition Talks
    March 26, 2026
    A thick column of smoke rises over the dense Amazonian jungle in Putumayo, Colombia, following a tragic military plane crash.
    Colombia Military Plane Crash: 69 Dead in Putumayo
    March 25, 2026
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Google Launches Gemini 3.1 Flash Live AI Audio Model
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

Google Launches Gemini 3.1 Flash Live AI Audio Model

Sameer Katoch
Last updated: 27/03/2026
Sameer Katoch
Share
7 Min Read
A sleek smartphone and a glowing modern microphone on a desk with digital sound waves floating above, representing advanced AI audio technology and real-time voice processing.

Google and Cohere have officially introduced new artificial intelligence models optimized for audio and voice processing . Google released Gemini 3.1 Flash Live, an advanced audio-to-audio model designed for real-time dialogue and voice-first applications . Simultaneously, Cohere launched Cohere Transcribe, an AI algorithm built exclusively for highly accurate speech transcription . Both releases offer significant improvements in output quality, latency, and task execution over previous generations .

Contents
Enhancing Real-Time Dialogue and Multimodal FeaturesSetting New Benchmarks in Audio PerformanceGlobal Expansion and Security FeaturesCohere Transcribe Targets Speech Accuracy

Gemini 3.1 Flash Live is now rolling out across multiple Google platforms . Developers can access the preview version through the Gemini Live API in Google AI Studio . Enterprise clients can utilize the technology via Gemini Enterprise for Customer Experience to automate and manage customer service interactions . For everyday consumers, the model is available right now through Gemini Live and Search Live, bringing faster and more fluid voice interactions to mobile devices and Chromebooks .

Enhancing Real-Time Dialogue and Multimodal Features

Google built Gemini 3.1 Flash Live to handle the natural rhythm and speed of human speech . The model directly addresses common issues in voice AI, such as stuttering, hesitation, or user interruptions . It delivers much faster responses and can follow a conversation thread for twice as long as the previous model, keeping a user’s train of thought intact during lengthy brainstorming sessions .

The updated AI is significantly better at recognizing acoustic nuances, such as pitch and pace, compared to the 2.5 Flash Native Audio model . It can detect when a speaker is getting confused or frustrated and will dynamically adjust its tone and responses to match the situation . This makes it highly effective for enterprise customer support, where a voice agent could automatically process tasks like product return requests .

Furthermore, Gemini 3.1 Flash Live supports multimodal inputs . Users can combine speech with images to solve problems . For example, a customer dealing with a malfunctioning smart home appliance can upload a photo of the device and use voice commands to troubleshoot the issue . The model also features tool use capabilities, allowing it to retrieve relevant data from external sources, such as product documentation repositories, to assist users .

Setting New Benchmarks in Audio Performance

Google evaluated the new model’s tool use and reasoning capabilities through rigorous industry benchmarks . On ComplexFuncBench Audio, which measures multi-step function calling with various constraints, Gemini 3.1 Flash Live achieved a score of 90.8 percent . This represents a nearly 20 percent improvement over Google’s previous-generation model .

The AI also set a record on Scale AI’s Audio MultiChallenge, scoring 36.1 percent with its “thinking” feature enabled . This specific benchmark tests the model’s ability to follow complex instructions and perform long-horizon reasoning while navigating the interruptions and hesitations typical of real-world audio . Major companies, including Verizon, LiveKit, and The Home Depot, have already provided positive feedback after integrating the model into their workflows, highlighting its improved, natural conversation .

Global Expansion and Security Features

Because Gemini 3.1 Flash Live is inherently multilingual, Google is using this launch to expand Search Live globally . Users in more than 200 countries and territories can now engage in real-time, multimodal conversations with Search in their preferred languages .

To help prevent the spread of misinformation, Google has implemented a security feature called SynthID . All audio generated by Gemini 3.1 Flash Live includes an imperceptible SynthID watermark interwoven directly into the audio output . This ensures that AI-generated content can be reliably detected .

Cohere Transcribe Targets Speech Accuracy

Alongside Google’s release, Cohere introduced Cohere Transcribe, an AI model with a narrower focus built exclusively for transcription tasks . The company states that the algorithm is the most accurate in its category, achieving the top position on the Hugging Face Open ASR Leaderboard and demonstrating an average word error rate of just 5.42 percent .

Cohere Transcribe begins the transcript generation process by translating raw audio into mathematical representations that are easier to process . This task is performed by a Conformer algorithm, which combines a convolutional neural network—a type of AI often used for audio processing tasks—with a transformer model . A standalone transformer then uses those representations to generate the final text transcript . The model can output text in more than a dozen languages .

Despite its high accuracy, Cohere Transcribe operates efficiently . The model contains a total of 2 billion parameters across its components, requiring relatively little computing power to run . It is available under an open-source Apache 2.0 license, allowing companies to deploy the algorithm on their own infrastructure or through Cohere’s Model Vault managed inference service . Cohere also plans to integrate the transcription model into its North productivity platform, enabling workers to search business documents and automate repetitive tasks .

TAGGED: AI audio models, Cohere Transcribe, Gemini 3.1 Flash Live, Google AI, Google AI Studio, machine learning, voice AI
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Sameer Katoch
As the Founder of VellaTimes and an avid traveler, I'm passionate about the daily news events happening globally. With over five years of experience in the writing field, I am committed to delivering top-notch news that satisfies your daily news intake.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

NASA Repairs Artemis II Rocket Ahead of Targeted April Moon Launch

March 5, 2026

Tinder Turns to New “Chemistry” AI Feature to Fight Swipe Fatigue

February 5, 2026

Brazil Floods: Death Toll Rises Amid Heavy Rain in Southeast

February 28, 2026

Venezuela Amnesty Law: 1,500 Apply, 200+ on Hunger Strike

February 23, 2026

Massive Russian Air Strike Overshadows Opening of US-Led Peace Talks in Geneva

February 17, 2026

Meta Corning fiber optic deal: up to $6B agreement

January 28, 2026

Related News

A laptop displaying the Wikipedia logo with a prohibited symbol overlay, set in a dimly lit newsroom, representing Wikipedia's ban on AI-generated article content.
News

Wikipedia Bans AI-Generated Text: What the New Policy Means

Rakesh Paul Rakesh Paul March 27, 2026
A wooden judge's gavel on a desk with glowing social media icons floating in a blurred courtroom background, representing the legal battle over tech addiction.
News

Jury Finds Meta and Google Liable in Landmark Social Media Addiction Trial

Editorial Staff Editorial Staff March 27, 2026
A glowing digital brain hovering over academic research papers, representing the use of artificial intelligence in the scientific peer review process.
News

AI Peer Review Reaches New Milestones in Academic Publishing

Nisha Pradhan Nisha Pradhan March 27, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist