© 2022 VellaTimes • All Rights Reserved.
News

Alibaba Launches Qwen3.5-Omni Multimodal AI to Rival Gemini

Rakesh Paul
Last updated: 31/03/2026
A futuristic server room screen displaying a glowing AI neural network merging soundwaves, text code, and video frames, representing the native multimodal capabilities of an advanced AI model.

On March 30, 2026, Alibaba introduced Qwen3.5-Omni, a native multimodal AI model designed to process text, images, audio, and video simultaneously. Rather than stitching together separate text and vision tools as older systems do, the new release handles all data types in a single unified computational pipeline. The model aims to compete directly with major industry players, offering real-time interaction, complex problem solving, and advanced reasoning for both enterprise and everyday users.

Contents
  • Unified Architecture and Context Capacity
  • Outperforming Gemini in Audio Benchmarks
  • Audio-Visual Vibe Coding and Real-Time Voice
  • Conflicting Reports on Open-Source Availability
  • Leadership Changes at Alibaba

The Qwen3.5-Omni series is available in three sizes to balance performance and cost. The Plus tier focuses on maximum accuracy and complex reasoning, the Flash version prioritizes high throughput and low latency, and the Light variant is built for efficiency.

Unified Architecture and Context Capacity

All three models share a 256,000-token context window, enough to process over ten hours of continuous audio or more than 400 seconds of 720p video at one frame per second. The system relies on a Thinker-Talker architecture powered by a Hybrid-Attention Mixture of Experts framework. The Thinker component handles all multimodal reasoning and text generation, analyzing everything from visual cues to spoken words. The Talker component then converts those internal representations into streaming speech output for real-time conversations.
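As a rough illustration of what those figures imply, the short Python sketch below works out the largest per-second and per-frame token budgets that would still fit in the window. It assumes the entire 256,000-token window is spent on a single modality, which the article does not state; the function name is made up for this example, and the results are upper bounds rather than published tokenization rates.

```python
# Back-of-the-envelope budget implied by the figures quoted above.
# Assumption (not from Alibaba): the whole window goes to one modality.
CONTEXT_TOKENS = 256_000
AUDIO_SECONDS = 10 * 60 * 60   # "over ten hours" of continuous audio
VIDEO_FRAMES = 400 * 1         # "more than 400 seconds" at 1 frame per second


def max_tokens_per_unit(context_tokens: int, units: int) -> float:
    """Upper bound on tokens available per unit (a second or a frame)."""
    return context_tokens / units


audio_rate = max_tokens_per_unit(CONTEXT_TOKENS, AUDIO_SECONDS)
video_rate = max_tokens_per_unit(CONTEXT_TOKENS, VIDEO_FRAMES)

print(f"audio: at most {audio_rate:.1f} tokens per second")
print(f"video: at most {video_rate:.0f} tokens per frame")
```

Because the article says "over" ten hours and "more than" 400 seconds, the real per-unit costs would sit below these ceilings of roughly 7.1 tokens per second of audio and 640 tokens per video frame.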

Outperforming Gemini in Audio Benchmarks

Pre-trained on over 100 million hours of native audio-visual data, the new model sets several performance milestones. The flagship Plus version achieved state-of-the-art results across 215 audio and audio-visual subtasks. It outperforms Google’s Gemini 3.1 Pro in general audio understanding, reasoning, speech recognition, and translation, while matching the Google flagship in overall audio-visual comprehension. Beyond audio, the Kursol blog reports that the model matches GPT-5.4 in many core reasoning domains, making it a competitive alternative in the broader AI market.

The system brings significant upgrades to language support. Speech recognition now handles 113 languages and dialects, made up of 74 languages and 39 Chinese dialects. This is a large jump from the previous generation, which supported only 11 languages and 8 dialects. The model also generates speech in 36 languages and dialects, offering 55 different voices. In tests evaluating multilingual voice stability across 20 languages, it outperformed competitors such as ElevenLabs, GPT-Audio, and Minimax.

Audio-Visual Vibe Coding and Real-Time Voice

One capability that emerged during training is what the team calls audio-visual vibe coding. Instead of writing a text prompt, developers can point a camera at a software interface or a physical object and speak their instructions aloud, and the model will generate functional code to address the request. It processes the visual evidence and the spoken intent together in a single pass, writing code directly from video and voice inputs. In one demonstration, the model built a working snake game from a brief verbal description and a video clip.

To handle real-time voice interactions smoothly, Alibaba developed the Adaptive Rate Interleave Alignment technique. Because text and speech tokens are processed at different speeds, streaming voice AI often suffers from dropped words or stuttering. The new alignment method dynamically synchronizes text and speech units, improving the naturalness of the voice output without increasing delay or sacrificing performance.
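The article gives no implementation details for this technique, so the following Python sketch is only a toy illustration of the underlying problem: merging two token streams of different lengths so that neither runs far ahead of the other. The function name and the progress-based rule are assumptions made for this example and should not be read as Alibaba's actual method.

```python
from typing import List


def interleave_by_progress(text: List[str], speech: List[str]) -> List[str]:
    """Merge two streams in order, always emitting from the one that is behind.

    Hypothetical helper: a fixed proportional rule, not the adaptive
    alignment described in the article.
    """
    merged: List[str] = []
    t = s = 0
    while t < len(text) or s < len(speech):
        # Fraction of each stream already emitted (1.0 once exhausted).
        text_progress = t / len(text) if text else 1.0
        speech_progress = s / len(speech) if speech else 1.0
        if t < len(text) and text_progress <= speech_progress:
            merged.append(text[t])
            t += 1
        else:
            merged.append(speech[s])
            s += 1
    return merged


# Two text tokens paired with four speech units.
print(interleave_by_progress(["T0", "T1"], ["S0", "S1", "S2", "S3"]))
```

With two text tokens and four speech units, each text token ends up followed by two speech units, so both streams finish together instead of one draining first.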

The update also introduces native semantic interruption for voice assistants. The AI can distinguish between harmless background noise, simple listener feedback, and an actual attempt by the user to interrupt the conversation. This allows for more natural, human-like turn-taking without the AI cutting off its response prematurely. The system also includes built-in live web search, so it can answer questions about current events without relying on separate pipelines, and a voice cloning feature that lets users generate custom voices from short reference clips.

Conflicting Reports on Open-Source Availability

Reports conflict regarding the model’s availability to the public. According to the news outlet The Decoder and a briefing by The Information, Alibaba has not released the model weights openly, making Qwen3.5-Omni accessible only as a paid API service. However, the Build Fast with AI blog states that while the Plus and Flash versions are limited to Alibaba Cloud’s DashScope API, the Light variant is available as open weights on Hugging Face.

Leadership Changes at Alibaba

This major technical release arrives during a period of internal change at Alibaba. The Decoder reports that Junyang Lin, the chief AI developer behind the Qwen series, recently announced his sudden departure alongside other key team leaders. The exits reportedly stem from a management shakeup involving a new researcher hired from Google’s Gemini team. In response, Alibaba CEO Eddie Wu announced a new Foundation Model Task Force to maintain the company’s strategic focus on AI development.

TAGGED: AI voice assistants, Alibaba, Artificial Intelligence, Gemini 3.1 Pro, machine learning, multimodal AI, Qwen3.5-Omni
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.