By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Pratzo - Daily NewsPratzo - Daily NewsPratzo - Daily News
Notification Show More
Font ResizerAa
  • Technology
    • AI & Machine Learning
    • Software & Apps
    • Hardware & Gadgets
    Technology
    Show More
    Top News
    South Korea’s Central Bank Dismisses Bitcoin as Reserved Asset Citing Uncertainty, Risks: Report
    March 17, 2025
    Zoom AI Companion Is Being Upgraded With Agentic Capabilities and New AI Features
    March 18, 2025
    Vivo X200 Ultra Colour Options Leaked; Tipped to Get 2K Resolution Display
    March 19, 2025
    Latest News
    Vivo X Fold 5 Colour Options, Specifications Teased Ahead of India Launch
    July 2, 2025
    Alienware Area-51, Alienware Aurora Desktops With Latest Intel Core Ultra CPUs Launched in India
    July 2, 2025
    Grammarly Announces Plans to Acquire Email App Superhuman to Create Agentic Productivity Platform
    July 2, 2025
    Amazon Prime Day 2025 Sale: Discounts on Electronics and Bank Offers Revealed
    July 2, 2025
  • Digital Marketing
    • Social Media Updates
    • PPC & Ads Insights
    • SEO Trends
    • Content Marketing Strategies
    Digital MarketingShow More
    70% of Senior Marketers Support Google’s Decision to Retain Third-Party Cookies on Chrome
    December 6, 2024
  • Lifestyle & Productivity
    • Personal Productivity Tools
    • Smart Home Tech
    • Wearables
    • Wellness Gadgets
    Lifestyle & ProductivityShow More
    Allu Arjun’s Bail Hearing Postponed to January 3
    December 31, 2024
    Pushpa 2 Full Movie Leaked Online
    Pushpa 2 Full Movie Leaked Online: A Major Setback Despite Record Pre-Sales
    December 5, 2024
    Pushpa 2: The Rule Movie Review – A Gripping Mass Entertainer
    December 5, 2024
  • Automobile
    AutomobileShow More
    New Petrol Price in India: Crude Oil Prices Fall – Check Today’s Rates
    January 25, 2025
    All-New Honda Amaze 2025 Launched in India – Prices Start at ₹7.99 Lakh
    December 5, 2024
    Mahindra XEV 9e Launched In India Priced At ₹ 21.90 Lakh: Check Range, Features, and More
    November 27, 2024
Reading: Alibaba Qwen 2.5 Omni AI Model With Real-Time Speech Generation Released
Share
Font ResizerAa
Pratzo - Daily NewsPratzo - Daily News
Search
Follow US
Pratzo - Daily News > Technology > Alibaba Qwen 2.5 Omni AI Model With Real-Time Speech Generation Released
Technology

Alibaba Qwen 2.5 Omni AI Model With Real-Time Speech Generation Released

admin
Last updated: March 28, 2025 4:56 am
admin Published March 28, 2025
Share
SHARE

Alibaba’s Qwen team released a new artificial intelligence (AI) model in the Qwen 2.5 family on Wednesday. Dubbed Qwen 2.5 Omni, it is a flagship-tier end-to-end multimodal model. The company claims it can process a wide range of inputs, including text, images, audio, and videos, while generating real-time text and natural speech responses. It is said to enable the building and deployment of cost-effective AI agents due to its diverse skill set. Alibaba has also employed a new “Thinker-Talker” architecture for the Qwen 2.5 Omni AI model.

Qwen 2.5 Omni AI Model Released

In a blog post, the Qwen team detailed the new Qwen 2.5 Omni AI model, which is a seven-billion-parameter system. The most notable capability of this omnimodal model is the real-time speech generation and video chat capability, which will allow the large language model (LLM) to answer queries and interact with users verbally in a humanlike manner. So far, this capability is only available with Google and OpenAI’s models, which are closed-source. Alibaba, on the other hand, has open-sourced the technology.

Coming to the features, it accepts text, images, audio, and video as input as well as output. The model is also capable of real-time voice interactions and video chats. The Qwen team also highlights that the model will also offer real-time streaming of speech in a natural manner. Additionally, it is claimed to come with enhanced performance in end-to-end speech instruction.

The Qwen team highlighted that the Omni model is built on a novel “Thinker-Talker” architecture. The Thinker component functions like a brain and is responsible for processing and understanding input across modalities, and generating text output. It is essentially a Transformer decoder that encodes audio and image and assists with information extraction.

qwen omni benchmark Qwen Omni benchmark

Qwen 2.5 Omni benchmark
Photo Credit: Alibaba

 

On the other hand, the Talker component operates like a human mouth, the researchers said. It streams the information produced by the Thinker component and generates a stream-like output for speech fluidity. It is designed as a dual-track autoregressive Transformer decoder. This entire architecture operates as a single model, allowing real-time text and speech generation, enabling end-to-end training and inference.

Based on internal testing, the Qwen 2.5 Omni AI model is said to outperform the Gemini 1.5 Pro model on the OmniBench. It also outperforms Qwen 2.5-VL-7B, Qwen2-Audio on single-modality tasks.

The AI model is now available on Alibaba’s Hugging Face listing and GitHub listing. Additionally, users can test out the new model via Qwen Chat as well as the company’s community ModelScope.

source

You Might Also Like

Vivo X Fold 5 Colour Options, Specifications Teased Ahead of India Launch

Alienware Area-51, Alienware Aurora Desktops With Latest Intel Core Ultra CPUs Launched in India

Grammarly Announces Plans to Acquire Email App Superhuman to Create Agentic Productivity Platform

Amazon Prime Day 2025 Sale: Discounts on Electronics and Bank Offers Revealed

WWE 2K25 Launches on Nintendo Switch 2 This Month, Pre-Orders Now Live

TAGGED:Satellite TechnologySpace TechnologyTechnology
Share This Article
Facebook Twitter Email Print
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Current Gold Rate: 3681.90 INR per gram

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

    Popular News
    Technology

    UPI Services Hit by Another Outage, NPCI Says ‘Working to Resolve the Issue’

    admin admin April 12, 2025
    China’s DeepSeek Unveils Latest Update in Race With OpenAI
    Anthropic Researchers Make Major Breakthrough In Understanding How an AI Model Thinks
    Google's March Pixel Drop Brings Gemini Live Upgrades, Scam Detection in Messages and More
    Audible to Partner With Publishers to Create AI-Voiced Audiobooks
    - Advertisement -
    Ad imageAd image

    Always Stay Up to Date

    Subscribe to our newsletter to get our newest articles instantly!

      About US

      At News.Pratzo.com, we are shaping the conversation in business and technology with reliable insights and updates. As part of the Pratzo.com brand, we aim to be your trusted source for impactful stories and trends, empowering professionals and enthusiasts alike. Stay informed, inspired, and ahead with us!
      Quick Link
      • Automobile
      • News
      • Cricket
      • Lifestyle & Productivity
      • Entertainment
      • Reviews & Comparisons
      • Digital Marketing
      • SEO Trends
      • Technology
      • AI & Machine Learning

      © Flair Hair & Beauty Salon London 2025

      © Pratzo News Network. Assets of Pratzo.com . All Rights Reserved.
      Go to mobile version