Table of Contents
What Is Qwen Max?
Is Qwen Better Than DeepSeek?
Benchmark Comparisons
Home Technology peripherals AI This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

Feb 07, 2025 am 06:01 AM

Before DeepSeek could fade out the headline, another Chinese AI model has come to upseat it. Chinese e-commerce giant Alibaba has announced a new version of its Qwen family of AI models, and there's a lot to like—it's better than DeepSeek and ChatGPT in some key areas.

What Is Qwen Max?

Qwen Max is the latest and the most powerful AI model in Alibaba's Qwen AI family. Currently, the following Qwen AI models are available:

  • Qwen2.5-Plus
  • Qwen2.5-Max
  • Qwen2.5-VL-72B-Instruct
  • Qwen2.5-14B-Instruct-1M
  • QVQ-72B-Preview
  • QwQ-32B-Preview
  • Qwen2.5-Coder-32B-Instruct
  • Qwen2.5-Turbo
  • Qwen2.5-72B-Instruct

All of the models above are free to use once you've created an account via email, Google, or GitHub. Qwen's AI models are also open-source, meaning you can find them on GitHub or HuggingFace. You can also install them locally on your device (depending on its specs), allowing you to run the AI offline.

This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

Qwen2.5-Max is a 72-billion parameter Mixture-of-Experts (MoE) model, supports 29 languages, and is trained on over 20 trillion tokens. It can also handle up to 128,000 tokens in a single conversation, meaning running lengthy documents through the AI will not be an issue. If you're working with data, Qwen can process structured formats like tables, CSVs, and JSON files.

As the names suggest, some Qwen models are better at specific tasks. For example, Qwen2.5-Coder-32B-Instruct excels at coding tasks, while QwQ-32B-Preview is capable of thinking and reasoning. Not all of the models can do everything, but most models can handle text prompts, as well as image and video generation.

Another rather unique feature is the ability to combine two models together. In my experience, you can get slightly better results when combining two versions into a stronger pairing. For example, pairing Qwen2.5-Max with Qwen2.5-Coder-32B-Instruct helped me generate code with fewer prompts and issues in the output.

This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

Qwen's website is the only place to officially access the AI model. You can type in prompts and work with text, but there are image and video generation capabilities as well and in multiple aspect ratios. There's a Web Search feature that has yet to be launched.

On the downside, sometimes, Qwen takes a while to process your prompts. So much so that, at first, I thought the website wasn't functioning correctly. I found that the first prompt you send in a conversation can take about 30 seconds to generate a response, after which the responses speed up.

Images and videos are generated faster than I expected. They aren't top-of-the-line when it comes to quality or realism, but if you need to generate a quick image in a pinch, they'll do. You can expect a fair amount of random artifacts in most generated media as well.

This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

The increased response time could just be because of server load, as was the case with almost every AI chatbot at launch, including DeepSeek and ChatGPT. I did receive errors connecting to Qwen as there were too many requests in the queue from time to time.

Is Qwen Better Than DeepSeek?

Technically speaking, Qwen is better than DeepSeek across the board. Alibaba's model feels more natural to interact with and runs ever so slightly faster. However, if you were to ignore benchmark results, you'd be hard-pressed to find differences between the two.

Qwen's major advantage over DeepSeek is its better alignment with human preferences, making it easier to type in more complicated prompts and get accurate responses without much fine-tuning. Even simple one-liners can generate quite a detailed response with lots of information. DeepSeek, while a terrific AI model, can take a few tries and some prompt engineering before you get your desired results.

This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek

When it comes to general knowledge and factual accuracy, both models perform similarly, but Qwen does have a slight edge when it comes to factual consistency.

One area where DeepSeek is the clear winner, though, is the usage cost. DeepSeek costs $0.25 per million tokens, while Qwen costs $0.38. That said, it's still significantly cheaper than the $5 and $3 rates offered by GPT-4o and Claude 3.5, respectively.

Benchmark Comparisons

As mentioned before, Qwen outperforms DeepSeek pretty much across the board when it comes to benchmarks.

Benchmark

Qwen 2.5 Max

DeepSeek V3 R1

Arena-Hard

89.4

85.5

MMLU-Pro

76.1

75.9

GPQA-Diamond

60.1

59.1

LiveCodeBench

38.7

37.6

LiveBench

62.2

60.5

Benchmark
Qwen 2.5 Max DeepSeek V3 R1
Arena-Hard 89.4 85.5
MMLU-Pro 76.1 75.9
GPQA-Diamond 60.1 59.1
LiveCodeBench 38.7 37.6
LiveBench 62.2 60.5

The benchmarks clearly show Qwen's better understanding and alignment with human values. Apart from that, in terms of knowledge and reasoning, general knowledge, coding, and overall ability, it's only slightly better than DeepSeek.

These two AI models from China have introduced a new benchmark for AI development. There are security and privacy concerns, though, especially considering DeepSeek has already suffered its first data breach. Still, Qwen and DeepSeek's AI models are clearly better than their Western counterparts in terms of performance and have really put the AI world on edge.

The above is the detailed content of This Chinese Tech Giant Just Launched an AI Chatbot—and It's Better Than DeepSeek. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Nordhold: Fusion System, Explained
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1670
14
PHP Tutorial
1276
29
C# Tutorial
1256
24
How to Build MultiModal AI Agents Using Agno Framework? How to Build MultiModal AI Agents Using Agno Framework? Apr 23, 2025 am 11:30 AM

While working on Agentic AI, developers often find themselves navigating the trade-offs between speed, flexibility, and resource efficiency. I have been exploring the Agentic AI framework and came across Agno (earlier it was Phi-

How to Add a Column in SQL? - Analytics Vidhya How to Add a Column in SQL? - Analytics Vidhya Apr 17, 2025 am 11:43 AM

SQL's ALTER TABLE Statement: Dynamically Adding Columns to Your Database In data management, SQL's adaptability is crucial. Need to adjust your database structure on the fly? The ALTER TABLE statement is your solution. This guide details adding colu

OpenAI Shifts Focus With GPT-4.1, Prioritizes Coding And Cost Efficiency OpenAI Shifts Focus With GPT-4.1, Prioritizes Coding And Cost Efficiency Apr 16, 2025 am 11:37 AM

The release includes three distinct models, GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, signaling a move toward task-specific optimizations within the large language model landscape. These models are not immediately replacing user-facing interfaces like

Beyond The Llama Drama: 4 New Benchmarks For Large Language Models Beyond The Llama Drama: 4 New Benchmarks For Large Language Models Apr 14, 2025 am 11:09 AM

Troubled Benchmarks: A Llama Case Study In early April 2025, Meta unveiled its Llama 4 suite of models, boasting impressive performance metrics that positioned them favorably against competitors like GPT-4o and Claude 3.5 Sonnet. Central to the launc

New Short Course on Embedding Models by Andrew Ng New Short Course on Embedding Models by Andrew Ng Apr 15, 2025 am 11:32 AM

Unlock the Power of Embedding Models: A Deep Dive into Andrew Ng's New Course Imagine a future where machines understand and respond to your questions with perfect accuracy. This isn't science fiction; thanks to advancements in AI, it's becoming a r

How ADHD Games, Health Tools & AI Chatbots Are Transforming Global Health How ADHD Games, Health Tools & AI Chatbots Are Transforming Global Health Apr 14, 2025 am 11:27 AM

Can a video game ease anxiety, build focus, or support a child with ADHD? As healthcare challenges surge globally — especially among youth — innovators are turning to an unlikely tool: video games. Now one of the world’s largest entertainment indus

Rocket Launch Simulation and Analysis using RocketPy - Analytics Vidhya Rocket Launch Simulation and Analysis using RocketPy - Analytics Vidhya Apr 19, 2025 am 11:12 AM

Simulate Rocket Launches with RocketPy: A Comprehensive Guide This article guides you through simulating high-power rocket launches using RocketPy, a powerful Python library. We'll cover everything from defining rocket components to analyzing simula

Google Unveils The Most Comprehensive Agent Strategy At Cloud Next 2025 Google Unveils The Most Comprehensive Agent Strategy At Cloud Next 2025 Apr 15, 2025 am 11:14 AM

Gemini as the Foundation of Google’s AI Strategy Gemini is the cornerstone of Google’s AI agent strategy, leveraging its advanced multimodal capabilities to process and generate responses across text, images, audio, video and code. Developed by DeepM

See all articles