Software:Qwen

From HandWiki
Qwen
Screenshot
Screenshot of an example of a Qwen 3 answer describing Wikipedia, with the "Thinking" feature enabled
Developer(s)Alibaba Cloud
Initial releaseApril 2023; 2 years ago (2023-04)
Stable release
Qwen3-Max /
September 5, 2025; 5 months ago (2025-09-05)
Qwen3-235B-A22B /
July 25, 2025; 6 months ago (2025-07-25)
Qwen3-Next /
September 11, 2025; 4 months ago (2025-09-11)
Repositorygithub.com/QwenLM/Qwen
Written inPython
Operating system
TypeChatbot
LicenseApache-2.0
Qwen Research License
Qwen License
Websitechat.qwen.ai
Qwen
Script error: No such module "Infobox multi-lingual name".

Qwen (also known as Tongyi Qianwen, Chinese: 通义千问; pinyin: Tōngyì Qiānwèn) is a family of large language models developed by Alibaba Cloud. Many Qwen variants are distributed as open‑weight models under the Apache‑2.0 license, while others are served through Alibaba Cloud.[1]

In July 2024, South China Morning Post reported that benchmarking platform SuperCLUE ranked Qwen2‑72B‑Instruct behind OpenAI's GPT‑4o and Anthropic’s Claude 3.5 Sonnet and ahead of other Chinese models.[2]

Models

Alibaba launched a beta of Qwen in April 2023 under the name Tongyi Qianwen, then opened it for public use in September 2023 after regulatory clearance.[3][4]

The model's architecture was based on the Llama architecture developed by Meta AI.[5][6] In December 2023, it released its 72B and 1.8B models for download, while Qwen 7B weights were released in August.[7][8] Their models are sometimes described as open source, but the training code has not been released nor has the training data been documented, and they do not meet the terms of either the Open Source AI Definition or the Model Openness Framework from the Linux Foundation.

In June 2024 Alibaba launched Qwen2 and in September it released some of its models with open weights, while keeping its most advanced models proprietary.[9][10] Qwen2 contains both dense and sparse models.[11]

In November 2024, QwQ-32B-Preview, a model focusing on reasoning similar to OpenAI's o1, was released under the Apache 2.0 License, although only the weights were released, not the dataset or training method.[12][13] QwQ has a 32K token context length and performs better than o1 on some benchmarks.[14]

The Qwen-VL series is a line of visual language models that combines a vision transformer with a LLM.[5][15] Alibaba released Qwen2-VL with variants of 2 billion and 7 billion parameters.[16][17][18]

In January 2025, Qwen2.5-VL was released with variants of 3, 7, 32, and 72 billion parameters.[19] All models except the 72B variant are licensed under the Apache 2.0 license.[20] Qwen-VL-Max is Alibaba's flagship vision model as of 2024, and is sold by Alibaba Cloud at a cost of US$0.41 per million input tokens.[21]

Alibaba has released several other model types such as Qwen-Audio and Qwen2-Math.[22] In total, it has released more than 100 open weight models, with its models having been downloaded more than 40 million times.[10] Fine-tuned versions of Qwen have been developed by enthusiasts, such as "Liberated Qwen", developed by San Francisco-based Abacus AI, which is a version that responds to any user request without content restrictions.[23]

On January 29, 2025, Alibaba launched Qwen2.5-Max. According to a blog post from Alibaba, Qwen2.5-Max outperforms other foundation models such as GPT-4o, DeepSeek-V3, and Llama-3.1-405B in key benchmarks.[24][25] In February 2025, Alibaba announced on their official X account that the 2.5-Max model would be opened up, however it has not been released.[26]

On March 24, 2025, Alibaba launched Qwen2.5-VL-32B-Instruct as a successor to the Qwen2.5-VL model. It was released under the Apache 2.0 license.[27][28]

On March 26, 2025, Qwen2.5-Omni-7B was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face, GitHub, and ModelScope.[29] The Qwen2.5-Omni model accepts text, images, videos, and audio as input and can generate both text and audio as output, allowing it to be used for real-time voice chatting, similar to OpenAI's GPT-4o.[29]

On April 28, 2025, the Qwen3 model family was released,[30] with all models licensed under the Apache 2.0 license. The Qwen3 model family includes both dense (0.6B, 1.7B, 4B, 8B, 14B, and 32B parameters) and sparse models (30B with 3B activated parameters, 235B with 22B activated parameters). They were trained on 36 trillion tokens in 119 languages and dialects.[31] All models except the 0.6B, 1.7B, and 4B variants have a 128K token context window. Like OpenAI's o1 and QwQ 32B, the Qwen3 models support reasoning, which can be enabled or disabled through the tokenizer. The Qwen3 models are available through chat.qwen.ai and can be downloaded via Hugging Face and ModelScope.[32]

On September 5, 2025, Alibaba launched Qwen3-Max.[33] According to Alibaba's official X account, it outperforms other foundation non-reasoning models such as Qwen3-235B-A22B-Instruct-2507, Kimi K2, Claude 4 Opus Non-thinking, and DeepSeek V3.1.[34] There is no dedicated thinking mode for Qwen3-Max as of yet.[35]

On September 10, 2025, Qwen3-Next was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and Model Scope. Qwen3-Next includes two post-trained Instruct and Thinking models. Qwen3-Next was created with a new model-architecture called Qwen3-Next, in the belief that Context Length Scaling and Total Parameter Scaling are two major trends in the future of large models. Qwen3-Next introduces several key improvements over the Qwen3 architecture: a hybrid attention mechanism, a highly sparse mixture-of-experts (MoE) structure, training-stability-friendly optimizations, and a multi-token prediction mechanism for faster inference. Based on the Qwen3-Next architecture, a model with 80B total parameters and 3B active parameters was created. The Qwen3-Next model performs comparable to, or in some cases better than, Qwen3-32b while using less than 10% of its training cost (in GPU hours). In inference, especially with contexts greater than 32K tokens, it reaches greater than 10x higher throughput. Qwen3.5 will use a refined version of the Qwen3-Next architecture.[36]

On September 22, 2025, Qwen3-Omni was release under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and Model Scope. Qwen3-Omni is a mixed/multimodal model that can process text, images, audio, and video, and deliver real-time streaming responses in both text and natural speech.[37]

List of models
Version Release date Template:Reference column heading
Tongyi Qianwen 2023-9 [38]
Qwen-VL 2023-8 [39]
Qwen2 2024-6 [10]
Qwen2-Audio 2024-8 [40]
Qwen2-VL 2024-12 [16]
Qwen2.5 2024-9 [41]
Qwen2.5-Coder 2024-11 [42]
QvQ 2024-12 [43]
Qwen2.5-VL 2025-1 [44]
QwQ-32B 2025-3 [45]
Qwen2.5-Omni 2025-3 [29]
Qwen3 2025-4 [30]
Qwen3-Coder July 2025 [46]
Qwen3-Max September 2025 [33]
Qwen3-Next September 2025 [47]
Qwen3-Omni September 2025 [37]

See also

References

  1. Mo, Liam; Hall, Casey (19 September 2024). "Alibaba accelerates AI push by releasing new open-source models, text-to-video". Reuters. https://www.reuters.com/technology/alibaba-accelerates-ai-push-by-releasing-new-open-source-models-text-to-video-2024-09-19/. 
  2. Jiang, Ben (11 July 2024). "Alibaba's open-source AI model tops Chinese rivals, ranks 3rd globally" (in en). https://www.scmp.com/tech/big-tech/article/3270079/alibabas-ai-model-outperforms-chinese-rivals-ranks-just-behind-openai-anthropic. 
  3. Horwitz, Josh; Ye, Josh (11 April 2023). "Alibaba to roll out generative AI across apps". Reuters. https://www.reuters.com/technology/alibaba-unveils-tongyi-qianwen-an-ai-model-similar-gpt-2023-04-11/. 
  4. Hall, Casey (13 September 2023). "Alibaba opens AI model Tongyi Qianwen to the public". Reuters. https://www.reuters.com/business/retail-consumer/alibaba-opens-ai-model-tongyi-qianwen-public-2023-09-13/. 
  5. 5.0 5.1 Bai, Jinze; et al. (28 September 2023). "Qwen Technical Report". arXiv:2309.16609 [cs.CL].
  6. "Qwen/techmemo-draft.md" (in en). August 3, 2023. https://github.com/QwenLM/Qwen/blob/ba2d85a13b28ed1ee0dde2d6c3e4d5a55dc5964c/techmemo-draft.md. 
  7. Fan, Feifei (2023-12-01). "Alibaba unveils new Tongyi Qianwen AI language model". https://global.chinadaily.com.cn/a/202312/01/WS6569d3eca31090682a5f1046.html. 
  8. Ye, Josh (August 3, 2023). "Alibaba rolls out open-sourced AI model to take on Meta's Llama 2". https://www.reuters.com/technology/alibaba-unveils-open-sourced-ai-model-similar-metas-llama-2-2023-08-03/. 
  9. Jiang, Ben (7 June 2024). "Alibaba says new AI model Qwen2 bests Meta's Llama 3 in tasks like maths and coding" (in en). https://www.scmp.com/tech/big-tech/article/3265845/alibaba-says-new-ai-model-qwen2-bests-metas-llama-3-tasks-maths-and-coding. 
  10. 10.0 10.1 10.2 Kharpal, Arjun (19 September 2024). "China's Alibaba launches over 100 new open-source AI models, releases text-to-video generation tool" (in en). https://www.cnbc.com/2024/09/19/alibaba-launches-over-100-new-ai-models-releases-text-to-video-generation.html. 
  11. Yang, An; et al. (10 September 2024). "Qwen2 Technical Report". arXiv:2407.10671 [cs.CL].
  12. Dickson, Ben (29 November 2024). "Alibaba releases Qwen with Questions, an open reasoning model that beats o1-preview". https://venturebeat.com/ai/alibaba-releases-qwen-with-questions-an-open-reasoning-model-that-beats-o1-preview/. 
  13. "阿里通义千问 QwQ 登场:开源 AI 推理新王,MATH 测试超 OpenAI o1 模型 - IT之家". 2024-11-28. https://www.ithome.com/0/813/799.htm. 
  14. Wiggers, Kyle (27 November 2024). "Alibaba releases an 'open' challenger to OpenAI's o1 reasoning model". https://techcrunch.com/2024/11/27/alibaba-releases-an-open-challenger-to-openais-o1-reasoning-model/. 
  15. Browne, Ryan (31 December 2024). "Alibaba slashes prices on large language models by up to 85% as China AI rivalry heats up" (in en). https://www.cnbc.com/2024/12/31/alibaba-baba-cloud-unit-slashes-prices-on-ai-models-by-up-to-85percent.html. 
  16. 16.0 16.1 Franzen, Carl (29 August 2024). "Alibaba releases new AI model Qwen2-VL that can analyze videos more than 20 minutes long". https://venturebeat.com/ai/alibaba-releases-new-ai-model-qwen2-vl-that-can-analyze-videos-more-than-20-minutes-long/. 
  17. "阿里通义千问推出 Qwen2-VL:开源 2B / 7B 参数 AI 大模型,处理任意分辨率图像无需分割成块". 2024-08-30. https://www.ithome.com/0/792/161.htm. 
  18. Wang, Peng; Bai, Shuai; Tan, Sinan; Wang, Shijie; Fan, Zhihao; Bai, Jinze; Chen, Keqin; Liu, Xuejing et al. (September 18, 2024). "Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution". Cs.CV. 
  19. "Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL!" (in en). 2025-01-26. https://qwenlm.github.io/blog/qwen2.5-vl/. 
  20. "Qwen/Qwen2.5-VL-72B-Instruct · Hugging Face". 2025-04-28. https://huggingface.co/Qwen/Qwen2.5-VL-72B-Instruct. 
  21. Jiang, Ben (31 December 2024). "Alibaba Cloud cuts AI visual model price by 85% on last day of the year" (in en). https://www.scmp.com/tech/big-tech/article/3292916/alibaba-cuts-ai-visual-model-cost-85-last-day-year-price-war-rages. 
  22. Franzen, Carl (8 August 2024). "Alibaba claims no. 1 spot in AI math models with Qwen2-Math". https://venturebeat.com/ai/alibaba-claims-no-1-spot-in-ai-math-models-with-qwen2-math/. 
  23. Mims, Christopher (April 19, 2024). "Here Come the Anti-Woke AIs". https://www.wsj.com/tech/ai/here-come-the-anti-woke-ais-76a1cfcc. 
  24. "Qwen2.5-Max: Exploring the Intelligence of Large-scale MoE Model" (in en). 29 January 2025. https://qwenlm.github.io/blog/qwen2.5-max/. 
  25. Baptista, Eduardo (January 29, 2025). "Alibaba releases AI model it says surpasses DeepSeek". https://www.reuters.com/technology/artificial-intelligence/alibaba-releases-ai-model-it-claims-surpasses-deepseek-v3-2025-01-29/. 
  26. Qwen, Alibaba (February 24, 2025). "QwQ-Max-Preview". https://x.com/Alibaba_Qwen/status/1894130603513319842. 
  27. "Qwen2.5-VL-32B: Smarter and Lighter" (in en). 2025-03-24. https://qwenlm.github.io/blog/qwen2.5-vl-32b/. 
  28. Nikhil (2025-03-24). "Qwen Releases the Qwen2.5-VL-32B-Instruct: A 32B Parameter VLM that Surpasses Qwen2.5-VL-72B and Other Models like GPT-4o Mini" (in en-US). https://www.marktechpost.com/2025/03/24/qwen-releases-the-qwen2-5-vl-32b-instruct-a-32b-parameter-vlm-that-surpasses-qwen2-5-vl-72b-and-other-models-like-gpt-4o-mini/. 
  29. 29.0 29.1 29.2 Dotson, Kyt (27 March 2025). "Alibaba releases new open-source AI model to power intelligent voice applications". https://siliconangle.com/2025/03/27/alibaba-releases-new-open-source-ai-model-power-intelligent-voice-applications/. 
  30. 30.0 30.1 Ara Shaikh, Jasmeen (April 28, 2025). "Alibaba unveils advanced Qwen 3 AI as Chinese tech rivalry intensifies". https://www.reuters.com/business/media-telecom/alibaba-unveils-advanced-qwen-3-ai-chinese-tech-rivalry-intensifies-2025-04-29/. 
  31. Wiggers, Kyle (28 April 2025). "Alibaba unveils Qwen3, a family of 'hybrid' AI reasoning models". https://techcrunch.com/2025/04/28/alibaba-unveils-qwen-3-a-family-of-hybrid-ai-reasoning-models/. 
  32. "Qwen3: Think Deeper, Act Faster" (in en). 2025-04-29. https://qwenlm.github.io/blog/qwen3/. 
  33. 33.0 33.1 Bastian, Matthias (2025-09-07). "Alibaba unveils Qwen3-Max-Preview, its largest language model yet" (in en-US). https://the-decoder.com/alibaba-unveils-qwen3-max-preview-its-largest-language-model-yet/. 
  34. "Big news: Introducing Qwen3-Max-Preview..." (in en). https://x.com/Alibaba_Qwen/status/1963991502440562976. 
  35. "Qwen3 Max - API, Providers, Stats" (in en). https://openrouter.ai/qwen/qwen3-max. 
  36. "Qwen3-Next: Towards Ultimate Training & Inference Efficiency". September 10, 2025. https://qwen.ai/blog?id=4074cca80393150c248e508aa62983f9cb7d27cd&from=research.latest-advancements-list. 
  37. 37.0 37.1 "Qwen/Qwen3-Omni-30B-A3B-Instruct · Hugging Face". 2025-09-22. https://huggingface.co/Qwen/Qwen3-Omni-30B-A3B-Instruct. 
  38. Jiang, Ben (13 September 2023). "Alibaba opens Tongyi Qianwen model to public as new CEO embraces AI" (in en). https://www.scmp.com/tech/big-tech/article/3234385/alibaba-opens-ai-model-tongyi-qianwen-public-competition-baidu-tencent-and-other-chinese-big-tech. 
  39. Kharpal, Arjun (25 August 2023). "Alibaba launches AI model that can understand images and have more complex conversations" (in en). https://www.cnbc.com/2023/08/25/alibaba-new-ai-model-can-understand-images-more-complex-conversations.html. 
  40. "阿里通义千问开源 Qwen2-Audio 7B 语音交互大模型:自由互动,无需输入文本". 2024-08-13. https://www.ithome.com/0/788/116.htm. 
  41. "Alibaba accelerates AI push by releasing new open-source models, text-to-video". September 19, 2024. https://www.reuters.com/technology/alibaba-accelerates-ai-push-by-releasing-new-open-source-models-text-to-video-2024-09-19/. 
  42. Nuñez, Michael (12 November 2024). "Qwen2.5-Coder just changed the game for AI programming—and it's free". https://venturebeat.com/ai/alibaba-new-ai-can-code-in-92-languages-and-its-completely-free/. 
  43. Dotson, Kyt (26 December 2024). "Alibaba announces advanced experimental visual reasoning QVQ-72B AI model". https://siliconangle.com/2024/12/26/alibaba-announces-advanced-experimental-visual-reasoning-qvq-72b-ai-model/. 
  44. Wiggers, Kyle (27 January 2025). "Alibaba's Qwen team releases AI models that can control PCs and phones". https://techcrunch.com/2025/01/27/alibabas-qwen-team-releases-ai-models-that-can-control-pcs-and-phones/. 
  45. Franzen, Carl (5 March 2025). "Alibaba's new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements". https://venturebeat.com/ai/alibabas-new-open-source-model-qwq-32b-matches-deepseek-r1-with-way-smaller-compute-requirements/. 
  46. "Alibaba rolls out new AI coding model Qwen3-Coder, says it's their most powerful" (in en). https://www.computerworld.com/article/4027149/alibaba-rolls-out-new-ai-coding-model-qwen3-coder-says-its-their-most-powerful.html. 
  47. "Qwen/Qwen3-Next-80B-A3B-Instruct · Hugging Face". 2025-09-11. https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Instruct. 

Template:Generative AI chatbots

Template:Artificial intelligence navbox