Qwen2 Archives

Qwen Model Family

Oct 8, 2024

Qwen (Chinese: 通义千问; pinyin: Tongyi Qianwen) represents the advanced large language and multimodal model series developed by the Qwen Team at Alibaba Group. These models excel in a variety of tasks, including natural language understanding, text generation, visual and audio comprehension, tool utilization, role-playing, and acting as AI agents. The Qwen models are pre-trained on…

Qwen2 Language Model Evaluation

Sep 10, 2024

The evaluation of Qwen2 Language Model Family primarily emphasizes their performance in natural language understanding, general question answering, coding, mathematics, scientific knowledge, reasoning, and multilingual capabilities. The Qwen2 Language Model evaluation encompasses a diverse set of datasets across various tasks: English tasks include MMLU (5-shot), MMLU-Pro (5-shot), GPQA (5-shot), Theorem QA (5-shot), BBH (3-shot), HellaSwag…

Introduction to Qwen2

Sep 5, 2024

Alibaba Group recently released Qwen2, a new AI model that surpasses LLaMA 3 in various aspects. This article delves into the specifics of Qwen2, comparing its performance across different versions and exploring its unique capabilities. We will test both the smallest and largest variants of Qwen2 to assess their speed and quality. Additionally, we will…

Tag: Qwen2

Qwen Model Family

Qwen2 Language Model Evaluation

Introduction to Qwen2