Tag: Qwen2

  • Qwen Model Family

    Qwen Model Family

    Qwen (Chinese: 通义千问; pinyin: Tongyi Qianwen) represents the advanced large language and multimodal model series developed by the Qwen Team at Alibaba Group. These models excel in a variety of tasks, including natural language understanding, text generation, visual and audio comprehension, tool utilization, role-playing, and acting as AI agents. The Qwen models are pre-trained on…

  • Qwen2 Language Model Evaluation

    Qwen2 Language Model Evaluation

    The evaluation of Qwen2 Language Model Family primarily emphasizes their performance in natural language understanding, general question answering, coding, mathematics, scientific knowledge, reasoning, and multilingual capabilities. The Qwen2 Language Model evaluation encompasses a diverse set of datasets across various tasks: English tasks include MMLU (5-shot), MMLU-Pro (5-shot), GPQA (5-shot), Theorem QA (5-shot), BBH (3-shot), HellaSwag…

  • Introduction to Qwen2

    Introduction to Qwen2

    Alibaba Group recently released Qwen2, a new AI model that surpasses LLaMA 3 in various aspects. This article delves into the specifics of Qwen2, comparing its performance across different versions and exploring its unique capabilities. We will test both the smallest and largest variants of Qwen2 to assess their speed and quality. Additionally, we will…