Tag: Qwen Models

QVQ-72B-Preview

Dec 25, 2024

QVQ-72B-Preview is an experimental research model developed by the Qwen team, focusing on enhancing visual reasoning capabilities. Key Insights • MMMU Benchmark: QVQ-72B-Preview scored an impressive 70.3%, reflecting its strong multidisciplinary reasoning and comprehension capabilities. • MathVision: The model demonstrated significant progress in mathematical reasoning, outperforming earlier benchmarks. • OlympiadBench: It showed an enhanced ability to…
Qwen2.5 Technical Report

Dec 20, 2024

Qwen2.5 represents a significant leap in the development of large language models (LLMs). This latest version builds upon its predecessors with comprehensive improvements in pre-training and post-training methodologies. By leveraging a vast dataset of 18 trillion tokens, Qwen2.5 exhibits remarkable advancements in common-sense reasoning, domain expertise, and overall language understanding. Key Features 1. Model Configurations:…
Benchmarks of providers of Qwen2.5

Dec 10, 2024

Benchmarks of providers of Qwen2.5, a leading open-source model family. Qwen2.5 family of models includes Qwen2.5 72B, Qwen2.5 Coder 32B and a range of smaller models including 1.5B and 0.5B models for ‘edge’ use-cases. Qwen2.5 72B, the flagship model, is competitive in intelligence evaluations with frontier models including Llama 3.3 70B, GPT-4o and Mistral Large…
QwQ-32B-Preview

Nov 28, 2024

QwQ-32B-Preview is an experimental research model developed by the Qwen Team, focused on advancing AI reasoning capabilities. As a preview release, it demonstrates promising analytical abilities while having several important limitations: Specification For more details, please refer to our blog. You can also check Qwen2.5 GitHub, and Documentation. QwQ-32B-Preview Model Performance Through extensive exploration and countless experiments, a profound…
ModelBox Qwen2.5 Coder

Nov 16, 2024

ModelBox now supports Qwen2.5 Coder Inference! With Qwen2.5-Coder-32B-Instruct setting the bar as the current SOTA open-source code model, it matches the coding power of GPT-4o. Start building smart today. Qwen2.5 Coder Models Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream…
Qwen2.5-Coder-32B-Instruct

Nov 11, 2024

Now it is the time, Nov. 11! The perfect time for Alibaba Qwen’s best coder model ever! Qwen2.5-Coder-32B-Instruct! Wait wait… it’s more than a big coder! It is a family of coder models! Besides the 32B coder, they have coders of 0.5B / 1.5B / 3B / 7B / 14B! As usual, Dev Team not…
Qwen Model Family

Oct 8, 2024

Qwen (Chinese: 通义千问; pinyin: Tongyi Qianwen) represents the advanced large language and multimodal model series developed by the Qwen Team at Alibaba Group. These models excel in a variety of tasks, including natural language understanding, text generation, visual and audio comprehension, tool utilization, role-playing, and acting as AI agents. The Qwen models are pre-trained on…

Tag: Qwen Models

QVQ-72B-Preview

Qwen2.5 Technical Report

Benchmarks of providers of Qwen2.5

QwQ-32B-Preview

ModelBox Qwen2.5 Coder

Qwen2.5-Coder-32B-Instruct

Qwen Model Family

Related content