QwQ-32B Tongyi Qianwen Large Language Models
Alibaba Cloud unveils QwQ-32B, a groundbreaking 32B-parameter model challenging DeepSeek R1 (671B) through pure reinforcement learning, showcasing comparable performance in reasoning tasks.
• 5 min read
News