Chips & Hardware May 20 ago

Meta and Alibaba Cloud Advance AI Models: A Look at LLaMA and Qwen

Meta and Alibaba Cloud have made substantial advancements in their AI models, LLaMA and Qwen, respectively. The latest iterations highlight increased scale and enhanced functionalities for diverse applications.

GPUBeat Desk

Desk · GPUBeat Media

Published

May 20 · 17:36 ET

Reading

2 min · 457 words

NVIDIA — ai-agents — NVIDIA — Meta and Alibaba Cloud Advance AI Models: A Look at LLaMA and Qwen Source: GPUBeat

The AI sector has undergone significant changes with Meta's LLaMA and Alibaba Cloud's Qwen models. What are the main advancements in these AI frameworks, and how do they compare?

Meta's LLaMA series launched its first version in February 2023, featuring models with up to 65 billion parameters designed for research. The release of LLaMA 2 in summer 2023 brought improved stability and limited commercial use. By 2024, LLaMA 3 emerged with a redesigned architecture that greatly enhanced its text and code processing capabilities, highlighted by a flagship model boasting 405 billion parameters. The latest iteration, LLaMA 4, introduced in spring 2025, features Scout and Maverick—Meta's first open multimodal models built on a Mixture-of-Experts (MoE) architecture. Scout operates with 109 billion parameters and 16 experts, while Maverick offers around 400 billion parameters with 128 experts, significantly boosting their performance.

In contrast, Alibaba Cloud's Qwen models debuted in 2023, targeting a broad range of applications. The initial lineup included compact models starting at 1.8 billion parameters, scaling up to the substantial Qwen-72B. Qwen 2 marked a notable enhancement in instruction-following capabilities, with Qwen 2.5-Max gaining recognition for its improved performance through Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). The latest model, Qwen 3, released in April 2025, showcased improved reasoning abilities and multilingual support, focusing on agentic use cases.

Evolution of LLaMA and Qwen Models

The advancements in both LLaMA and Qwen models highlight a trend toward increased scale and multi-functional capabilities. For example, LLaMA 4 Scout supports a context window of up to 10 million tokens and can efficiently operate on a single NVIDIA H100 with INT4 quantization. On the other hand, Maverick aims to deliver superior quality for text and vision tasks but requires more powerful hardware for local deployment.

Meanwhile, Qwen's evolution from its first-generation models to Qwen 3.6-27B—a 27 billion parameter model designed for coding and agentic tasks—demonstrates Alibaba's dedication to enhancing AI's practical applications. The Qwen series now includes a diverse array of models, from smaller dense versions to larger MoE variants tailored for specific use cases.

Implications for AI Development

The progression of both Meta and Alibaba Cloud's AI models illustrates the growing complexity and capability of AI agents. As LLaMA continues to evolve its architecture and increase its parameter count, the implications for research and commercial applications are significant. Likewise, Qwen's development reflects a strategic emphasis on enhancing reasoning and multilingual capabilities, positioning it as a strong competitor in the AI market.

As we look forward, the ongoing development of LLaMA and Qwen models will likely influence the future of AI applications, making them essential tools for industries pursuing advanced automation and intelligent solutions. Both companies seem committed to expanding the limits of AI, setting the stage for future innovations and applications in the field.

GPUBeat Desk

Desk · joined 2026

GPUBeat Desk covers AI infrastructure — chips, foundation models, inference economics, datacenter buildouts, and the geopolitics of compute.

2033 stories

Evolution of LLaMA and Qwen Models

Implications for AI Development

GPUBeat Desk

More on chips & hardware

Norway’s National Library Leverages 2 PB of Huawei Storage for LLM Training

China’s AI Development: Adapting to U.S. Export Controls on Nvidia

DeepSeek Cuts V4-Pro AI Model Prices by 75% Amid Increased Competition