← Back to hard AIs

Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Qwen

Qwen3-30B-A3B

Model family: qwen3

The general-purpose 30B-A3B MoEA model architecture that splits the model into many smaller specialized "expert" networks, only activating a handful per input rather than running the whole model every time. The practical effect: you get the knowledge capacity of a big model with the compute cost of a much smaller one. Mistral Large 3 and Mistral Small 4 are both MoE models. — fast, single-GPUThe specialized chip that runs most AI models. Originally designed for 3D graphics, GPUs turned out to be excellent at the math AI requires. Nvidia dominates the AI GPU market; common datacenter models include the H100, H200, and B200. Running an AI model without a GPU is possible but painfully slow for anything but the smallest models.-friendly, Apache 2.0; the all-rounder counterpart to Qwen3-Coder.

Identity

Creator
Qwen
Model family
qwen3
Release date
2025-04-27

Technical specs

Parameter count
30.5B
Context window
262K tokens
Modalities
  • Text
Primary capabilities
  • Chat
  • Coding
  • Instruction Following
  • Long Context
  • Multilingual
  • Reasoning
  • Tool Use

License

License
Apache License 2.0
Commercial use
  • Allowed
Terms
  • Modification
  • Redistribution
  • Attribution

Access

Openness
  • Open Weight
Access methods
  • Api Third Party
  • Local Runtime Llama Cpp
  • Local Runtime Ollama
  • Local Runtime Vllm
  • Weights Download Hf
Cost tier
  • Mixed

Full model card →