Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Qwen · Qwen2.5-32B

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

DeepSeek-R1-Distill-Qwen-32B

distillation derivative of Qwen2.5-32B by DeepSeek

Fine-tuned (distilled) from Qwen2.5-32B on 800K reasoning samples generated by DeepSeek-R1, transferring R1's step-by-step chain-of-thought reasoning into a smaller dense model.

Size

mid (32.0B params)

Context

131,072 tokens

Released

2025-01-19

Openness

open-weight

License

MIT License (over Apache 2.0 base) · commercial: yes

Cost tier

mixed

Rating

4.5 ★ — The best of the R1 distills on practical value — o1-mini-class reasoning, self-hostable on one high-end GPU, under a clean MIT-over-Apache license with no Llama-style carve-out.

Modalities

text

Capabilities

coding, math, reasoning

Access

local-runtime-llama-cpp, local-runtime-lm-studio, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
commercial-friendly
mid-size
reasoning
math
self-hostable
distillation
china-based
apache-2-0

Quick Take

The standout R1 distill: o1-mini-class reasoning on a single high-end GPU, built on Qwen2.5-32B, under a clean MIT-over-Apache license.

Plain-English Description

The 32B is the strongest Qwen-based R1 distill and, for many, the best value in the whole distill family. Built on Qwen2.5-32B, it reaches reasoning quality comparable to OpenAI's o1-mini on several benchmarks (around 72.6 on AIME 2024 and 94.3 on MATH-500) while running on a single high-end consumer or workstation GPU.

That combination — proprietary-grade reasoning, self-hostable, no API — is exactly what a privacy-sensitive or cost-conscious business wants for hard reasoning tasks. And unlike the comparable Llama-70B distill, it carries a clean MIT-over-Apache license with no user-count carve-out.

It remains a reasoning specialist, so pair it with a generalist for open-ended work.

Best For

Top-tier reasoning you can self-host on a single high-end GPU.
Hard math, logic, and code problems where o1-mini-class quality matters.
Privacy-sensitive or cost-conscious teams that can't or won't use a reasoning API.
Fine-tuning a strong, cleanly-licensed reasoning model on private data.

Not For

General chat or writing — it's a reasoning specialist.
Laptop or small-GPU setups — step down to DeepSeek-R1-Distill-Qwen-14B or the 7B.
The absolute strongest distill reasoning, where the heavier DeepSeek-R1-Distill-Llama-70B edges ahead.
Multimodal tasks — text only.

License — Plain-English Summary

This distill is unusually clean on licensing. DeepSeek released its fine-tuned weights under the permissive MIT license, and the base it was built on — Qwen2.5-32B — is Apache 2.0. Both layers allow commercial use, modification, fine-tuning, and redistribution with no royalties and no user-count carve-outs; you just keep the respective notices. That's a meaningful contrast with the Llama-based R1 distills, which inherit Meta's community license and its 700M-monthly-user clause. If clean commercial terms matter, the Qwen-based distills like this one are the easier choice.

How It Compares

Against DeepSeek-R1-Distill-Llama-70B, the 32B is a bit behind on peak reasoning but much lighter (one GPU vs multi-GPU) and cleaner on licensing (Apache-over-MIT vs Llama community license) — for most teams the better practical choice. Against DeepSeek-R1-Distill-Qwen-14B, it's the stronger option when you have the GPU for it. Against its parent DeepSeek-R1, it's the self-hostable stand-in that gets closest to the full model's reasoning.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host under Apache 2.0; also served by third-party hosts.

Comparable models

Commercial-use conditions

DeepSeek released the distilled weights under MIT; the base model (Qwen2.5) is Apache 2.0. Both layers are permissive and allow commercial use, so there are no carve-outs to worry about here.