← Back to hard AIs

Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Qwen

Qwen3.7-Max

Model family: qwen3-7

Context
1,048,576 tokens
Released
2026-05-19
Openness
closed-api
License
Cost tier
paid-api
Rating
4.0 — Frontier-tier agentic and coding capability at roughly a quarter of the closed-Western price — but it's closed and API-only, so no self-hosting, no data-residency control, and the China-jurisdiction question with no escape hatch.
Modalities
text
Capabilities
chat, coding, function-calling, instruction-following, long-context, reasoning, tool-use
Access
api-first-party, api-third-party

Quick Take

Alibaba's closed, agent-first flagship: frontier-tier coding and reasoning with a million-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. memory, priced at roughly half its Western rivals — but API-only, with no weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself. to own.

Plain-English Description

Qwen3.7-Max, announced at the Alibaba Cloud Summit in May 2026, is the top of Qwen's range and a deliberate break from the family's open-weightA model where the trained weights are freely downloadable — you can run it yourself without contacting the creator. Llama, Mistral, Qwen, and Gemma are open-weight. Open-weight does not mean open-source: the training data and code often stay private. The license still governs what you can do with the weights, including whether you can use them commercially. tradition. Unlike the Qwen models you can download and run yourself, Max is closed: it lives behind Alibaba's API, with no public weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself.. The trade Alibaba is offering is "rent our best model" rather than "own a very good one."

What you're renting is an agent-tuned frontier model — built for long-horizon, tool-using workflows like coding agents that read a repository, plan, and execute over many steps. It pairs a one-million-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. context windowThe maximum amount of text the model can "see" at once — prompt plus prior conversation plus any documents you give it. Measured in tokens (which are roughly three-quarters of a word each). A 128K context window is about 96,000 words of input — roughly a 400-page book. Larger context windows let the model work with bigger documents but cost more to run. with a native extended-thinking mode, and on independent measures it lands in the top tier: it scored 56.6 on Artificial AnalysisAn independent benchmarking site that runs standardized tests across commercial and open-weight models and publishes comparable results on capability, speed, and cost. Widely cited for API provider comparisons — if you want to know whether Llama 3.3 70B is faster on Groq or Together, Artificial Analysis is the reference.'s Intelligence Index at launch, the highest-placed Chinese model on that leaderboard, and posts strong agentic-coding numbers (around 60.6 on SWE-Bench Pro, 69.7 on Terminal-Bench 2.0, 92.4 on the GPQA Diamond science exam). It also recorded the lowest hallucination rate among frontier models tested (about 22.9%).

The headline for cost-conscious buyers: at $2.50 in / $7.50 out per million tokens, it's roughly half the price of comparable closed models from OpenAI and Anthropic, and it speaks the Anthropic Messages API format as a drop-in. The headline for everyone else: it's closed, so the data-governance and lock-in questions are real and there's no self-hosting fallback.

Best For

  • Long-horizon coding and agentic workflows where you want frontier quality but balk at frontier-level API bills.
  • Teams already on the Anthropic API format who want a cheaper drop-in to test against.
  • Very-long-context tasks (whole repositories, large document sets) that benefit from the million-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. window and the 90% cached-input discount.
  • Buyers comparing closed frontier models on price-per-quality, where Qwen3.7-Max is one of the most aggressive options.

Not For

  • Anyone who needs to self-host or keep data on their own infrastructure — Max is API-only, full stop. If that's a requirement, use the open Qwen3.5-397B-A17B instead.
  • Regulated or privacy-sensitive workloads that can't route data to Alibaba Cloud under Chinese jurisdiction — and here there's no open-weightA model where the trained weights are freely downloadable — you can run it yourself without contacting the creator. Llama, Mistral, Qwen, and Gemma are open-weight. Open-weight does not mean open-source: the training data and code often stay private. The license still governs what you can do with the weights, including whether you can use them commercially. escape hatch.
  • Image, audio, or video input — that's the closed Plus sibling's job; Max is text-only.
  • Teams that want to avoid vendor lock-in or fine-tuneA model that has been further trained on additional data to specialize it for a particular task, domain, or style. Fine-tuning a general model on medical literature produces a medical specialist; fine-tuning on your company's support tickets produces a support assistant that sounds like your team. Fine-tunes are much cheaper to create than training a model from scratch. the model themselves.

License — Plain-English Summary

There's no open license to summarize — Qwen3.7-Max is proprietary. You access it through Alibaba Cloud's API under their terms of service, which permit commercial use of the outputs but give you no rights to the model itself: no weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself., no modification, no redistribution. Practically, treat it like any other closed API (think GPT-5.5 or Claude): your commercial freedom is in what you build on top, and your main diligence items are Alibaba Cloud's acceptable-use policy and the data-routing implications of a China-hosted service. If owning or self-hosting the model matters to you, this isn't the Qwen to pick.

How It Compares

Against the open Qwen3.5-397B-A17B, Max is more capable on agentic tasks but closed — the 397B model is the answer when you need to self-host. Against the closed Western flagships (GPT-5.5, Claude Opus 4.7, Gemini 3.5), Qwen3.7-Max is competitive on a composite score and notably cheaper, but those vendors offer US-jurisdiction data residency and longer production track records; the deciding factor for most teams is access and data policy, not a single benchmark. Against DeepSeek's open MIT flagships, the contrast is sharp: DeepSeek keeps its frontier model open and downloadable, while Qwen has chosen to keep its very best behind the API — so if open weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself. at the frontier matter, DeepSeek wins that specific comparison.

Cost

API input (per 1M tokens)
$2.50
API output (per 1M tokens)
$7.50
API providers
alibaba-dashscope, openrouter, together
Notes
DashScope / Model Studio pricing is $2.50 input, $7.50 output per million tokens, with cached input at roughly $0.25 (a 90% discount) — meaningful for long-context agent workloads that reuse the same prefix. Roughly half the rate of the comparable closed Western flagships. No self-hosting; API only.

Comparable models

Commercial-use conditions

Commercial use is permitted through the API under Alibaba Cloud's terms of service. You are buying access, not the model — there are no weights to own, modify, or redistribute.

Sources