Mistral's licensing is more varied than it first appears, and the variation matters for business use.
Apache 2.0 covers most of the lineup. Mistral Large 3, Mistral Small 4, all Ministral 3 variants, Devstral Small 2 (24B), and Voxtral speech-recognition models are Apache 2.0 — genuinely permissive, commercial-use allowed, no user-count thresholds, modification and redistribution fine. For these models, self-hosting is unambiguously free and fine-tuning for commercial product development is unambiguously fine.
Devstral 2 (the 123B flagship) is not Apache 2.0. Mistral calls the license "modified MIT" in their materials, but it contains commercial restrictions that developers on X correctly flagged as functionally proprietary. Commercial use is conditional and revenue-capped; exceeding the threshold requires a separate commercial agreement. If you're planning to deploy Devstral 2 in a revenue-generating product, read the license carefully or consult counsel before self-hosting. The 24B Devstral Small 2 has none of these restrictions and is the cleaner default for most commercial use.
Voxtral TTS is CC BY-NC 4.0. Creative Commons Attribution-NonCommercial 4.0 International means: free to use, modify, and redistribute with attribution, but commercial use is prohibited without a separate license from Mistral. The hosted APIAccessing a model by sending requests to the creator's (or a provider's) servers, typically pay-per-use. Hosted APIs handle all the operational work — scaling, hardware, uptime — in exchange for a per-token or per-request fee. Every closed-API model is hosted; many open-weight models are also available via hosted APIs from providers like Together, Fireworks, or Groq. at $0.016/1K characters is the commercial license. If you want to self-host Voxtral TTS weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself. in a revenue-generating product, you need to contact Mistral for a commercial agreement. Research, evaluation, and personal use are unrestricted.
The closed-APIA model that's only accessible through the creator's own API or product — you can't download it, run it yourself, or inspect its weights. GPT-4, Claude, and Gemini Pro are closed-API models. The tradeoff is convenience and often capability (closed-API models are frequently the strongest) versus loss of control over data, pricing, and availability. tier is growing. Mistral Medium 3, Mistral Embed, Codestral Embed, Mistral Moderation, Mistral OCR, and Mistral Saba (Middle Eastern / South Asian languages) are proprietary API-only products — no open weights, no self-hosting. For some of these (notably Codestral Embed), Mistral's offering is the only first-party option in the category, so if the task matches the specialty, the closed API is often the path of least resistance.
Beyond licensing, one practical note for EU-based deployments: Mistral is the only frontier AI lab where "my data will not leave the EU" is a default configuration rather than a paid enterprise tier. For regulated industries in Europe, this alone is often the deciding factor.