Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Catalog entry last reviewed 92 days ago.

Mistral AI

4.5 ★ — Consistent open-weight releases under Apache 2.0, rapid release cadence, and European jurisdiction that appeals to GDPR-sensitive deployments — tempered by license diversity across the lineup (Apache 2.0 for most, custom for Devstral 2 flagship with revenue restrictions, CC BY-NC 4.0 for Voxtral TTS) that requires reading carefully before you commit.

Type

ai-native-company

Country

Founded

2023

Website

https://mistral.ai

open-weight
eu-based
ai-native-company
commercial-friendly
apache-licensed

Quick Take

Mistral is France's frontier AI lab and the most prolific open-weight shipper in Europe — their Apache 2.0 model releases dominate the EU AI ecosystem and compete with U.S. labs on both capability and price.

Who They Are

Mistral AI is a Paris-based AI research and product company founded in 2023 by three researchers from Meta and Google DeepMind: Arthur Mensch (CEO), Guillaume Lample, and Timothée Lacroix. Within two and a half years, the company went from a €105M seed round to a €830M Series C in March 2026, a $13.8 billion valuation, and an annual recurring revenue of roughly $400 million. Headquartered in Paris with an expanding engineering presence in Sweden, Mistral is the clearest European counterweight to the U.S. concentration of frontier AI labs.

Mistral's early bet — ship foundation-model weights under a genuinely permissive license and let the ecosystem develop on top of them — has aged well. Where OpenAI, Anthropic, and most of Google's tier-one work lives behind closed APIs, Mistral's default is to publish the weights. Llama-scale models, code specialists, speech models, edge models: most of it goes out to Hugging Face under Apache 2.0 and is simultaneously available as a paid API on api.mistral.ai for teams who don't want to self-host. The resulting dynamic — commercial-grade models freely available for self-deployment and metered access for everyone else — makes Mistral the closest thing the AI industry has to a "best of both worlds" vendor for business users.

What makes the company distinctive beyond licensing is the release cadence. In a single 15-day stretch in March 2026, Mistral shipped Mistral Small 4 (a new flagship-small unified reasoning/coding/vision model), Voxtral TTS (their first text-to-speech model), Leanstral (a formal-proof coding agent), Forge (an enterprise training platform), Spaces CLI (a developer tooling product), and an NVIDIA partnership announcement. That pace is hard to sustain, and the quality hasn't suffered — Mistral Large 3 sits in the top tier of open-weight models on LMArena, and Ministral 3 14B's reasoning variant hits 85% on AIME 2025, which is genuinely state-of-the-art for its size class.

Model Philosophy

Mistral's positioning is roughly: frontier quality, permissive license, European jurisdiction, and pricing aggressive enough to pressure the U.S. closed-API incumbents. The pitch to enterprises is usually some version of "you can fine-tune and self-host on your own GPUs without legal friction, or you can use our hosted API for what we charge, which is generally a fraction of what Claude or GPT-4-class models cost." For European companies specifically, the fact that Mistral is French — subject to GDPR and the EU AI Act by default rather than by bolt-on — is a real procurement advantage that shows up in contracts with ASML, Ericsson, BNP Paribas, and similar names.

The product naming follows a convention worth understanding. "Mistral" models are general-purpose language models (Small, Medium, Large). "Ministral" models are edge-size (3B, 8B, 14B). The -stral suffix marks specialists: Devstral for code, Voxtral for audio, Leanstral for formal proofs, Codestral for code embeddings. Numbered generations (3, 4) replace the older dated-version scheme (2501, 2506) for new flagships, though dated checkpoints are still how Mistral identifies individual releases on Hugging Face — you'll see Mistral-Small-4-119B-2603, where "2603" encodes March 2026 as the release date.

What To Know Before You Commit

Mistral's licensing is more varied than it first appears, and the variation matters for business use.

Apache 2.0 covers most of the lineup. Mistral Large 3, Mistral Small 4, all Ministral 3 variants, Devstral Small 2 (24B), and Voxtral speech-recognition models are Apache 2.0 — genuinely permissive, commercial-use allowed, no user-count thresholds, modification and redistribution fine. For these models, self-hosting is unambiguously free and fine-tuning for commercial product development is unambiguously fine.

Devstral 2 (the 123B flagship) is not Apache 2.0. Mistral calls the license "modified MIT" in their materials, but it contains commercial restrictions that developers on X correctly flagged as functionally proprietary. Commercial use is conditional and revenue-capped; exceeding the threshold requires a separate commercial agreement. If you're planning to deploy Devstral 2 in a revenue-generating product, read the license carefully or consult counsel before self-hosting. The 24B Devstral Small 2 has none of these restrictions and is the cleaner default for most commercial use.

Voxtral TTS is CC BY-NC 4.0. Creative Commons Attribution-NonCommercial 4.0 International means: free to use, modify, and redistribute with attribution, but commercial use is prohibited without a separate license from Mistral. The hosted API at $0.016/1K characters is the commercial license. If you want to self-host Voxtral TTS weights in a revenue-generating product, you need to contact Mistral for a commercial agreement. Research, evaluation, and personal use are unrestricted.

The closed-API tier is growing. Mistral Medium 3, Mistral Embed, Codestral Embed, Mistral Moderation, Mistral OCR, and Mistral Saba (Middle Eastern / South Asian languages) are proprietary API-only products — no open weights, no self-hosting. For some of these (notably Codestral Embed), Mistral's offering is the only first-party option in the category, so if the task matches the specialty, the closed API is often the path of least resistance.

Beyond licensing, one practical note for EU-based deployments: Mistral is the only frontier AI lab where "my data will not leave the EU" is a default configuration rather than a paid enterprise tier. For regulated industries in Europe, this alone is often the deciding factor.

Original Models

Mistral Small

Eagle speculative-decoding head for Mistral Small 4 — pair it with the base model for faster inference throughput. Architectural extension, not standalone.

Mistral's specialist code agent for Lean 4 formal proof engineering — derived from Small 4, Apache 2.0, beats Claude Sonnet on formal proof benchmarks at ~1/15th the cost. The first genuinely open-source formal-proof agent.

Mistral's unified mid-tier workhorse — 119B MoE with 6B active, configurable reasoning depth, multimodal, Apache 2.0, and among the cheapest capable API options available.

Mistral Small 3 24B — an Apache 2.0 24B base, used as a fine-tuning foundation (e.g. for DeepHermes).

Voxtral

Mistral's first text-to-speech model — 9 languages, zero-shot voice cloning from 3 seconds of audio, and roughly 27% of ElevenLabs' per-character cost through Mistral's API.

Streaming-ASR Voxtral — processes live audio incrementally for real-time transcription and voice agents. Apache 2.0.

Transcription-optimized Voxtral — $0.003/min via Mistral's API, batch ASR for meetings, podcasts, and recordings. Apache 2.0.

Edge-deployable Voxtral — 3B sibling of Voxtral Small 24B with the same speech-understanding architecture at a smaller scale. Apache 2.0.

Mistral's speech-understanding flagship — a 24B audio-text model that transcribes, translates, and directly answers questions from audio input.

Devstral

Mistral's 123B agentic coding flagship — 72.2% on SWE-Bench Verified — with a custom "modified MIT" license that has commercial-use restrictions tied to revenue. Powerful but legally non-trivial to deploy commercially.

Mistral's laptop-class coding specialist — 24B parameters, Apache 2.0, runs on a single consumer GPU, and beats 70B-class competitors on software-engineering benchmarks.

Ministral 3

Pretrained base variant of Ministral 3 14B — largest edge-model base for custom fine-tuning and domain adaptation. Apache 2.0.

Mistral's biggest edge-class model — 14B parameters, vision-capable, 256K context, runs on a single consumer GPU, and performs like a 24B model.

Largest reasoning-tuned Ministral 3 — hits 85% on AIME 2025, state-of-the-art for 14B models. Laptop-class deployment, Apache 2.0.

Pretrained base variant of Ministral 3 3B — smallest Ministral 3, for teams doing their own instruction-tuning or domain adaptation at the edge scale.

Smallest Ministral 3 instruct variant — 3B parameters, multimodal, fits in 8GB VRAM in FP8. Apache 2.0, edge- and smartphone-class deployment.

Reasoning-tuned variant of Ministral 3 3B — extended chain-of-thought in an edge-deployable 3B model. Apache 2.0.

Pretrained base variant of Ministral 3 8B — mid-size edge model for teams doing custom instruction-tuning or domain adaptation. Apache 2.0.

Balanced mid-size Ministral 3 — 8B parameters with vision, multilingual, 256K context. Fits in 12GB VRAM in FP8. Apache 2.0.

Reasoning-tuned Ministral 3 8B — extended chain-of-thought at mid-size, laptop-deployable, Apache 2.0.

Mistral Large 3

Pretrained base variant of Mistral's Large 3 flagship — 675B MoE, Apache 2.0, for teams that want to run their own instruction-tuning or domain adaptation.

Mistral's open-weight frontier model — 675B total parameters, Apache 2.0, 256K context, multimodal — the strongest permissive license you'll find on a model this capable.

Mistral Medium

Mistral's closed-API enterprise tier — sits between Small 4 and Large 3 on capability and cost, with hybrid and on-prem deployment available but no published weights.

Embeddings

Mistral's code-specialized embedding model — purpose-built for code retrieval, outperforms Voyage Code 3 and OpenAI's embeddings on code benchmarks, and lets you pick your output dimensions to trade q

Mistral's general-purpose text embedding model — competitive with OpenAI and Cohere for standard RAG and semantic-search workloads, at the usual Mistral advantage of EU jurisdiction and aggressive pri

Mistral Other

Mistral's document-understanding API. Extracts markdown + HTML tables from PDFs, images, and handwriting at $2/1,000 pages ($1 batch). 74% win rate over OCR 2 as of December 2025, undercuts AWS/Google/Azure on price.

Regional

Mistral's Middle Eastern and South Asian language specialist — 24B model with strong Arabic, Farsi, Urdu, Hebrew, and Hindi performance. Closed-API.

Safety

Mistral's content-moderation classifier — nine harm categories, multilingual, closed-API at $0.10 per million tokens.

Mistral AI

Quick Take

Who They Are

Model Philosophy

What To Know Before You Commit

Original Models

Mistral Small

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Voxtral

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Devstral

Identity

Technical specs

License

Access

Sources

Ministral 3

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources