Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Mistral AI · Mistral Small 3 24B

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

DeepHermes 3 Mistral 24B

fine-tune derivative of Mistral Small 3 24B by Nous Research

Nous Research's reasoning-focused fine-tune of Mistral Small 3 24B — unified intuitive + toggleable chain-of-thought reasoning, on a non-Llama base with a permissive license.

Size

mid (24.0B params)

Context

32,768 tokens

Released

2025-03-10

Openness

open-weight

License

Apache License 2.0 (Nous fine-tune over Mistral Small 3 24B) · commercial: yes

Cost tier

mixed

Rating

4.0 ★ — Toggleable reasoning at a useful size with a clean Apache license (no Llama strings) — a genuinely appealing 4.0.

Modalities

text

Capabilities

chat, coding, instruction-following, reasoning, tool-use

Access

local-runtime-llama-cpp, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
commercial-friendly
mid-size
reasoning
self-hostable
fine-tune
apache-2-0

Quick Take

DeepHermes on a Mistral base: a 24B toggleable-reasoning fine-tune of Mistral Small 3 24B — single-GPU, with a clean Apache 2.0 license (no Llama strings).

Plain-English Description

DeepHermes 3 Mistral 24B applies Nous's unified intuitive/reasoning tuning to Mistral's Small 3 24B rather than a Llama base. That's significant for two reasons: it's a more capable size than the 8B DeepHermes, and because the base is Apache 2.0, the whole model carries a clean, unrestricted license — no Llama community-license carve-out.

It offers the same toggleable reasoning (direct answers or explicit chain-of-thought via system prompt) at a single-GPU 24B scale, with Hermes's steerability. For teams that want a reasoning-toggle model they can self-host commercially with minimal license friction, it's an appealing option.

License details below.

Best For

Single-GPU reasoning with a mode toggle and a clean Apache license.
Teams that want DeepHermes behavior without Llama's license terms.
Steerable, self-hostable reasoning for commercial use.
Math, logic, and structured problems at mid scale.

Not For

The absolute strongest reasoning — larger models go further.
Laptop-only setups — the 24B wants a real GPU; use DeepHermes 3 8B.
Multimodal tasks — text only.

License — Plain-English Summary

Clean and permissive — the contrast with the Llama-based Hermes models. Nous releases the DeepHermes weights openly, and the base (Mistral Small 3 24B) is Apache 2.0, so both layers allow unrestricted commercial use, modification, and redistribution with no royalties and no user-count carve-out. Just retain the Apache notices.

How It Compares

Against DeepHermes 3 8B, the Mistral 24B is more capable and, crucially, cleaner on licensing (Apache vs Llama). Against its base Mistral Small 3 24B, it's Nous's reasoning-toggle tuning. Against the much larger Hermes 4 70B, it's lighter and Apache-clean but without Hermes 4's frontier-scale capability.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host; the base model's license governs commercial use (see License).

Comparable models

Commercial-use conditions

Nous releases the Hermes weights openly and the base (Mistral Small 3 24B) is Apache 2.0 — both layers are permissive, with unrestricted commercial use and no carve-outs.