Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Meta · Llama 3.1 405B

Hermes 4 405B

fine-tune derivative of Llama 3.1 405B by Nous Research

Nous Research's post-training of Llama 3.1 405B into Hermes 4 — adding hybrid reasoning (toggleable <think> tags), stronger schema-adherent output, and steerable, low-refusal instruction following.

Size

frontier (405.0B params)

Context

131,072 tokens

Released

2025-08-25

Openness

open-weight

License

Llama 3.1 Community License (Nous fine-tune) · commercial: conditional

Cost tier

mixed

Rating

4.0 ★ — SOTA-class open reasoning with hybrid think tags and strong steerability, but cluster-scale hardware and the inherited Llama license keep it at 4.0.

Modalities

text

Capabilities

chat, coding, function-calling, instruction-following, long-context, math, reasoning, tool-use

Access

local-runtime-llama-cpp, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
large
reasoning
agentic
self-hostable
fine-tune
us-based
llama-derivative

Quick Take

Nous Research's flagship: a 405B hybrid-reasoning fine-tune of Llama 3.1 405B, state-of-the-art among open-weight models on reasoning, with toggleable chain-of-thought.

Plain-English Description

Hermes 4 405B is the top of Nous Research's lineup — their post-training applied to Meta's largest Llama 3.1 model. The headline feature is hybrid reasoning: a system prompt toggles tags on or off, so the same model can answer directly for simple queries or deliberate step-by-step for hard ones. It reaches frontier-level open-weight performance through post-training alone, no new pretraining.

Beyond reasoning, Hermes is known for steerability — it adopts strong personas, follows detailed system prompts closely, and refuses less than Meta's own Instruct release, which makes it a favorite for builders who want control over voice and behavior. It's a serious model that needs serious hardware: at 405B it's a multi-GPU or cloud-inference deployment.

Like all Llama-based Hermes models, its license is inherited from Llama (see below).

Best For

Top-tier open reasoning you can self-host (with cluster-scale hardware).
Applications wanting strong steerability and low-refusal instruction following at the frontier.
Agentic and schema-adherent output (structured tool calls) at maximum capability.
Research and high-end deployments where owning the weights matters.

Not For

Anyone without multi-GPU/cluster capacity — use Hermes 4 70B.
Products near the 700M-MAU mark, which trip Llama's license carve-out.
Teams wanting a clean, unrestricted license — the Apache-based Hermes 4.3 36B avoids Llama's terms.
Multimodal tasks — text only.

License — Plain-English Summary

Two layers. Nous releases the Hermes 4 weights openly, but the base is Meta's Llama 3.1 405B, so Meta's Llama 3.1 Community License governs the model and travels with it: commercial use is allowed, but you must display "Built with Llama," observe the acceptable-use terms, and secure a separate Meta license only if your product exceeds 700 million monthly active users. That threshold is irrelevant for nearly all businesses, hence "conditional." For a similar model without Llama's strings, the Apache-licensed Hermes 4.3 36B is the alternative.

How It Compares

Against Hermes 4 70B, the 405B is more capable but far heavier — the 70B is what Nous recommends for hosted use. Against the prior Hermes 3 405B, Hermes 4 adds hybrid reasoning and sharper outputs. Against its base Llama 3.1 405B, Hermes is the steerable, lower-refusal, reasoning-toggle alternative to Meta's own Instruct tuning.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host; the base model's license governs commercial use (see License).

Comparable models

Commercial-use conditions

Nous releases the Hermes weights openly, but the base is Meta's Llama 3.1, so Meta's Llama 3.1 Community License governs the model — including the clause requiring a separate Meta license if your product exceeds 700 million monthly active users.