Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

Hermes 3 405B

fine-tune derivative of Llama 3.1 405B by Nous Research

Nous Research's post-training of Llama 3.1 405B into Hermes 3 — agentic capability, structured <tool_call> output, strong roleplay and steerability.

Size

frontier (405.0B params)

Context

131,072 tokens

Released

2024-08-14

Openness

open-weight

License

Llama 3.1 Community License (Nous fine-tune) · commercial: conditional

Cost tier

mixed

Rating

3.5 ★: a landmark open model in its day, now superseded by Hermes 4 405B and bound by the Llama license. Still capable, but an older flagship.

Modalities

text

Capabilities

chat, coding, function-calling, instruction-following, long-context, reasoning, tool-use

Access

local-runtime-llama-cpp, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
large
agentic
self-hostable
fine-tune
us-based
llama-derivative

Quick Take

The Hermes 3 flagship: a 405B fine-tune of Llama 3.1 405B that was state-of-the-art among open-weight models at its 2024 release — now succeeded by Hermes 4.

Plain-English Description

Hermes 3 405B was Nous Research's headline model before Hermes 4, and at its August 2024 launch it posted state-of-the-art results among open-weight models on several public benchmarks. It introduced the agentic and structured-output capabilities that define the modern Hermes line: XML-tagged <tool_call> output, strong multi-turn coherence, and long-context retention.
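
As a rough illustration of consuming structured <tool_call> output, the sketch below pulls tool calls out of a model reply. It assumes each call is a JSON object wrapped in <tool_call> tags with "name" and "arguments" keys. This entry names only the tag, so treat the payload shape, the sample reply, and the get_weather tool as hypothetical.

```python
import json
import re

# Assumed wire format: one JSON object per <tool_call>...</tool_call> block.
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(.*?)\s*</tool_call>", re.DOTALL)


def extract_tool_calls(text):
    """Return the JSON objects found inside <tool_call> blocks, in order."""
    calls = []
    for match in TOOL_CALL_RE.finditer(text):
        try:
            payload = json.loads(match.group(1))
        except json.JSONDecodeError:
            continue  # skip a malformed block instead of failing the whole reply
        if isinstance(payload, dict):
            calls.append(payload)
    return calls


# Hand-written sample reply for illustration; not real model output.
sample_reply = (
    "I'll check the weather.\n"
    "<tool_call>\n"
    '{"name": "get_weather", "arguments": {"city": "Paris"}}\n'
    "</tool_call>"
)

for call in extract_tool_calls(sample_reply):
    print(call["name"], call["arguments"])
```

Skipping malformed blocks is one possible design choice. A stricter caller might prefer to raise an error and ask the model to retry.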

It remains a capable model, but Hermes 4 405B supersedes it with hybrid reasoning and sharper outputs, so for new work the Hermes 4 line is the better starting point. This entry mainly documents the lineage and supports teams already running Hermes 3.

The license is inherited from Llama (see the license summary below).

Best For

Existing deployments standardized on Hermes 3 405B.
Understanding the Hermes lineage and the jump to Hermes 4.
Agentic and structured-output work at frontier scale where a Hermes 3 model is already validated.

Not For

New deployments — start on Hermes 4 405B.
Anyone without cluster-scale hardware — use a smaller Hermes.
Products approaching 700M monthly active users (MAU), where the Llama license carve-out applies.
Multimodal tasks — text only.
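
To make the "cluster-scale hardware" point concrete, here is a back-of-the-envelope estimate of the memory needed just to hold the weights. The parameter count comes from the Size field. The precisions and the 80 GB device size are illustrative assumptions, and the estimate ignores KV cache, activations, and runtime overhead, so real requirements are higher.

```python
import math

PARAMS = 405.0e9  # parameter count from the Size field

# Bytes per parameter at common precisions (assumed for illustration).
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}

DEVICE_GB = 80  # hypothetical accelerator memory size


def weight_memory_gb(params, bytes_per_param):
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


def min_devices(total_gb, device_gb):
    """Smallest device count whose combined memory can hold the weights."""
    return math.ceil(total_gb / device_gb)


for name, width in BYTES_PER_PARAM.items():
    gb = weight_memory_gb(PARAMS, width)
    print(f"{name}: {gb:.1f} GB of weights, at least {min_devices(gb, DEVICE_GB)} devices")
```

Under these assumptions, the 16-bit weights alone come to 810 GB, which is why a single workstation GPU is not a realistic target for this model.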

License — Plain-English Summary

Two layers. Nous's open Hermes 3 weights sit on Meta's Llama 3.1 405B, so the Llama 3.1 Community License governs — commercial use allowed with "Built with Llama" attribution and the 700M-MAU carve-out. Same terms as the rest of the Llama-based Hermes family.
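
The two conditions named in this summary can be expressed as a simple pre-deployment check. This is only a sketch of the summary as written here, not of the license text itself. The Deployment record and its field names are hypothetical.

```python
from dataclasses import dataclass

MAU_THRESHOLD = 700_000_000  # figure quoted in the license summary


@dataclass
class Deployment:
    """Hypothetical record describing a product built on the model."""
    monthly_active_users: int
    shows_built_with_llama: bool


def license_flags(deployment):
    """List the conditions from the summary that need attention."""
    flags = []
    if not deployment.shows_built_with_llama:
        flags.append('missing "Built with Llama" attribution')
    if deployment.monthly_active_users > MAU_THRESHOLD:
        flags.append("over 700M MAU: separate Meta license required")
    return flags


print(license_flags(Deployment(monthly_active_users=2_000_000, shows_built_with_llama=False)))
```

An empty list means neither condition from the summary was tripped. The full license text remains the authority on what is actually required.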

How It Compares

Against Hermes 4 405B, Hermes 3 405B is the older flagship and Hermes 4 is the upgrade. Against Hermes 3 70B, it is the larger, heavier sibling of the same generation. Against its base, Llama 3.1 405B, it offers Nous's steerable tuning rather than Meta's Instruct tuning.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host; the base model's license governs commercial use (see License).

Commercial-use conditions

Nous releases the Hermes weights openly, but the base is Meta's Llama 3.1, so Meta's Llama 3.1 Community License governs the model — including the clause requiring a separate Meta license if your product exceeds 700 million monthly active users.