Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · ByteDance · Seed-OSS-36B-Instruct

Hermes 4.3 36B

fine-tune derivative of Seed-OSS-36B-Instruct by Nous Research

Nous Research's fine-tune of ByteDance's Seed-OSS-36B — the most recent Hermes and the first trained on Nous's Psyche decentralized network rather than a centralized cluster.

Size

mid (36.0B params)

Context

524,288 tokens

Released

2025-12-01

Openness

open-weight

License

Apache License 2.0 (Nous fine-tune over Seed-OSS-36B) · commercial: yes

Cost tier

mixed

Rating

4.0 ★ — The newest Hermes, with a 512K context window and clean Apache licensing from its Seed-OSS base; 4.0, with the caveat that it is recent and trained by a novel decentralized method.

Modalities

text

Capabilities

chat, coding, function-calling, instruction-following, long-context, math, reasoning, tool-use

Access

local-runtime-llama-cpp, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
commercial-friendly
mid-size
reasoning
long-context
self-hostable
fine-tune
apache-2-0

Quick Take

The newest Hermes: a 36B fine-tune of ByteDance's Seed-OSS-36B with a 512K context window, Apache-clean, and the first Hermes trained on Nous's decentralized Psyche network.

Plain-English Description

Hermes 4.3 36B (December 2025) is Nous Research's most recent model and a notable departure: instead of a Llama base, it's built on ByteDance's Apache 2.0 Seed-OSS-36B — which means it inherits both a 512K-token context window and a clean, unrestricted license. It's also the first Hermes model trained on Nous's Psyche decentralized training network rather than a centralized GPU cluster, a milestone for the lab's distributed-training ambitions.

The result is a mid-size open model with an unusually large context window and Hermes's steerable, reasoning-capable tuning, all under Apache 2.0. For document-heavy or long-session work where you also want to self-host commercially with minimal license friction, the 512K window plus the clean license is a strong combination.

License details below.

Best For

Long-context work — the 512K window handles very large documents or long sessions.
Self-hosted, commercially-clean (Apache) deployments wanting Hermes steerability.
Mid-size reasoning and agentic tasks on a single high-end GPU.
Teams wanting the newest Hermes without Llama's license terms.

Not For

Maximum capability — the frontier-scale Hermes 4 405B goes higher.
Laptop-only setups — a 36B with huge context wants real hardware.
Buyers who need a fully proven track record — it's recent and trained via a novel decentralized method.
Multimodal tasks — text only.

License — Plain-English Summary

Clean and permissive. Nous releases the Hermes 4.3 weights openly, and the base — ByteDance's Seed-OSS-36B — is Apache 2.0, so both layers allow unrestricted commercial use, modification, and redistribution with no carve-outs; retain the Apache notices. This is the licensing upside of moving off a Llama base: no "Built with Llama," no 700M-MAU clause.

How It Compares

Against Hermes 4 70B, the 4.3 36B is smaller but carries a far larger context window (512K vs 128K) and a cleaner Apache license. Against its base Seed-OSS-36B-Instruct, it's Nous's steerable Hermes tuning over ByteDance's foundation. Against Hermes 4 405B, it gives up frontier capability for portability, long context, and clean licensing.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host; the base model's license governs commercial use (see License).

Comparable models

Commercial-use conditions

Nous releases the Hermes weights openly and the base (Seed-OSS-36B) is Apache 2.0 — both layers are permissive, with unrestricted commercial use and no carve-outs.