← Back to hard AIs

Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models

Anthropic

4.0 ★ — A frontier lab with a strong reputation for coding quality and safety, a clean three-tier lineup, and 1M-token context — but entirely closed, premium-priced, and narrower than the labs that also ship open weights, which is why it lands at 4.0 on a catalog that weights openness and breadth.

Type
ai-native-company
Country
US
Founded
2021
License posture
predominantly-closed
Website

Quick Take

The AI safety company behind Claude. Anthropic fields a clean three-tier closed lineup — Opus (most capable), Sonnet (balanced default), Haiku (fast and cheap) — known for coding quality and a safety-first posture.

Who They Are

Anthropic is an AI-native company founded in 2021 by a group of former OpenAI researchers, structured as a public-benefit corporation with an explicit focus on AI safety. Its single product line is Claude, a family of models offered in three tiers across successive generations. Amazon and Google are major investors, and Claude is available not only through Anthropic's own API but also via Amazon Bedrock and Google Cloud Vertex AI.

For business readers, the model is simpler than OpenAI's sprawling catalog: instead of dozens of variants, Anthropic offers three tiers per generation. Opus is the flagship for the hardest work, Sonnet is the balanced default most teams should use, and Haiku is the fast, cheap tier for high-volume tasks. Pick the tier that matches the job.

Model Philosophy

Anthropic is closed by design — there are no open weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself., and there's no equivalent of OpenAI's gpt-oss or Google's Gemma. You rent Claude through an API; you don't download or self-host it. The company frames this partly as a safety choice (retaining the ability to monitor and adjust deployed models), and partly it's the business model.

The lineup has a notable dynamic: capability creeps upward across tiers over time, so each new Sonnet generation tends to match or beat the previous generation's Opus on coding and document tasks — at Sonnet pricing. The practical implication is that the flagship is often overkill; the mid-tier frequently does the job. Anthropic's models are particularly associated with coding reliability, careful instruction-following, and long-context work (Opus and Sonnet carry 1M-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. windows).

What To Know Before You Commit

Two things matter most. First, it's closed and premium-priced: Claude pricing generally runs higher than OpenAI's comparable tiers, and there's no self-hosting option at all, so if owning the model or keeping everything fully in-house is a hard requirement, Claude isn't the fit — an open-weightA model where the trained weights are freely downloadable — you can run it yourself without contacting the creator. Llama, Mistral, Qwen, and Gemma are open-weight. Open-weight does not mean open-source: the training data and code often stay private. The license still governs what you can do with the weights, including whether you can use them commercially. line like Meta's Llama, Qwen, or Google's Gemma is. Second, the value lever is mostly Sonnet plus prompt caching: for the majority of workloads, Sonnet 4.6 with cached context is the practical default, with Haiku for routing and Opus reserved for work that justifies the premium.

On data governance, Anthropic is US-based with enterprise and zero-retention options, plus availability through Bedrock and Vertex for buyers who want it inside an existing cloud relationship. There are no open weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself., so the only deployment path is hosted.

How They Compare

Against the other closed Western frontier labs — OpenAI and Google — Anthropic competes most directly on coding quality and safety positioning, while OpenAI offers far more lineup breadth (and an open line in gpt-oss) and Google leads on native multimodality and ecosystem reach. Against the open-weightA model where the trained weights are freely downloadable — you can run it yourself without contacting the creator. Llama, Mistral, Qwen, and Gemma are open-weight. Open-weight does not mean open-source: the training data and code often stay private. The license still governs what you can do with the weights, including whether you can use them commercially. players (Meta, Qwen, DeepSeek, Gemma), the trade is stark: Claude offers no self-hosting at all, so those are the only options if open weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself. are required. The honest summary: Claude is a strong, premium, closed choice that many teams pay up for on coding and reliability — and a non-starter for anyone who needs to own the model.

Original Models

Claude Opus 4

Anthropic's flagship: top-tier agentic coding and careful reasoning with a 1M-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. context windowThe maximum amount of text the model can "see" at once — prompt plus prior conversation plus any documents you give it. Measured in tokens (which are roughly three-quarters of a word each). A 128K context window is about 96,000 words of input — roughly a 400-page book. Larger context windows let the model work with bigger documents but cost more to run. — premium-priced, closed, and a common pick for high-stakes code and analysis.

Anthropic's prior flagship, now succeeded by the cost-neutral Opus 4.8 — still capable and available, but new deployments should generally start on 4.8.

Claude Opus 4.6 — an earlier Opus 4 flagship at $5/$25 with 1M context, superseded by 4.7/4.8.

Claude Opus 4.1 — a legacy flagship at the old $15/$75 premium tier, long superseded.

Claude Opus 4 — the first Opus 4 flagship, now several generations behind.

Claude Sonnet 4

The Claude most teams should actually use: near-Opus coding and reasoning at a third of the price, with a 1M-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. window and strong prompt-caching economics.

Claude Sonnet 4.5 — the prior recommended default with strong coding/agents, superseded by 4.6.

Claude Sonnet 4 — the first Sonnet 4 mid-tier, superseded by later Sonnet releases.

Claude Haiku 4

The fast, cheap Claude: near-frontier quality at the lowest current-generation price, built for high-volume routing, extraction, classification, and moderation.

Claude Sonnet 3

Claude Sonnet 3.7 — a legacy Sonnet that introduced extended thinking, still available for compatibility.

Claude Haiku 3

Claude Haiku 3.5 — a prior fast/cheap tier, superseded by Haiku 4.5.

Claude Haiku 3 — the legacy ultra-cheap tier ($0.25/$1.25), still the cheapest Claude though dated.

Sources