Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Meta

4.5 ★ — Massive ecosystem impact, genuinely permissive license for the vast majority of businesses, and a steady release cadence — but the 700M MAU clause and the EU restrictions on multimodal models are real catches worth knowing.

Type

big-tech-lab

Country

Founded

2004

License posture

predominantly-open-weight

Website

https://ai.meta.com

open-weight
us-based
big-tech-lab
commercial-friendly

Quick Take

Meta is the biggest force in open-weight AI — their Llama family is the most-used foundation for fine-tunes and derivatives on the planet, with a license that works for most businesses.

Who They Are

Meta is the social-media giant behind Facebook, Instagram, and WhatsApp — and, less obviously to the general public, one of the two or three most influential AI research organizations in the world. Their AI work runs through two connected groups: the Fundamental AI Research lab (FAIR), which dates back to 2013 and publishes academic research, and the product-focused Meta AI group that ships the Llama models and the Meta AI assistant embedded across their apps.

When it comes to AI models specifically, Meta has done something that almost none of their peers have: they release the actual trained weights of their foundation models to the public, for free, under a license that allows most commercial use. That single decision has shaped the entire open-source AI landscape. The majority of open-weight fine-tunes, specialized models, and locally-runnable chatbots you'll encounter anywhere — on Hugging Face, in Ollama, in LM Studio — are descended from Meta's Llama family. When a research lab, a startup, or an individual developer wants to build their own AI without paying per-token fees to OpenAI or Anthropic, Llama is almost always where they start.

Meta's AI strategy is different from their peers in a way worth understanding. Google has Gemini (closed API) and Gemma (open-weight) as two tracks. OpenAI and Anthropic are primarily closed API shops. Meta has bet almost everything on open-weight — they ship the closed-model experience through their consumer products (Meta AI in WhatsApp, Instagram, etc.) but the models themselves go out to the public.

Model Philosophy

Meta's own framing is that opening up the weights accelerates AI progress broadly and builds an ecosystem Meta benefits from even when they're not the one charging for inference. The more developers, researchers, and businesses standardize on Llama as their foundation, the more Meta's technical decisions become industry defaults. That's a sound long-term bet and it's why Llama keeps shipping at a steady cadence even when the economics of giving away billion-dollar training runs look strange from the outside.

What you'll find in practice: Llama models are released quickly after announcement, with weights on Hugging Face and Meta's own download site. Licenses are permissive but not unconditional (see below). Each major version introduces real improvements rather than token refreshes — 3.1 added the 128K context window and tool use, 3.2 added vision and small on-device models, 3.3 brought 405B-class performance into a 70B model, and Llama 4 brought mixture-of-experts architecture and native multimodality. Behemoth, the largest Llama 4 model, remains in training as of this writing.

What To Know Before You Commit

Three things matter if you're thinking about building on Llama for a business.

The 700-million-user clause. The Llama Community License lets you use Llama commercially for free — unless, on the day the specific Llama version released, your product had more than 700 million monthly active users. In that case, you need a separate license from Meta. For virtually every small business, agency, startup, and mid-sized enterprise, this clause is irrelevant. For Apple, Google, Amazon, ByteDance, Tencent, and a handful of others, it matters. If you might be close to that threshold, get a lawyer. For everyone else, treat it as permissive commercial use.

The "trained on Llama outputs" clause. You cannot use Llama or its outputs to train another foundation model that isn't a Llama derivative. Meaning: you can fine-tune Llama, you can distill from Llama, you can generate synthetic data with Llama to train other Llama models, but you cannot use Llama to train a from-scratch competitor. Again, this affects essentially nobody outside of other AI labs.

The EU multimodal restriction (Llama 4 specifically). Llama 4's Acceptable Use Policy restricts use of its multimodal capabilities in the European Union due to EU AI Act compliance concerns. Text-only use appears unaffected. If your business is EU-based or serves EU customers and you need the multimodal features, verify current status with Meta before building on Llama 4 — this one is still evolving.

Beyond those three, Llama is among the most business-friendly open-weight licenses in existence. You can modify the models, redistribute them, fine-tune them for specific domains, and build commercial products on top of them, provided you include the license file and display "Built with Llama" in your product.

Original Models

Safeguards

Meta's multimodal safety classifier for LLM applications. Screens prompts and responses (text and images) against the MLCommons hazards taxonomy. Replaces Llama Guard 3 8B and Llama Guard 3 11B-Vision with a single model.

Llama Prompt Guard

Meta's smallest prompt-injection detector — 22M parameters, sub-millisecond CPU inference, English-only. Targeted at high-throughput pipelines where the 86M variant's latency or cost is prohibitive.

Meta's prompt-injection detector — 86M parameter multilingual classifier built on mDeBERTa (not Llama architecture). Labels incoming prompts as benign or malicious to catch jailbreak attempts before they reach your primary LLM. Runs on CPU.

Llama 4

Base pretrained variant of Meta's Llama 4 Maverick. Frontier-class MoE with 17B active parameters out of 400B total, 128 experts, natively multimodal, 1M context — designed for fine-tuning at scale rather than direct chat use.

Instruction-tuned variant of Meta's Llama 4 Maverick — Meta's largest open-weight release. 17B active / 400B total MoE with 128 experts, natively multimodal, 1M context window. Scores 80.5 on MMLU Pro.

Base pretrained variant of Meta's Llama 4 Scout. Mixture-of-Experts architecture with 17B active parameters out of 109B total, natively multimodal, designed for fine-tuning rather than direct chat use.

Meta's first mixture-of-experts and first natively multimodal open-weight model, with a 10-million-token context window and an EU restriction business owners need to know about.

Llama 3 3

Meta's efficiency milestone — a 70-billion-parameter model that Meta claims matches their much larger 405B model on most tasks, at a fraction of the cost to run.

Llama 3 2 Vision

Meta's 11B Llama 3.2 Vision base model — multimodal (text + image input) for research and fine-tuning. EU-domiciled entities are excluded from multimodal license rights. Most business use cases want the Instruct variant.

Meta's 11B Llama 3.2 Vision chat model — multimodal, handles text and image input for visual reasoning, document analysis, and chart reading. EU-domiciled entities are excluded from multimodal license rights.

Meta's 90B Llama 3.2 Vision base model — multimodal flagship of the 3.2 generation, for research and fine-tuning. EU-domiciled entities are excluded from multimodal license rights. Most business use cases want the Instruct variant.

Meta's 90B Llama 3.2 Vision chat model — multimodal flagship of the 3.2 generation, strong at complex visual reasoning and document analysis. EU-domiciled entities are excluded from multimodal license rights.

Llama 3 2

Base pretrained variant of Meta's smallest Llama 3.2 model. Text-only, 1.23B parameters, designed for on-device and edge deployment. For fine-tuning rather than direct chat use.

Meta's smallest instruction-tuned Llama 3.2 model. 1.23B parameters, text-only, designed for on-device chat, summarization, and prompt rewriting on phones or edge hardware. Commercial use allowed below the 700M MAU threshold.

Base pretrained variant of Meta's Llama 3.2 3B. Text-only, 3.21B parameters, designed for edge deployment and as a starting point for domain-specific fine-tuning.

Meta's Llama 3.2 3B Instruct — small instruction-tuned text model designed for mobile and edge AI assistants, with summarization, tool use, and multilingual dialogue capabilities. Commercial use allowed below the 700M MAU threshold.

Llama 3 1

Meta's Llama 3.1 405B base model — the largest open-weight dense model ever released at the time. Base variant for research and large-scale fine-tuning; production deployments use the Instruct variant via hosted APIs.

Meta's Llama 3.1 405B chat model — frontier-scale dense model with 128K context and strong multilingual reasoning. Practical access is through hosted API providers; self-hosting requires a multi-GPU server cluster.

Llama 3.1 70B — the 70B Llama 3.1 base, widely used as a fine-tuning foundation (e.g. for Hermes).

Meta's Llama 3.1 8B base model — 8B parameters, 128K context, multilingual. Base variant for fine-tuning; the Instruct variant has a full catalog entry.

Meta's small-but-serious open-weight model — fast, multilingual, and runs on a decent laptop with quantization, with a commercial license that works for almost any business.

Meta

Quick Take

Who They Are

Model Philosophy

What To Know Before You Commit

Original Models

Safeguards

Identity

Technical specs

License

Access

Sources

Llama Prompt Guard

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Llama 4

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Llama 3 3

Llama 3 2 Vision

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Llama 3 2

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License

Access

Sources

Identity

Technical specs

License