← Back to hard AIs

Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Google

Gemini 3.1 Pro

Model family: gemini-3-1

Context
1,048,576 tokens
Released
2026-02-18
Openness
closed-api
License
Google Gemini API Terms · commercial: yes
Cost tier
paid-api
Rating
4.0 — A strong deep-reasoning and long-context flagship, but a transitional one: the newer 3.5 Flash already beats it on coding, and Gemini 3.5 Pro is due to succeed it within weeks — capable today, clearly on the way out.
Modalities
audio-input, image-input, text, video-input
Capabilities
chat, coding, function-calling, instruction-following, long-context, math, multilingual, reasoning, tool-use, vision
Access
api-first-party, api-third-party, hosted-chat-ui

Quick Take

Google's current Pro flagship: the model to reach for when a job leans on deep reasoning or precise retrieval across very long documents — capable today, but with a successor weeks away.

Plain-English Description

Gemini 3.1 Pro, released in February 2026, is the higher-end tier of the Gemini line — the model Google positions for the hardest work: dense reasoning, careful analysis, and finding the right detail in very large documents. Where the newer Gemini 3.5 Flash is the fast everyday default, 3.1 Pro is the deliberate, heavier model you call in when accuracy on a difficult problem matters more than speed or cost.

Here's the honest, slightly awkward part: the recently-released 3.5 Flash now beats 3.1 Pro on coding and agentic benchmarks. What 3.1 Pro still wins is the genuinely hard reasoning (it leads on academic-reasoning tests like Humanity's Last Exam) and precise long-context retrieval — finding a specific fact buried in a hundred-thousand-tokenThe basic unit of text a model reads and writes. Tokens are roughly three-quarters of a word in English — so 100 tokens is about 75 words. Models don't see letters or words directly; they see tokens. Pricing is almost always quoted per million tokens, and context windows are measured in tokens rather than words. document, where the larger model's depth shows. So it remains the right tool for a specific, demanding slice of work.

The bigger context: Google announced Gemini 3.5 Pro at I/O 2026 and said it ships "next month" (June 2026). So 3.1 Pro is a model in transition — fully capable now, but about to be succeeded. If you're choosing a Pro-tier Gemini for the long term, it's worth watching for 3.5 Pro before committing (we're tracking it on the watchlist).

Best For

  • Deep-reasoning tasks — dense legal, financial, or scientific analysis where correctness beats speed.
  • Precise long-context retrieval — finding specific facts across very large documents or codebases.
  • MultimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default. reasoning over images, audio, and video where you want the higher Pro tier.
  • Existing Gemini Pro workloads that need a stable model today while you evaluate the move to 3.5 Pro.

Not For

  • Everyday or high-volume work — Gemini 3.5 Flash is faster, cheaper, and now better at coding/agentic tasks.
  • New long-term Pro deployments — Gemini 3.5 Pro is imminent; consider waiting (see the watchlist).
  • Self-hosting or data-in-house requirements — it's closed; use Gemma 4 31B.
  • Cost-sensitive use — at $2/$12 it's the priciest Gemini tier short of the unreleased 3.5 Pro.

License — Plain-English Summary

Proprietary, like all of Gemini — accessed through Google's API under Google Cloud's terms, with commercial rights to your outputs but none to the model. The diligence items are Google's Generative AI Prohibited Use Policy and the Gemini API / Vertex AI data terms (with the usual Google Cloud enterprise data-residency options). No weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself., no self-hosting; for that, Gemma is Google's open line.

How It Compares

Against Gemini 3.5 Flash, 3.1 Pro is slower and pricier and now trails on coding, but still leads on the hardest reasoning and long-context precision — a narrowing but real advantage until 3.5 Pro lands. Against the closed flagships from OpenAI and Anthropic, Gemini Pro's calling cards are long-context retrieval and native multimodality, especially video. Against open alternatives like Gemma 4 31B or the China-based open flagships, the trade is the usual closed-vs-open one: managed frontier capability and no self-hosting, versus weightsThe numerical values inside a trained model that encode everything it has learned. A model is, functionally, a giant list of weights — tens of billions of numbers for a mid-sized model, hundreds of billions for a frontier model. "Open-weight" means those numbers are published. "Downloading the weights" means getting the actual file you'd need to run the model yourself. you own and run yourself.

Cost

API input (per 1M tokens)
$2.00
API output (per 1M tokens)
$12.00
API providers
google-gemini-api, google-vertex-ai, openrouter
Notes
$2.00 input / $12.00 output per million tokens — the higher Pro tier above Flash. Available via the Gemini API, Vertex AI, and to paid Gemini app subscribers. No self-hosting.

Comparable models

Commercial-use conditions

Commercial use permitted through the Gemini API / Vertex AI under Google Cloud's terms; no weights, no modification, no redistribution.

Sources