What is a context window?

The context window is the maximum number of tokens a model can consider at once: your prompt plus its response. A 1M-token window can hold roughly 750,000 words, enough for very large documents or codebases.

Are these prices and specs current?

They reflect published figures as of the last-verified date shown under the table, with links to each provider’s pricing page. Models and prices change frequently, so confirm before committing to a budget.

Free tool · no signup

AI Model Comparison

Compare current AI models from Anthropic, OpenAI and Google on context window, input/output price and key strengths. Sort and filter to find the right model for your task and budget.

Model	Provider	Context	Max out	Input $/1M	Output $/1M	Notes
GPT-4.1 nano	OpenAI	1M	32K	$0.10	$0.40	Cheapest model in OpenAI’s lineup.
Gemini 2.5 Flash	Google	1M	64K	$0.30	$2.50	Fast and cheap; output price includes thinking tokens.
GPT-4.1 mini	OpenAI	1M	32K	$0.40	$1.60	Best value for large-context tasks.
Claude Haiku 4.5	Anthropic	200K	64K	$1.00	$5.00	Fastest, most cost-effective Claude model.
Gemini 2.5 Pro	Google	1M	64K	$1.25	$10.00	Tiered: prompts over 200K tokens bill at $2.50 / $15.
GPT-4.1	OpenAI	1M	32K	$2.00	$8.00	Budget long-context model.
GPT-5.4	OpenAI	1M	128K	$2.50	$15.00	Recommended production workhorse; 1M context.
Claude Sonnet 4.6	Anthropic	1M	64K	$3.00	$15.00	Best balance of speed and intelligence.
Claude Opus 4.8	Anthropic	1M	128K	$5.00	$25.00	Flagship for long-horizon agentic + knowledge work.
GPT-5.5	OpenAI	400K	128K	$5.00	$30.00	OpenAI flagship for the hardest reasoning.
Claude Fable 5	Anthropic	1M	128K	$10.00	$50.00	Most capable Claude model; always-on thinking.

Specs and list prices (per 1M tokens) last verified June 2026. Confirm against the source: Anthropic ↗ · OpenAI ↗ · Google ↗
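The tiered note on Gemini 2.5 Pro is easy to misread, so here is a minimal sketch of how it might be applied. It assumes the higher rate covers the whole request once the prompt exceeds 200K tokens; the table note does not spell this out, so check the provider's pricing page. The function name and constants are illustrative only.

```python
# Rates from the Gemini 2.5 Pro row and its note (USD per 1M tokens).
BASE_RATES = (1.25, 10.00)     # (input, output) for prompts up to 200K tokens
LONG_RATES = (2.50, 15.00)     # (input, output) for prompts over 200K tokens
TIER_THRESHOLD = 200_000       # prompt size at which the higher tier starts

def tiered_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost in USD under the assumed tier rule.

    Assumption: the higher rate applies to the entire request, not just
    to the tokens beyond the threshold.
    """
    if prompt_tokens > TIER_THRESHOLD:
        in_rate, out_rate = LONG_RATES
    else:
        in_rate, out_rate = BASE_RATES
    return (prompt_tokens * in_rate + output_tokens * out_rate) / 1_000_000

short_prompt = tiered_cost(100_000, 10_000)   # billed at the base tier
long_prompt = tiered_cost(300_000, 10_000)    # billed at the higher tier
print(f"${short_prompt:.3f} vs ${long_prompt:.3f}")
```

Under this assumption, tripling the prompt size can cost more than three times as much, which is worth knowing before you send large documents.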

This AI model comparison puts current models from Anthropic, OpenAI and Google side by side on context window, input and output price, and key strengths. Sort by any column and filter by provider to compare Claude, GPT and Gemini at a glance and find the right model for your task and budget.

There is no single “best” model; the right choice depends on the work, the volume and how much you’re willing to spend.

How to choose an AI model

For the hardest reasoning, long agentic workflows and high-stakes output, the flagship models win. For high-volume, latency-sensitive or cost-sensitive work, the smaller models are dramatically cheaper and usually good enough. Sort by price to find the cheapest option that clears your quality bar, or by context window if you need to fit large documents.
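The same sort-and-filter logic can be sketched in a few lines of Python. The rows are copied from the table above; the `passed_eval` set is an assumption that stands in for whatever quality check you run yourself, since the table carries no quality scores.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Model:
    name: str
    context_tokens: int    # context window, in tokens
    input_price: float     # USD per 1M input tokens
    output_price: float    # USD per 1M output tokens

# A few rows from the comparison table (list prices, last verified June 2026).
MODELS = [
    Model("GPT-4.1 nano", 1_000_000, 0.10, 0.40),
    Model("Claude Haiku 4.5", 200_000, 1.00, 5.00),
    Model("GPT-5.4", 1_000_000, 2.50, 15.00),
    Model("Claude Opus 4.8", 1_000_000, 5.00, 25.00),
    Model("GPT-5.5", 400_000, 5.00, 30.00),
]

def cheapest_candidates(models, min_context, passed_eval):
    """Keep models that fit the context need and cleared your own
    quality bar, then sort cheapest first (input price, then output)."""
    ok = [
        m for m in models
        if m.context_tokens >= min_context and m.name in passed_eval
    ]
    return sorted(ok, key=lambda m: (m.input_price, m.output_price))

# Hypothetical result of your own evaluation on a sample of real tasks.
passed_eval = {"Claude Haiku 4.5", "GPT-5.4", "Claude Opus 4.8"}

for m in cheapest_candidates(MODELS, min_context=300_000, passed_eval=passed_eval):
    print(m.name, m.input_price, m.output_price)
```

The first item in the result is the cheapest model that both fits your documents and passed your check; everything after it is a fallback.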

Context window, price and speed

The context window is how much text a model can consider at once; a 1M-token window holds roughly 750,000 words. Input and output prices are quoted per million tokens, with output typically costing several times more than input. Smaller models are also faster, which matters for interactive and real-time features.
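To make the arithmetic concrete, here is a small sketch that turns a word count into a token estimate, checks it against a context window, and prices a single request. The 0.75 words-per-token ratio is the rule of thumb quoted above, not an exact tokenizer count, and the function names are illustrative.

```python
WORDS_PER_TOKEN = 0.75  # rule of thumb: 1M tokens is roughly 750,000 words

def estimate_tokens(word_count: int) -> int:
    """Rough token estimate from a word count (real tokenizers vary)."""
    return round(word_count / WORDS_PER_TOKEN)

def fits_context(prompt_tokens: int, max_output_tokens: int, context_tokens: int) -> bool:
    """The window has to hold the prompt plus the response."""
    return prompt_tokens + max_output_tokens <= context_tokens

def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Example: a 150,000-word document sent to Claude Sonnet 4.6 ($3 in, $15 out),
# asking for a summary of about 2,000 tokens.
prompt_tokens = estimate_tokens(150_000)
print(prompt_tokens)
print(fits_context(prompt_tokens, 2_000, 1_000_000))
print(round(request_cost(prompt_tokens, 2_000, 3.00, 15.00), 2))
```

In this example the input dominates the bill even though output tokens cost five times more each, because the prompt is a hundred times larger than the response.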

Frontier vs small models

A common pattern is to route most traffic to a small, cheap model and escalate only the hard cases to a frontier model. This “model routing” keeps costs low without sacrificing quality where it counts. The figures here were last verified on the date shown under the table, with links to each provider for the current numbers.
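A minimal sketch of the routing pattern follows. Everything in it is hypothetical: `looks_hard` is a placeholder heuristic, and `fake_small` and `fake_frontier` stand in for real API calls to whichever two models you pick. Production routers often use a classifier or a confidence check instead of a keyword rule.

```python
from typing import Callable, Tuple

def looks_hard(prompt: str) -> bool:
    """Placeholder heuristic: escalate long prompts or explicit
    multi-step requests. Replace with your own rule or classifier."""
    return len(prompt.split()) > 200 or "step by step" in prompt.lower()

def route(prompt: str,
          call_small: Callable[[str], str],
          call_frontier: Callable[[str], str],
          is_hard: Callable[[str], bool] = looks_hard) -> Tuple[str, str]:
    """Return (tier, response): hard prompts go to the frontier model,
    everything else to the small one."""
    if is_hard(prompt):
        return "frontier", call_frontier(prompt)
    return "small", call_small(prompt)

# Stand-ins for real model calls, so the sketch runs without any API key.
def fake_small(prompt: str) -> str:
    return "small-model answer"

def fake_frontier(prompt: str) -> str:
    return "frontier-model answer"

print(route("Summarise this email.", fake_small, fake_frontier))
print(route("Plan the database migration step by step.", fake_small, fake_frontier))
```

Logging the tier alongside each response lets you measure how much traffic escalates, which is the number that drives your blended cost.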

Want the work done, not just the tool?

OpenHelm runs AI agents in a secure cloud environment to do the actual task (research, outreach, reporting, monitoring) and hands back the result for your sign-off.

Frequently asked questions