OpenAI
6 of 13 models-
GPT-5.5 CurrentVision Thinking Tool use Computer Use Cache
- Context
- 1.000K
- Input/1M
- —
- Output/1M
- —
- Release
- Apr 2026
Latest OpenAI flagship, 6 weeks after GPT-5.4. Fewer tokens per task, same latency. API not yet available, ChatGPT and Codex only. Pro variant available.
-
GPT-5.4 pro CurrentVision Thinking Tool use Cache
- Context
- 1.050K
- Input/1M
- $30.00
- Output/1M
- $180.00
- Release
- —
Premium GPT-5.4 variant for the most demanding tasks. Source: openai.com/api/pricing.
-
GPT-5.4 mini CurrentVision Thinking Tool use Cache
- Context
- 400K
- Input/1M
- $0.75
- Output/1M
- $4.50
- Release
- —
Compact GPT-5.4 variant, competitively priced. Source: openai.com/api/pricing.
-
GPT-5.4 nano CurrentVision Tool use
- Context
- 400K
- Input/1M
- $0.20
- Output/1M
- $1.25
- Release
- —
Smallest GPT-5.4 variant, for high-volume and simple tasks. Source: openai.com/api/pricing.
-
o4-mini CurrentReasoning Code Wiskunde Tool use
- Context
- 200K
- Input/1M
- $1.10
- Output/1M
- $4.40
- Release
- Jun 2025
Compact reasoning model from the o-series. Strong price-to-performance for AIME and coding. Source: openai.com/api/pricing.
-
GPT-5.3 Codex CurrentCode Agent workflow Tool use
- Context
- 400K
- Input/1M
- $1.75
- Output/1M
- $14.00
- Release
- Jan 2026
Coding-focused variant from the GPT-5.3 line. Same pricing as gpt-5.2. Source: openrouter.ai.
Anthropic
6 models-
Claude Fable 5 Current
- Context
- 1.000K
- Input/1M
- $10.00
- Output/1M
- $50.00
- Release
- Jun 2026
Mythos-class model with safety restrictions for general use. State-of-the-art on nearly all benchmarks.
-
Claude Opus 4.8 Current
- Context
- 1.000K
- Input/1M
- $5.00
- Output/1M
- $25.00
- Release
- May 2026
Successor to Opus 4.7. Four times less likely to miss code bugs. Adaptive thinking automatically adjusts reasoning depth. Fast mode (research preview): 2.5x higher output speed. Mid-conversation system messages and lower prompt cache threshold (1,024 tokens). Dynamic workflows for hundreds of parallel subagents.
-
Claude Mythos 5 Current
- Context
- 1.000K
- Input/1M
- $10.00
- Output/1M
- $50.00
- Release
- Jun 2026
Unrestricted version of Fable 5, only available to cyberdefenders via Project Glasswing.
-
Claude Sonnet 4.6 CurrentVision Thinking Tool use Computer Use Cache Batch
- Context
- 1.000K
- Input/1M
- $3.00
- Output/1M
- $15.00
- Release
- Dec 2025
The workhorse of the Claude family. Good price-to-performance for daily work, 1M context at standard pricing, works well with Claude Code and agentic workflows.
-
Claude Haiku 4.5 CurrentVision Tool use Cache Batch
- Context
- 200K
- Input/1M
- $1.00
- Output/1M
- $5.00
- Release
- Oct 2025
Anthropic's fastest and cheapest option. For high-volume tasks like classification, extraction, customer support and simple agent loops.
-
Claude Opus 4.7 LegacyVision Thinking Tool use Computer Use Cache Batch
- Context
- 1.000K
- Input/1M
- $5.00
- Output/1M
- $25.00
- Release
- Apr 2026
Anthropic flagship model. Sharpest vision (up to 3.75 megapixels) and new xhigh effort setting between high and max. Built-in cybersecurity abuse blocking. New tokenizer can produce up to 35% more tokens per input than 4.6.
-
Gemini 3.5 Flash Currenttext images audio video tool-use function-calling structured-output code-execution search-as-tool dynamic-thinking
- Context
- 1.049K
- Input/1M
- $1.50
- Output/1M
- $9.00
- Release
- May 2026
Four times faster than frontier models. Outperforms Gemini 3.1 Pro on coding and agentic benchmarks. Powers Gemini Spark under the hood. Available in GitHub Copilot.
-
Gemini 2.5 Pro CurrentVision Audio Video Thinking Tool use Cache
- Context
- 1.049K
- Input/1M
- $1.25
- Output/1M
- $10.00
- Release
- Mar 2025
Google flagship, strong in multimodal (video and audio). Under 200K tokens costs $1.25/$10; above that, pricing doubles to $2.50/$15 for the entire prompt.
-
Gemini 2.5 Flash CurrentVision Audio Video Tool use Cache
- Context
- 1.049K
- Input/1M
- $0.30
- Output/1M
- $2.50
- Release
- Jun 2025
The budget workhorse variant. Same multimodal features as Pro, but lower accuracy on complex tasks.
-
Gemini 3.1 Pro PreviewVision Thinking Multimodaal Tool use
- Context
- 1.049K
- Input/1M
- $2.00
- Output/1M
- $12.00
- Release
- Apr 2026
Preview release. Leads GPQA Diamond and HLE as of April 2026. Source: openrouter.ai.
-
Gemini 3 Pro PreviewVision Thinking Multimodaal Tool use
- Context
- —
- Input/1M
- —
- Output/1M
- —
- Release
- Mar 2026
Preview release. Pricing and exact context window not yet publicly disclosed.
-
Gemini 3 Flash PreviewVision Multimodaal Snel
- Context
- 1.049K
- Input/1M
- $0.50
- Output/1M
- $3.00
- Release
- Mar 2026
Preview release from the Gemini 3 series. Source: openrouter.ai.
xAI
3 models-
Grok 4 CurrentThinking Tool use Real-time search
- Context
- 256K
- Input/1M
- $3.00
- Output/1M
- $15.00
- Release
- Jul 2025
Reasoning-heavy model with built-in X/Twitter search. Source: openrouter.ai.
-
Grok 4 Fast CurrentTekst Snel
- Context
- 2.000K
- Input/1M
- $0.20
- Output/1M
- $0.50
- Release
- Mar 2026
Budget variant with 2M context window. Source: openrouter.ai.
-
Grok 3 LegacyTekst Tool use
- Context
- 131K
- Input/1M
- $3.00
- Output/1M
- $15.00
- Release
- Feb 2025
Previous generation Grok, replaced by Grok 4. Source: openrouter.ai.
DeepSeek
1 model-
DeepSeek V3.2 CurrentTool use Thinking Open source
- Context
- 128K
- Input/1M
- $0.27
- Output/1M
- $0.42
- Release
- Dec 2025
Open source Chinese model. Excellent price-to-performance ratio, up to 90% cheaper than Western competitors. Mixture-of-experts architecture with 671B parameters, 37B active per token.