Models
Models
Browse 158 AI models across 16 providers. Compare pricing, context windows, and capabilities.
Compare models
Side-by-side comparison of top models across key metrics.
Cost
Per 1M tokensGrok 4.1 Fast Reasoning
$0.2 input / $0.5 output
Gemini 3.1 Flash Lite Preview
$0.25 input / $1.5 output
Gemini 3 Flash Preview
$0.5 input / $3 output
Gemini 3.1 Pro Preview
$2 input / $12 output
GPT-5.4
$2.5 input / $15 output
Claude Sonnet 4.6
$3 input / $15 output
Claude Sonnet 4.5
$3 input / $15 output
Claude Sonnet 4.0
$3 input / $15 output
Claude Opus 4.6
$5 input / $25 output
GPT-5.4 Pro
$30 input / $180 output
All models
OpenAI
GPT-4.1
GPT-4.1 Mini
GPT-4.1 Nano
GPT-5.4 Pro
GPT-5.4
GPT-5.4 Mini
GPT-5.4 Nano
GPT-5.2 Pro
GPT-5.2
GPT-5.1
GPT-5 Pro
GPT-5
GPT-5 Mini
GPT-5 Nano
GPT-5 Chat Latest
o4 Mini
o3 Pro
o3
o3 Mini
o1
GPT-4o
Anthropic
Claude Opus 4.6
Claude Sonnet 4.6
Claude Opus 4.5
Claude Opus 4.1
Claude Opus 4.0
Claude Sonnet 4.5
Claude Sonnet 4.0
Claude Haiku 4.5
Claude 3 Haiku
Azure OpenAI
GPT-4o
GPT-5.4
GPT-5.4 Mini
GPT-5.4 Nano
GPT-5.2
GPT-5.1
GPT-5.1 Codex
GPT-5
GPT-5 Mini
GPT-5 Nano
GPT-5 Chat Latest
o3
o4 Mini
GPT-4.1
GPT-4.1 Mini
GPT-4.1 Nano
Model Router
Azure Anthropic
Claude Opus 4.6
Claude Opus 4.5
Claude Sonnet 4.5
Claude Opus 4.1
Claude Haiku 4.5
Gemini 3.1 Pro Preview
Gemini 3.1 Flash Lite Preview
Gemini 3 Flash Preview
Gemini 2.5 Pro
Gemini 2.5 Flash
Gemini 2.5 Flash Lite
Gemini 2.0 Flash
Gemini 2.0 Flash Lite
Deep Research Pro Preview 12 2025
Vertex AI
Gemini 3.1 Pro Preview
Gemini 3.1 Flash Lite Preview
Gemini 3 Pro Preview
Gemini 3 Flash Preview
Gemini 2.5 Pro
Gemini 2.5 Flash
Gemini 2.5 Flash Lite
Gemini 2.0 Flash
Gemini 2.0 Flash Lite
Deep Research Pro Preview 12 2025
DeepSeek
DeepSeek Chat
DeepSeek V3
DeepSeek R1
DeepSeek Reasoner
xAI
Grok 4 Latest
Grok 4 0709
Grok 4.1 Fast Reasoning
Grok 4.1 Fast Non Reasoning
Grok 4 Fast Reasoning
Grok 4 Fast Non Reasoning
Grok Code Fast 1
Grok 4.20 0309 Reasoning
Grok 4.20 0309 Non Reasoning
Grok 4.20 Multi Agent 0309
Grok 3 Latest
Grok 3 Fast Latest
Cerebras
GPT OSS 120B
Llama 3 1 8B
Qwen 3 235B A22B Instruct 2507
Zai GLM 4.7
Groq
OpenAI GPT OSS 120B
OpenAI GPT OSS 20B
OpenAI GPT OSS Safeguard 20B
Qwen 3 32B
Llama 3.1 8B Instant
Llama 3.3 70B Versatile
Meta Llama 4 Scout 17B 16E Instruct
Moonshot AI Kimi K2 Instruct 0905
Mistral AI
Mistral Large Latest
Mistral Large 2512
Mistral Small 2603
Devstral 2512
Mistral Large 2411
Magistral Medium Latest
Magistral Medium 2509
Magistral Small Latest
Magistral Small 2509
Mistral Medium Latest
Mistral Medium 2508
Mistral Medium 2505
Mistral Small Latest
Mistral Small 2506
Open Mistral Nemo
Codestral Latest
Codestral 2508
Devstral Latest
Devstral Small Latest
Devstral Small 2507
Devstral Medium 2507
Ministral 14B Latest
Ministral 14B 2512
Ministral 8B Latest
Ministral 8B 2512
Ministral 3B Latest
Ministral 3B 2512
AWS Bedrock
Anthropic Claude Opus 4.5
Anthropic Claude Sonnet 4.5
Anthropic Claude Haiku 4.5
Anthropic Claude Opus 4.1
Amazon Nova 2 Pro
Amazon Nova 2 Lite
Amazon Nova Premier
Amazon Nova Pro
Amazon Nova Lite
Amazon Nova Micro
Meta Llama 4 Maverick 17B Instruct
Meta Llama 4 Scout 17B Instruct
Meta Llama 3 70B Instruct
Meta Llama 3 2 90B Instruct
Meta Llama 3 2 11B Instruct
Meta Llama 3 2 3B Instruct
Meta Llama 3 2 1B Instruct
Meta Llama 3 1 405B Instruct
Meta Llama 3 1 70B Instruct
Meta Llama 3 1 8B Instruct
Mistral Large 3 675B Instruct
Mistral Large 2411
Mistral Large 2407
Mistral Pixtral Large 2502
Mistral Magistral Small 2509
Mistral Ministral 3 14B Instruct
Mistral Ministral 3 8B Instruct
Mistral Ministral 3 3B Instruct
Mistral Mixtral 8x7b Instruct
Amazon Titan Text Premier
Cohere Command R Plus
Cohere Command R
Dynamic model catalogs
These providers load their model lists dynamically at runtime.
Frequently asked questions
The most important factors for agent tasks are reliable tool use (function calling), a large enough context window to track conversation history and tool outputs, and consistent instruction following. In Sim, OpenAI GPT-4.1, Anthropic Claude Sonnet, and Google Gemini 2.5 Pro are popular choices — each supports tool use, structured outputs, and context windows of 128K tokens or more. For cost-sensitive or high-throughput agents, Groq and Cerebras offer significantly faster inference at lower cost.