What does context window size mean when running an AI agent?

The context window is the total number of tokens a model can process in a single call, including your system prompt, conversation history, tool call results, and any documents you pass in. For agents running multi-step tasks, context fills up quickly — each tool result and each retrieved document adds tokens. A 128K-token context window fits roughly 300 pages of text; models like Gemini 2.5 Pro support up to 1M tokens, enough to hold an entire codebase in a single pass.

Are model prices shown per million tokens?

Yes. Input, cached input, and output prices are all listed per one million tokens, matching how providers bill through their APIs. For agents that chain multiple calls, costs compound quickly — an agent completing 100 turns at 10K tokens each consumes roughly 1M tokens per session. Cached input pricing applies when a provider supports prompt caching, where a repeated prefix like a system prompt is billed at a reduced rate.

Which AI models support tool use and function calling?

Tool use — also called function calling — lets an agent invoke external APIs, query databases, run code, or take any action you define. In Sim, all first-party models from OpenAI, Anthropic, Google, Mistral, Groq, Cerebras, and xAI support tool use. Look for the Tool Use capability tag on any model card in this directory to confirm support.

How do I add a model to a Sim agent workflow?

Open any workflow in Sim, add an Agent block, and select your provider and model from the model picker inside that block. Every model listed in this directory is available in the Agent block. Swapping models takes one click and does not affect the rest of your workflow, making it straightforward to test different models on the same task without rebuilding anything.

AI Models Directory | Acme AI Studio

Models

Browse 158 AI models across 16 providers. Compare pricing, context windows, and capabilities.

Compare models

Side-by-side comparison of top models across key metrics.

All models

OpenAI

21 models · OpenAI's models

GPT-4.1

gpt-4.1 · Input $2/1M · Output $8/1M · 1.05M context

GPT-4.1 Mini

gpt-4.1-mini · Input $0.4/1M · Output $1.6/1M · 1.05M context

GPT-4.1 Nano

gpt-4.1-nano · Input $0.1/1M · Output $0.4/1M · 1.05M context

GPT-5.4 Pro

gpt-5.4-pro · Input $30/1M · Output $180/1M · 1.05M context

GPT-5.4

gpt-5.4 · Input $2.5/1M · Output $15/1M · 1.05M context

GPT-5.4 Mini

gpt-5.4-mini · Input $0.75/1M · Output $4.5/1M · 400k context

GPT-5.4 Nano

gpt-5.4-nano · Input $0.2/1M · Output $1.25/1M · 400k context

GPT-5.2 Pro

gpt-5.2-pro · Input $21/1M · Output $168/1M · 400k context

GPT-5.2

gpt-5.2 · Input $1.75/1M · Output $14/1M · 400k context

GPT-5.1

gpt-5.1 · Input $1.25/1M · Output $10/1M · 400k context

GPT-5 Pro

gpt-5-pro · Input $15/1M · Output $120/1M · 400k context

GPT-5

gpt-5 · Input $1.25/1M · Output $10/1M · 400k context

GPT-5 Mini

gpt-5-mini · Input $0.25/1M · Output $2/1M · 400k context

GPT-5 Nano

gpt-5-nano · Input $0.05/1M · Output $0.4/1M · 400k context

GPT-5 Chat Latest

gpt-5-chat-latest · Input $1.25/1M · Output $10/1M · 128k context

o4 Mini

o4-mini · Input $1.1/1M · Output $4.4/1M · 200k context

o3 Pro

o3-pro · Input $20/1M · Output $80/1M · 200k context

o3

o3 · Input $2/1M · Output $8/1M · 200k context

o3 Mini

o3-mini · Input $1.1/1M · Output $4.4/1M · 200k context

o1

o1 · Input $15/1M · Output $60/1M · 200k context

GPT-4o

gpt-4o · Input $2.5/1M · Output $10/1M · 128k context

Anthropic

9 models · Anthropic's Claude models

Claude Opus 4.6

claude-opus-4-6 · Input $5/1M · Output $25/1M · 1M context

Claude Sonnet 4.6

claude-sonnet-4-6 · Input $3/1M · Output $15/1M · 1M context

Claude Opus 4.5

claude-opus-4-5 · Input $5/1M · Output $25/1M · 200k context

Claude Opus 4.1

claude-opus-4-1 · Input $15/1M · Output $75/1M · 200k context

Claude Opus 4.0

claude-opus-4-0 · Input $15/1M · Output $75/1M · 200k context

Claude Sonnet 4.5

claude-sonnet-4-5 · Input $3/1M · Output $15/1M · 1M context

Claude Sonnet 4.0

claude-sonnet-4-0 · Input $3/1M · Output $15/1M · 1M context

Claude Haiku 4.5

claude-haiku-4-5 · Input $1/1M · Output $5/1M · 200k context

Claude 3 Haiku

claude-3-haiku-20240307 · Input $0.25/1M · Output $1.25/1M · 200k context

Azure OpenAI

17 models · Microsoft Azure OpenAI Service models

GPT-4o

azure/gpt-4o · Input $2.5/1M · Output $10/1M · 128k context

GPT-5.4

azure/gpt-5.4 · Input $2.5/1M · Output $15/1M · 1.05M context

GPT-5.4 Mini

azure/gpt-5.4-mini · Input $0.75/1M · Output $4.5/1M · 400k context

GPT-5.4 Nano

azure/gpt-5.4-nano · Input $0.2/1M · Output $1.25/1M · 400k context

GPT-5.2

azure/gpt-5.2 · Input $1.75/1M · Output $14/1M · 400k context

GPT-5.1

azure/gpt-5.1 · Input $1.25/1M · Output $10/1M · 400k context

GPT-5.1 Codex

azure/gpt-5.1-codex · Input $1.25/1M · Output $10/1M · 400k context

GPT-5

azure/gpt-5 · Input $1.25/1M · Output $10/1M · 400k context

GPT-5 Mini

azure/gpt-5-mini · Input $0.25/1M · Output $2/1M · 400k context

GPT-5 Nano

azure/gpt-5-nano · Input $0.05/1M · Output $0.4/1M · 400k context

GPT-5 Chat Latest

azure/gpt-5-chat-latest · Input $1.25/1M · Output $10/1M · 128k context

o3

azure/o3 · Input $2/1M · Output $8/1M · 200k context

o4 Mini

azure/o4-mini · Input $1.1/1M · Output $4.4/1M · 200k context

GPT-4.1

azure/gpt-4.1 · Input $2/1M · Output $8/1M · 1.05M context

GPT-4.1 Mini

azure/gpt-4.1-mini · Input $0.4/1M · Output $1.6/1M · 1.05M context

GPT-4.1 Nano

azure/gpt-4.1-nano · Input $0.1/1M · Output $0.4/1M · 1.05M context

Model Router

azure/model-router · Input $2/1M · Output $8/1M · 200k context

Azure Anthropic

5 models · Anthropic Claude models via Azure AI Foundry

Claude Opus 4.6

azure-anthropic/claude-opus-4-6 · Input $5/1M · Output $25/1M · 200k context

Claude Opus 4.5

azure-anthropic/claude-opus-4-5 · Input $5/1M · Output $25/1M · 200k context

Claude Sonnet 4.5

azure-anthropic/claude-sonnet-4-5 · Input $3/1M · Output $15/1M · 200k context

Claude Opus 4.1

azure-anthropic/claude-opus-4-1 · Input $15/1M · Output $75/1M · 200k context

Claude Haiku 4.5

azure-anthropic/claude-haiku-4-5 · Input $1/1M · Output $5/1M · 200k context

Google

9 models · Google's Gemini models

Gemini 3.1 Pro Preview

gemini-3.1-pro-preview · Input $2/1M · Output $12/1M · 1.05M context

Gemini 3.1 Flash Lite Preview

gemini-3.1-flash-lite-preview · Input $0.25/1M · Output $1.5/1M · 1.05M context

Gemini 3 Flash Preview

gemini-3-flash-preview · Input $0.5/1M · Output $3/1M · 1M context

Gemini 2.5 Pro

gemini-2.5-pro · Input $1.25/1M · Output $10/1M · 1.05M context

Gemini 2.5 Flash

gemini-2.5-flash · Input $0.3/1M · Output $2.5/1M · 1.05M context

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite · Input $0.1/1M · Output $0.4/1M · 1.05M context

Gemini 2.0 Flash

gemini-2.0-flash · Input $0.1/1M · Output $0.4/1M · 1.05M context

Gemini 2.0 Flash Lite

gemini-2.0-flash-lite · Input $0.075/1M · Output $0.3/1M · 1.05M context

Deep Research Pro Preview 12 2025

deep-research-pro-preview-12-2025 · Input $2/1M · Output $2/1M · 1M context

Vertex AI

10 models · Google's Vertex AI platform for Gemini models

Gemini 3.1 Pro Preview

vertex/gemini-3.1-pro-preview · Input $2/1M · Output $12/1M · 1.05M context

Gemini 3.1 Flash Lite Preview

vertex/gemini-3.1-flash-lite-preview · Input $0.25/1M · Output $1.5/1M · 1.05M context

Gemini 3 Pro Preview

vertex/gemini-3-pro-preview · Input $2/1M · Output $12/1M · 1M context

Gemini 3 Flash Preview

vertex/gemini-3-flash-preview · Input $0.5/1M · Output $3/1M · 1M context

Gemini 2.5 Pro

vertex/gemini-2.5-pro · Input $1.25/1M · Output $10/1M · 1.05M context

Gemini 2.5 Flash

vertex/gemini-2.5-flash · Input $0.3/1M · Output $2.5/1M · 1.05M context

Gemini 2.5 Flash Lite

vertex/gemini-2.5-flash-lite · Input $0.1/1M · Output $0.4/1M · 1.05M context

Gemini 2.0 Flash

vertex/gemini-2.0-flash · Input $0.1/1M · Output $0.4/1M · 1.05M context

Gemini 2.0 Flash Lite

vertex/gemini-2.0-flash-lite · Input $0.075/1M · Output $0.3/1M · 1.05M context

Deep Research Pro Preview 12 2025

vertex/deep-research-pro-preview-12-2025 · Input $2/1M · Output $2/1M · 1M context

DeepSeek

4 models · DeepSeek's chat models

DeepSeek Chat

deepseek-chat · Input $0.28/1M · Output $0.42/1M · 128k context

DeepSeek V3

deepseek-v3 · Input $0.28/1M · Output $0.42/1M · 128k context

DeepSeek R1

deepseek-r1 · Input $0.55/1M · Output $2.19/1M · 128k context

DeepSeek Reasoner

deepseek-reasoner · Input $0.28/1M · Output $0.42/1M · 128k context

xAI

12 models · xAI's Grok models

Grok 4 Latest

grok-4-latest · Input $3/1M · Output $15/1M · 256k context

Grok 4 0709

grok-4-0709 · Input $3/1M · Output $15/1M · 256k context

Grok 4.1 Fast Reasoning

grok-4-1-fast-reasoning · Input $0.2/1M · Output $0.5/1M · 2M context

Grok 4.1 Fast Non Reasoning

grok-4-1-fast-non-reasoning · Input $0.2/1M · Output $0.5/1M · 2M context

Grok 4 Fast Reasoning

grok-4-fast-reasoning · Input $0.2/1M · Output $0.5/1M · 2M context

Grok 4 Fast Non Reasoning

grok-4-fast-non-reasoning · Input $0.2/1M · Output $0.5/1M · 2M context

Grok Code Fast 1

grok-code-fast-1 · Input $0.2/1M · Output $1.5/1M · 256k context

Grok 4.20 0309 Reasoning

grok-4.20-0309-reasoning · Input $2/1M · Output $6/1M · 2M context

Grok 4.20 0309 Non Reasoning

grok-4.20-0309-non-reasoning · Input $2/1M · Output $6/1M · 2M context

Grok 4.20 Multi Agent 0309

grok-4.20-multi-agent-0309 · Input $2/1M · Output $6/1M · 2M context

Grok 3 Latest

grok-3-latest · Input $3/1M · Output $15/1M · 131k context

Grok 3 Fast Latest

grok-3-fast-latest · Input $5/1M · Output $25/1M · 131k context

Cerebras

4 models · Cerebras Cloud LLMs

GPT OSS 120B

cerebras/gpt-oss-120b · Input $0.35/1M · Output $0.75/1M · 131k context

Llama 3 1 8B

cerebras/llama3.1-8b · Input $0.1/1M · Output $0.1/1M · 33k context

Qwen 3 235B A22B Instruct 2507

cerebras/qwen-3-235b-a22b-instruct-2507 · Input $0.6/1M · Output $1.2/1M · 131k context

Zai GLM 4.7

cerebras/zai-glm-4.7 · Input $2.25/1M · Output $2.75/1M · 131k context

Groq

8 models · Groq's LLM models with high-performance inference

OpenAI GPT OSS 120B

groq/openai/gpt-oss-120b · Input $0.15/1M · Output $0.6/1M · 131k context

OpenAI GPT OSS 20B

groq/openai/gpt-oss-20b · Input $0.075/1M · Output $0.3/1M · 131k context

OpenAI GPT OSS Safeguard 20B

groq/openai/gpt-oss-safeguard-20b · Input $0.075/1M · Output $0.3/1M · 131k context

Qwen 3 32B

groq/qwen/qwen3-32b · Input $0.29/1M · Output $0.59/1M · 131k context

Llama 3.1 8B Instant

groq/llama-3.1-8b-instant · Input $0.05/1M · Output $0.08/1M · 131k context

Llama 3.3 70B Versatile

groq/llama-3.3-70b-versatile · Input $0.59/1M · Output $0.79/1M · 131k context

Meta Llama 4 Scout 17B 16E Instruct

groq/meta-llama/llama-4-scout-17b-16e-instruct · Input $0.11/1M · Output $0.34/1M · 131k context

Moonshot AI Kimi K2 Instruct 0905

groq/moonshotai/kimi-k2-instruct-0905 · Input $1/1M · Output $3/1M · 262k context

Mistral AI

27 models · Mistral AI's language models

Mistral Large Latest

mistral-large-latest · Input $0.5/1M · Output $1.5/1M · 256k context

Mistral Large 2512

mistral-large-2512 · Input $0.5/1M · Output $1.5/1M · 256k context

Mistral Small 2603

mistral-small-2603 · Input $0.15/1M · Output $0.6/1M · 256k context

Devstral 2512

devstral-2512 · Input $0.4/1M · Output $2/1M · 256k context

Mistral Large 2411

mistral-large-2411 · Input $2/1M · Output $6/1M · 128k context

Magistral Medium Latest

magistral-medium-latest · Input $2/1M · Output $5/1M · 128k context

Magistral Medium 2509

magistral-medium-2509 · Input $2/1M · Output $5/1M · 128k context

Magistral Small Latest

magistral-small-latest · Input $0.5/1M · Output $1.5/1M · 128k context

Magistral Small 2509

magistral-small-2509 · Input $0.5/1M · Output $1.5/1M · 128k context

Mistral Medium Latest

mistral-medium-latest · Input $0.4/1M · Output $2/1M · 128k context

Mistral Medium 2508

mistral-medium-2508 · Input $0.4/1M · Output $2/1M · 128k context

Mistral Medium 2505

mistral-medium-2505 · Input $0.4/1M · Output $2/1M · 128k context

Mistral Small Latest

mistral-small-latest · Input $0.15/1M · Output $0.6/1M · 256k context

Mistral Small 2506

mistral-small-2506 · Input $0.1/1M · Output $0.3/1M · 128k context

Open Mistral Nemo

open-mistral-nemo · Input $0.15/1M · Output $0.15/1M · 128k context

Codestral Latest

codestral-latest · Input $0.3/1M · Output $0.9/1M · 128k context

Codestral 2508

codestral-2508 · Input $0.3/1M · Output $0.9/1M · 128k context

Devstral Latest

devstral-latest · Input $0.4/1M · Output $2/1M · 256k context

Devstral Small Latest

devstral-small-latest · Input $0.1/1M · Output $0.3/1M · 256k context

Devstral Small 2507

devstral-small-2507 · Input $0.1/1M · Output $0.3/1M · 128k context

Devstral Medium 2507

devstral-medium-2507 · Input $0.4/1M · Output $2/1M · 128k context

Ministral 14B Latest

ministral-14b-latest · Input $0.2/1M · Output $0.2/1M · 256k context

Ministral 14B 2512

ministral-14b-2512 · Input $0.2/1M · Output $0.2/1M · 256k context

Ministral 8B Latest

ministral-8b-latest · Input $0.15/1M · Output $0.15/1M · 256k context

Ministral 8B 2512

ministral-8b-2512 · Input $0.15/1M · Output $0.15/1M · 256k context

Ministral 3B Latest

ministral-3b-latest · Input $0.1/1M · Output $0.1/1M · 256k context

Ministral 3B 2512

ministral-3b-2512 · Input $0.1/1M · Output $0.1/1M · 256k context

AWS Bedrock

32 models · AWS Bedrock foundation models

Anthropic Claude Opus 4.5

bedrock/anthropic.claude-opus-4-5-20251101-v1:0 · Input $5/1M · Output $25/1M · 200k context

Anthropic Claude Sonnet 4.5

bedrock/anthropic.claude-sonnet-4-5-20250929-v1:0 · Input $3/1M · Output $15/1M · 200k context

Anthropic Claude Haiku 4.5

bedrock/anthropic.claude-haiku-4-5-20251001-v1:0 · Input $1/1M · Output $5/1M · 200k context

Anthropic Claude Opus 4.1

bedrock/anthropic.claude-opus-4-1-20250805-v1:0 · Input $15/1M · Output $75/1M · 200k context

Amazon Nova 2 Pro

bedrock/amazon.nova-2-pro-v1:0 · Input $1/1M · Output $4/1M · 1M context

Amazon Nova 2 Lite

bedrock/amazon.nova-2-lite-v1:0 · Input $0.08/1M · Output $0.32/1M · 1M context

Amazon Nova Premier

bedrock/amazon.nova-premier-v1:0 · Input $2.5/1M · Output $10/1M · 1M context

Amazon Nova Pro

bedrock/amazon.nova-pro-v1:0 · Input $0.8/1M · Output $3.2/1M · 300k context

Amazon Nova Lite

bedrock/amazon.nova-lite-v1:0 · Input $0.06/1M · Output $0.24/1M · 300k context

Amazon Nova Micro

bedrock/amazon.nova-micro-v1:0 · Input $0.035/1M · Output $0.14/1M · 128k context

Meta Llama 4 Maverick 17B Instruct

bedrock/meta.llama4-maverick-17b-instruct-v1:0 · Input $0.24/1M · Output $0.97/1M · 1M context

Meta Llama 4 Scout 17B Instruct

bedrock/meta.llama4-scout-17b-instruct-v1:0 · Input $0.18/1M · Output $0.72/1M · 3.5M context

Meta Llama 3 70B Instruct

bedrock/meta.llama3-3-70b-instruct-v1:0 · Input $0.72/1M · Output $0.72/1M · 128k context

Meta Llama 3 2 90B Instruct

bedrock/meta.llama3-2-90b-instruct-v1:0 · Input $2/1M · Output $2/1M · 128k context

Meta Llama 3 2 11B Instruct

bedrock/meta.llama3-2-11b-instruct-v1:0 · Input $0.16/1M · Output $0.16/1M · 128k context

Meta Llama 3 2 3B Instruct

bedrock/meta.llama3-2-3b-instruct-v1:0 · Input $0.15/1M · Output $0.15/1M · 128k context

Meta Llama 3 2 1B Instruct

bedrock/meta.llama3-2-1b-instruct-v1:0 · Input $0.1/1M · Output $0.1/1M · 128k context

Meta Llama 3 1 405B Instruct

bedrock/meta.llama3-1-405b-instruct-v1:0 · Input $5.32/1M · Output $16/1M · 128k context

Meta Llama 3 1 70B Instruct

bedrock/meta.llama3-1-70b-instruct-v1:0 · Input $2.65/1M · Output $3.5/1M · 128k context

Meta Llama 3 1 8B Instruct

bedrock/meta.llama3-1-8b-instruct-v1:0 · Input $0.3/1M · Output $0.6/1M · 128k context

Mistral Large 3 675B Instruct

bedrock/mistral.mistral-large-3-675b-instruct · Input $2/1M · Output $6/1M · 128k context

Mistral Large 2411

bedrock/mistral.mistral-large-2411-v1:0 · Input $2/1M · Output $6/1M · 128k context

Mistral Large 2407

bedrock/mistral.mistral-large-2407-v1:0 · Input $4/1M · Output $12/1M · 128k context

Mistral Pixtral Large 2502

bedrock/mistral.pixtral-large-2502-v1:0 · Input $2/1M · Output $6/1M · 128k context

Mistral Magistral Small 2509

bedrock/mistral.magistral-small-2509 · Input $0.5/1M · Output $1.5/1M · 128k context

Mistral Ministral 3 14B Instruct

bedrock/mistral.ministral-3-14b-instruct · Input $0.2/1M · Output $0.2/1M · 128k context

Mistral Ministral 3 8B Instruct

bedrock/mistral.ministral-3-8b-instruct · Input $0.1/1M · Output $0.1/1M · 128k context

Mistral Ministral 3 3B Instruct

bedrock/mistral.ministral-3-3b-instruct · Input $0.04/1M · Output $0.04/1M · 128k context

Mistral Mixtral 8x7b Instruct

bedrock/mistral.mixtral-8x7b-instruct-v0:1 · Input $0.45/1M · Output $0.7/1M · 32k context

Amazon Titan Text Premier

bedrock/amazon.titan-text-premier-v1:0 · Input $0.5/1M · Output $1.5/1M · 32k context

Cohere Command R Plus

bedrock/cohere.command-r-plus-v1:0 · Input $3/1M · Output $15/1M · 128k context

Cohere Command R

bedrock/cohere.command-r-v1:0 · Input $0.5/1M · Output $1.5/1M · 128k context

Dynamic model catalogs

These providers load their model lists dynamically at runtime.

Frequently asked questions

The most important factors for agent tasks are reliable tool use (function calling), a large enough context window to track conversation history and tool outputs, and consistent instruction following. In Sim, OpenAI GPT-4.1, Anthropic Claude Sonnet, and Google Gemini 2.5 Pro are popular choices — each supports tool use, structured outputs, and context windows of 128K tokens or more. For cost-sensitive or high-throughput agents, Groq and Cerebras offer significantly faster inference at lower cost.

Models

Compare models

Cost

Context window

All models

OpenAI

GPT-4.1

GPT-4.1 Mini

GPT-4.1 Nano

GPT-5.4 Pro

GPT-5.4

GPT-5.4 Mini

GPT-5.4 Nano

GPT-5.2 Pro

GPT-5.2

GPT-5.1

GPT-5 Pro

GPT-5

GPT-5 Mini

GPT-5 Nano

GPT-5 Chat Latest

o4 Mini

o3 Pro

o3

o3 Mini

o1

GPT-4o

Anthropic

Claude Opus 4.6

Claude Sonnet 4.6

Claude Opus 4.5

Claude Opus 4.1

Claude Opus 4.0

Claude Sonnet 4.5

Claude Sonnet 4.0

Claude Haiku 4.5

Claude 3 Haiku

Azure OpenAI

GPT-4o

GPT-5.4

GPT-5.4 Mini

GPT-5.4 Nano

GPT-5.2

GPT-5.1

GPT-5.1 Codex

GPT-5

GPT-5 Mini

GPT-5 Nano

GPT-5 Chat Latest

o3

o4 Mini

GPT-4.1

GPT-4.1 Mini

GPT-4.1 Nano

Model Router

Azure Anthropic

Claude Opus 4.6

Claude Opus 4.5

Claude Sonnet 4.5

Claude Opus 4.1

Claude Haiku 4.5

Google

Gemini 3.1 Pro Preview

Gemini 3.1 Flash Lite Preview

Gemini 3 Flash Preview

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 2.5 Flash Lite

Gemini 2.0 Flash

Gemini 2.0 Flash Lite

Deep Research Pro Preview 12 2025

Vertex AI

Gemini 3.1 Pro Preview

Gemini 3.1 Flash Lite Preview

Gemini 3 Pro Preview

Gemini 3 Flash Preview

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 2.5 Flash Lite

Gemini 2.0 Flash