Vertex AI model

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is a Vertex AI model tracked in Sim. It supports a 1.05M token context window. Pricing starts at $0.1/1M input tokens and $0.4/1M output tokens. Key capabilities include Temperature 0-2, Tool choice, Structured outputs. Best for long-context retrieval, large documents, and high-memory workflows.

Input price$0.1/1M
Cached input$0.01/1M
Output price$0.4/1M
Context window1.05M
Max outputNot published
ProviderVertex AI
UpdatedApr 1, 2026
Best forBest for long-context retrieval, large documents, and high-memory workflows.
Temperature0 to 2
Reasoning effortNot supported
VerbosityNot supported
Thinking levelsNot supported
Structured outputsSupported
Tool choiceSupported
Computer useNot supported
Deep researchNot supported
Memory supportSupported
Max output tokensNot published

Frequently asked questions

Gemini 2.5 Flash Lite is a Vertex AI model available in Sim. Gemini 2.5 Flash Lite is a Vertex AI model tracked in Sim. It supports a 1.05M token context window. Pricing starts at $0.1/1M input tokens and $0.4/1M output tokens. Key capabilities include Temperature 0-2, Tool choice, Structured outputs.