Back to all models
google
Google: Gemini 3.1 Flash Lite Preview
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.
google/gemini-3.1-flash-lite-preview
Context Size
1,048.576K
Input Price
1,537.5 Ks/M
Output Price
9,225 Ks/M
Architecture
Text
Image
Video
File
Audio
Supported Parameters
include_reasoningmax_tokensreasoningresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_p
Details
TokenizerGemini
Max Completion65,536 tokens
Provider Context1,048.576K tokens
ModeratedNo
Image Price1,537.5 Ks/M