Gemini 3.1 Flash Lite Preview
Google's mid-tier model. Long-context specialist with 1.0M window.
Scores by test
Methodology →What you need to know
Gemini 3.1 Flash Lite Preview is positioned as a high-efficiency model with a massive 1.0M token context window. Its primary differentiator is the ability to maintain strict persona consistency, safety calibration, and multilingual accuracy, all while operating at a low cost of $0.25 per million input tokens.
The model demonstrates a high level of reliability in structured output, tabular data processing, and strategic analysis. While it ranks 15th overall out of 77 models, its performance is uneven; it excels in faithfulness and complex analysis but shows a relative weakness in simple classification tasks.
At a blended cost of $1.19 per million tokens, this model is a bargain for developers requiring long-context capabilities and rigid adherence to formatting or personas. The cost-to-performance ratio is favorable for high-volume applications that do not rely heavily on precise classification.
Use this model if your application requires processing massive documents, maintaining a specific brand voice, or handling multilingual data on a budget. Skip this model if your primary use case is high-accuracy classification or requires the absolute highest precision in constrained rewriting.
Strengths — Top 3
Relative weaknesses — Bottom 3
Similar models