Google

Gemini 3.1 Flash-Lite

google/gemini-3.1-flash-lite

visionchatreasoningaffordablelong-context

Input price

149.1 RODI/M

~ 0.25 USD/M

Output price

894.2 RODI/M

~ 1.5 USD/M

Context

1M

Max output

Input:textimagevideoaudiodocument
Output:text

Pricing

RateRODIUSD (ref.)Unit
In149.1~ 0.250USD/M · RODI/M
Out894.2~ 1.50USD/M · RODI/M
Cached15.0~ 0.0250USD/M · RODI/M
Audio in298.1~ 0.500USD/M · RODI/M

RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.

Capabilities

Streaming
Tool calling
Vision
JSON mode
Reasoning

About this model

Most cost-efficient model in Gemini API. Moved from preview to GA on May 7, 2026. Optimized for high-volume agentic tasks, translation, and simple data processing. Audio input priced 2x higher than text/image/video. Flat pricing regardless of context length. via Google Gemini API (direct).

API usage

Use the canonical model slug in your chat completion requests.

Shell / scripts:

Chat completions docs →