Gemini 2.5 Flash Lite vs Grok 4.1 Fast: Pricing Comparison

Compare pricing, capabilities, and costs for your LLM workloads.

Google

Gemini 2.5 Flash Lite

Pricing (per 1M tokens)

Input: $0.1000

Output: $0.4000

Cached Input: $0.0100

Batch Input: $0.0500

Batch Output: $0.2000

Context & Output

Context Window: 1M tokens

Max Output: 65.5K tokens

Capabilities

Category: budget

Multimodal: text + image + audio

Fine-tuning: No

Streaming: Yes

xAI

Grok 4.1 Fast

Pricing (per 1M tokens)

Input: $0.2000

Output: $0.5000

Cached Input: $0.0500

Context & Output

Context Window: 2M tokens

Max Output: 16K tokens

Capabilities

Category: mid

Multimodal: text + image

Fine-tuning: No

Streaming: Yes

Quick Verdict

Cheaper Input Price: Gemini 2.5 Flash Lite (50.0% cheaper)

Cheaper Output Price: Gemini 2.5 Flash Lite (20.0% cheaper)

Larger Context Window: Grok 4.1 Fast (+1M tokens)

Cost Comparison

Sample workload: 1,000,000 input tokens + 1,000,000 output tokens

Gemini 2.5 Flash Lite: $0.5000 ($0.1000/1M input + $0.4000/1M output)

Grok 4.1 Fast: $0.7000 ($0.2000/1M input + $0.5000/1M output)

Gemini 2.5 Flash Lite is 28.6% cheaper for this workload.
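
The sample-workload arithmetic can be reproduced with a short script. This is a minimal sketch: the prices are copied from the tables on this page, while the names `PRICES`, `workload_cost`, and `percent_cheaper` are illustrative and not part of any provider SDK.

```python
# Standard prices in USD per 1M tokens, copied from this page.
PRICES = {
    "Gemini 2.5 Flash Lite": {"input": 0.10, "output": 0.40},
    "Grok 4.1 Fast": {"input": 0.20, "output": 0.50},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of a workload at the standard (non-cached, non-batch) rates."""
    price = PRICES[model]
    return (input_tokens * price["input"] + output_tokens * price["output"]) / 1_000_000


def percent_cheaper(cheaper_cost: float, other_cost: float) -> float:
    """Return how much cheaper `cheaper_cost` is, as a percentage of `other_cost`."""
    return (other_cost - cheaper_cost) / other_cost * 100


gemini = workload_cost("Gemini 2.5 Flash Lite", 1_000_000, 1_000_000)
grok = workload_cost("Grok 4.1 Fast", 1_000_000, 1_000_000)

print(f"Gemini 2.5 Flash Lite: ${gemini:.4f}")
print(f"Grok 4.1 Fast: ${grok:.4f}")
print(f"Saving: {percent_cheaper(gemini, grok):.1f}%")
```

Changing the two token counts lets you price your own workload in place of the 1M + 1M sample.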

Frequently Asked Questions

Which is cheaper, Gemini 2.5 Flash Lite or Grok 4.1 Fast?

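Gemini 2.5 Flash Lite is cheaper on both counts: $0.1000 vs $0.2000 per 1M input tokens and $0.4000 vs $0.5000 per 1M output tokens. At these standard rates it is therefore cheaper for any input/output ratio; the ratio only determines the size of the saving, from 20.0% for output-only workloads to 50.0% for input-only workloads.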
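
To see how the input/output ratio affects the comparison, the sketch below computes the saving for several input shares. It assumes the standard prices listed on this page and ignores cached and batch rates; `blended_price` and `gemini_saving_percent` are hypothetical helper names.

```python
# Standard prices in USD per 1M tokens, copied from this page.
GEMINI = {"input": 0.10, "output": 0.40}
GROK = {"input": 0.20, "output": 0.50}


def blended_price(prices: dict, input_share: float) -> float:
    """Price per 1M total tokens when `input_share` of the tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * prices["input"] + (1.0 - input_share) * prices["output"]


def gemini_saving_percent(input_share: float) -> float:
    """Saving of Gemini 2.5 Flash Lite relative to Grok 4.1 Fast, in percent."""
    gemini = blended_price(GEMINI, input_share)
    grok = blended_price(GROK, input_share)
    return (grok - gemini) / grok * 100


for share in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(f"input share {share:.2f}: {gemini_saving_percent(share):.1f}% cheaper")
```

An input share of 0.5 corresponds to the sample workload above, with equal input and output token counts.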

What is the context window size of Gemini 2.5 Flash Lite vs Grok 4.1 Fast?

Gemini 2.5 Flash Lite has a context window of 1M tokens, while Grok 4.1 Fast has 2M tokens. The larger window lets Grok 4.1 Fast process longer documents in a single request.
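
As a rough illustration, the check below reports which model can hold a prompt of a given size. It is a sketch under stated assumptions: "1M" and "2M" are treated as exactly 1,000,000 and 2,000,000 tokens, the window is assumed to cover prompt plus reserved output, and the token count is supplied by the caller rather than measured with a real tokenizer. The helper names are hypothetical.

```python
# Context windows in tokens, taken from this page.
# Assumption: "1M" and "2M" mean exactly 1,000,000 and 2,000,000 tokens.
CONTEXT_WINDOW = {
    "Gemini 2.5 Flash Lite": 1_000_000,
    "Grok 4.1 Fast": 2_000_000,
}


def fits_in_context(model: str, prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus the reserved output fits in the model's window."""
    return prompt_tokens + reserved_output_tokens <= CONTEXT_WINDOW[model]


def models_that_fit(prompt_tokens: int, reserved_output_tokens: int = 0) -> list:
    """List the models whose context window can hold the request."""
    return [
        model
        for model in CONTEXT_WINDOW
        if fits_in_context(model, prompt_tokens, reserved_output_tokens)
    ]


print(models_that_fit(500_000))
print(models_that_fit(1_500_000))
```

Actual token counts depend on each provider's tokenizer, so the same document may come out at a different size for each model.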
