DeepSeek V4 Flash vs Gemini 2.5 Flash: Pricing Comparison

Compare pricing, capabilities, and costs for your LLM workloads.

DeepSeek

DeepSeek V4 Flash

Pricing (per 1M tokens)

Input$0.1400

Output$0.2800

Cached Input$0.002800

Context & Output

Context Window1M tokens

Max Output384K tokens

Capabilities

Categorybudget

Multimodaltext

Fine-tuningNo

StreamingYes

Google

Gemini 2.5 Flash

Pricing (per 1M tokens)

Input$0.3000

Output$2.50

Cached Input$0.0300

Batch Input$0.1500

Batch Output$1.25

Context & Output

Context Window1M tokens

Max Output65.5K tokens

Capabilities

Categorymid

Multimodaltext + image + audio

Fine-tuningNo

StreamingYes

Quick Verdict

Cheaper Input Price

DeepSeek V4 Flash

53.3% cheaper

Cheaper Output Price

DeepSeek V4 Flash

88.8% cheaper

Larger Context Window

Gemini 2.5 Flash

+0 tokens

Cost Comparison

Sample workload: 1,000,000 input tokens + 1,000,000 output tokens

DeepSeek V4 Flash

$0.4200

$0.1400/1M input + $0.2800/1M output

Gemini 2.5 Flash

$2.80

$0.3000/1M input + $2.50/1M output

DeepSeek V4 Flash is 85.0% cheaper for this workload.

Frequently Asked Questions

Which is cheaper, DeepSeek V4 Flash or Gemini 2.5 Flash?

For input tokens, DeepSeek V4 Flash is cheaper at $0.1400 per 1M tokens. For output tokens, DeepSeek V4 Flash is cheaper at $0.2800 per 1M tokens. The overall cost depends on your workload's input/output ratio.

What is the context window size of DeepSeek V4 Flash vs Gemini 2.5 Flash?

DeepSeek V4 Flash has a context window of 1M tokens, while Gemini 2.5 Flash has 1M tokens. DeepSeek V4 Flash supports a larger context window of 1M tokens, which is beneficial for processing longer documents.

Need more tools?

Explore our complete suite of LLM calculators and comparison tools.

Full Pricing Table Cost Estimator Browse All Comparisons