Gemini 2.5 Flash vs Magistral Small: Pricing Comparison

Compare pricing, capabilities, and costs for your LLM workloads.

Google

Gemini 2.5 Flash

Pricing (per 1M tokens)

Input$0.3000

Output$2.50

Cached Input$0.0300

Batch Input$0.1500

Batch Output$1.25

Context & Output

Context Window1M tokens

Max Output65.5K tokens

Capabilities

Categorymid

Multimodaltext + image + audio

Fine-tuningNo

StreamingYes

Mistral

Magistral Small

Pricing (per 1M tokens)

Input$0.5000

Output$1.50

Context & Output

Context Window40K tokens

Max Output16.4K tokens

Capabilities

Categorymid

Multimodaltext

Fine-tuningNo

StreamingYes

Quick Verdict

Cheaper Input Price

Gemini 2.5 Flash

40.0% cheaper

Cheaper Output Price

Magistral Small

40.0% cheaper

Larger Context Window

Gemini 2.5 Flash

+960K tokens

Cost Comparison

Sample workload: 1,000,000 input tokens + 1,000,000 output tokens

Gemini 2.5 Flash

$2.80

$0.3000/1M input + $2.50/1M output

Magistral Small

$2.00

$0.5000/1M input + $1.50/1M output

Magistral Small is 28.6% cheaper for this workload.

Frequently Asked Questions

Which is cheaper, Gemini 2.5 Flash or Magistral Small?

For input tokens, Gemini 2.5 Flash is cheaper at $0.3000 per 1M tokens. For output tokens, Magistral Small is cheaper at $1.50 per 1M tokens. The overall cost depends on your workload's input/output ratio.

What is the context window size of Gemini 2.5 Flash vs Magistral Small?

Gemini 2.5 Flash has a context window of 1M tokens, while Magistral Small has 40K tokens. Gemini 2.5 Flash supports a larger context window of 1M tokens, which is beneficial for processing longer documents.

Need more tools?

Explore our complete suite of LLM calculators and comparison tools.

Full Pricing Table Cost Estimator Browse All Comparisons