Embeddings Cost Calculator
Compare embedding model pricing across OpenAI, Voyage AI, and Cohere.
The cost of creating vector embeddings depends on the model and your total token volume. OpenAI text-embedding-3-small costs $0.020 per million tokens — the cheapest OpenAI option. OpenAI text-embedding-3-large costs $0.130 per million tokens and delivers higher retrieval accuracy for semantic search. Voyage AI's voyage-3-lite starts at $0.010 per million tokens, making it the lowest-cost option for high-volume pipelines. Enter your corpus size below to calculate one-time and recurring embedding costs across all models — no signup required.
Total tokens to embed: 5,000,000
| Model | Provider | Price/1M | One-Time Cost | Dimensions |
|---|---|---|---|---|
| text-embedding-3-small | OpenAI | $0.0200 | $0.1000 | 1,536 |
| text-embedding-3-large | OpenAI | $0.1300 | $0.6500 | 3,072 |
| Cohere embed-v3 | Cohere | $0.1000 | $0.5000 | 1,024 |
| Voyage 3 | Voyage AI | $0.0600 | $0.3000 | 1,024 |
How embedding costs are calculated
Embedding cost is determined by two variables: the total number of tokens in your corpus and the per-token price of the embedding model. Unlike LLM generation costs — which have separate input and output rates — embedding models charge a single flat rate per token processed. There is no output price because embeddings do not generate text; they convert text into a fixed-length vector. To estimate your total tokens, use the ratio of approximately 1 token per 4 characters of English text (or 250,000 tokens per 1MB of plain text). A corpus of 100,000 average documents — each 500 words / 3,750 characters — contains roughly 94 million tokens. At $0.020 per million tokens, that costs $1.88 to embed with text-embedding-3-small. The same corpus costs $12.22 with text-embedding-3-large. If you re-embed your corpus — for example, to update it with new content or switch to a newer model — that is an additional full-cost pass. Build re-embedding frequency into your budget: weekly re-embedding of a 100M-token corpus costs $2 per run with text-embedding-3-small, $13 per run with text-embedding-3-large.
Choosing the right embedding model
The three practical choices for most production RAG pipelines are OpenAI text-embedding-3-small, Voyage AI voyage-3, and Cohere embed-v4.0. Here is how they compare on the dimensions that matter: **OpenAI text-embedding-3-small** ($0.020/M tokens) is the default choice for teams already on the OpenAI platform. It is performant on standard retrieval benchmarks, requires no additional vendor relationship, and is priced aggressively for its quality tier. The main limitation is that it produces 1536-dimensional vectors — larger than some alternatives, which increases vector storage costs at scale. **Voyage AI voyage-3-lite** ($0.010/M tokens) is the cheapest option for pure throughput. It trades some retrieval accuracy for a 50% cost reduction versus text-embedding-3-small. Best fit: high-volume pipelines where retrieval precision is less critical than cost, such as broad content discovery or deduplication. **Voyage AI voyage-3** ($0.060/M tokens) outperforms text-embedding-3-small on most domain-specific benchmarks — particularly code, legal, and scientific text. If your RAG quality matters and you are not locked into OpenAI infrastructure, voyage-3 is the best quality-per-dollar option in most evaluations. **OpenAI text-embedding-3-large** ($0.130/M tokens) delivers the highest retrieval accuracy of the mainstream options but at 6.5× the cost of text-embedding-3-small. Justified only for applications where retrieval precision directly drives revenue — such as enterprise search, legal discovery, or high-stakes recommendation engines.
Frequently Asked Questions
How much does OpenAI text-embedding-3-small cost per million tokens?
What is the cheapest embedding model?
How many tokens are in 1MB of plain text?
What is the difference between text-embedding-3-small and text-embedding-3-large?
How much does it cost to embed 1 million documents?
Do I pay again if I re-embed my corpus?
How does Voyage AI's pricing compare to OpenAI for embeddings?
All pricing data sourced from official provider pricing pages and verified manually. Prices subject to change — this calculator updates with each monthly pricing refresh. Token estimates use standard approximations: 1 token ≈ 4 characters (English), 1 token ≈ 3.5 characters (code), 1 token ≈ 2.5 characters (CJK). Verify exact token counts with provider tokenizers for production cost planning.