Token-to-Word Converter

Convert between tokens and words instantly.

Quickly convert between LLM tokens and words with this bidirectional converter. Supports English text, code, and CJK languages. See how many words fit in popular context windows from GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro.

Ratio: 1 token ≈ 0.75 words (4 chars)

Reference Scale

How Tokens Relate to Words

In English text, one token represents approximately 0.75 words, or about 4 characters. This means 1,000 tokens is roughly 750 words, and a 10,000-word document is approximately 13,333 tokens. The ratio varies by content type: code uses more tokens per word due to syntax characters and variable names, while CJK languages (Chinese, Japanese, Korean) use roughly 0.5 words per token because characters carry more meaning per unit.

Context Window Sizes in Words

Modern LLMs offer context windows ranging from 128K to over 1 million tokens. GPT-5.4 and GPT-5.2 support 1,050,000 tokens (approximately 787,000 words). Claude Opus 4.6 supports 200,000 tokens (about 150,000 words). Gemini 3.1 Pro supports 1,000,000 tokens (roughly 750,000 words). Understanding these limits in words helps you gauge whether your document, codebase, or conversation history will fit.

Frequently Asked Questions

How many tokens is 1,000 words?
In English text, 1,000 words is approximately 1,333 tokens. For code, 1,000 words of code is roughly 1,667 tokens due to syntax characters. For CJK languages, 1,000 words is approximately 2,000 tokens.
How many words fit in a 128K context window?
A 128,000 token context window holds approximately 96,000 words of English text. That is equivalent to a full-length novel or a 380-page document. Models with 128K context windows include GPT-4o, DeepSeek-R1, and Mistral Large.
What is the token-to-word ratio for code?
Code uses approximately 1 token per 3.5 characters, which works out to roughly 0.6 words per token. This means code requires about 67% more tokens per word than English prose due to punctuation, operators, and short variable names.
How many words is 1 million tokens?
One million tokens is approximately 750,000 words of English text. That is equivalent to roughly 10 full-length novels, 3,000 pages, or several complete textbooks. GPT-5.4 and Gemini 3.1 Pro both offer context windows approaching or exceeding 1 million tokens.

Token-to-word ratios are approximations based on average English text. Actual ratios vary by content, vocabulary complexity, and the specific tokenizer used by each model.