Question 1

How accurate is this token count estimate?

Accepted Answer

The estimate is approximate — accurate to within 10-20% for typical English text. Tokenisation varies by model: GPT-4 uses cl100k_base, Claude uses its own tokeniser, Gemini uses another. For exact counts, use the model provider's official tokeniser (tiktoken for OpenAI, or the API's token counting endpoint). This tool is best for budgeting and rough planning.
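To illustrate where a rough estimate like this can come from, here is a minimal sketch based on the common rule of thumb of about four characters per token for English text, with the 20% margin mentioned above. The function name `estimate_tokens` and the four-character ratio are assumptions for illustration; this tool's actual method is not described here.

```python
import math

# Assumed rule of thumb for English text; real tokenisers will differ.
CHARS_PER_TOKEN = 4.0


def estimate_tokens(text: str, margin: float = 0.20) -> tuple[int, int, int]:
    """Return (low, estimate, high) token counts for the given text."""
    estimate = math.ceil(len(text) / CHARS_PER_TOKEN)
    low = math.floor(estimate * (1 - margin))
    high = math.ceil(estimate * (1 + margin))
    return low, estimate, high


if __name__ == "__main__":
    low, mid, high = estimate_tokens("How accurate is this token count estimate?")
    print(f"about {mid} tokens (likely between {low} and {high})")
```

Reporting a range rather than a single number keeps the uncertainty visible when budgeting.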

Question 2

Why does code have more tokens than equivalent English prose?

Accepted Answer

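Code contains more punctuation, indentation, and unusual character combinations than prose, and tokenisers tend to split these into more pieces. A line of Python like 'for item in items:' has four words but encodes to about five tokens with a tokeniser such as cl100k_base, because the trailing colon becomes its own token. Lines with brackets, quotes, operators, or long identifiers split further, whereas common English words usually map to one token each. Compact code also packs a lot of meaning into few words, so its token count per word is higher than that of prose.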
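The effect of punctuation can be seen with a crude stand-in for a tokeniser. The sketch below counts each run of word characters and each punctuation character as one piece. This is only an approximation for illustration: real BPE tokenisers merge and split differently, and the names `count_pieces` and `count_words` are hypothetical.

```python
import re

# Crude stand-in for a tokeniser: every run of word characters and every
# single punctuation character counts as one piece. Not real BPE.
PIECE_PATTERN = re.compile(r"\w+|[^\w\s]")


def count_pieces(text: str) -> int:
    """Count word runs and punctuation characters in text."""
    return len(PIECE_PATTERN.findall(text))


def count_words(text: str) -> int:
    """Count whitespace-separated words in text."""
    return len(text.split())


if __name__ == "__main__":
    samples = [
        "for item in items:",
        'result = data[i]["key"]',
        "the quick brown fox jumps",
    ]
    for sample in samples:
        print(f"{sample!r}: {count_words(sample)} words, "
              f"{count_pieces(sample)} pieces")
```

The prose sample yields one piece per word, while the indexing expression yields several pieces per word because every bracket and quote counts separately.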

Question 3

What is a context window?

Accepted Answer

The context window is the maximum amount of text (measured in tokens) a model can process at once — both your input and its output. GPT-4 Turbo has 128K tokens. Claude 3 has 200K. Gemini 1.5 Pro has 1M tokens. Your prompt plus the model's response must fit within this limit. Longer context enables more complex tasks but costs more.
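The rule that prompt plus response must fit within the limit can be sketched as a simple budget check. The window sizes below are the rounded figures quoted above; the dictionary keys are illustrative labels, not official API model identifiers, and the function names are hypothetical.

```python
# Rounded context window sizes, in tokens, as quoted in the answer above.
# Keys are illustrative labels, not official API model identifiers.
CONTEXT_WINDOWS = {
    "gpt-4-turbo": 128_000,
    "claude-3": 200_000,
    "gemini-1.5-pro": 1_000_000,
}


def fits(model: str, prompt_tokens: int, max_output_tokens: int) -> bool:
    """True if the prompt plus the planned response fit in the window."""
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOWS[model]


def remaining_output_budget(model: str, prompt_tokens: int) -> int:
    """Tokens left for the response after the prompt, never below zero."""
    return max(CONTEXT_WINDOWS[model] - prompt_tokens, 0)


if __name__ == "__main__":
    print(fits("gpt-4-turbo", 120_000, 4_000))
    print(remaining_output_budget("claude-3", 150_000))
```

Providers may also impose a separate cap on response length, which this sketch ignores.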

Question 4

How do I reduce token usage in my prompts?

Accepted Answer

Be concise: remove filler phrases and unnecessary context. Prefer compact structure, such as short numbered lists, over long prose. Be aware that heavily nested or quoted JSON can cost more tokens than the same content in plain text, because punctuation tends to split into extra tokens. Summarise long documents rather than pasting them in full. Avoid repeating the same instructions in multiple ways. Remove examples once the model demonstrates understanding.
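Some of this clean-up can be automated. The sketch below collapses extra whitespace and drops lines that repeat an earlier line exactly (ignoring case). It is a minimal illustration only: the function name `tidy_prompt` is hypothetical, and it cannot detect instructions that are repeated in different words.

```python
import re


def tidy_prompt(prompt: str) -> str:
    """Collapse extra whitespace and drop exact repeated lines, keeping order."""
    seen = set()
    kept = []
    for line in prompt.splitlines():
        # Squeeze runs of spaces and tabs into a single space.
        normalised = re.sub(r"\s+", " ", line).strip()
        if not normalised:
            continue  # skip blank lines
        key = normalised.lower()
        if key in seen:
            continue  # skip a line already seen earlier in the prompt
        seen.add(key)
        kept.append(normalised)
    return "\n".join(kept)


if __name__ == "__main__":
    raw = "Answer briefly.\n\nAnswer   briefly.\nUse plain text."
    print(tidy_prompt(raw))
```

Rewording filler phrases and summarising long documents still need manual editing or a separate summarisation step.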

Question 5

Why do different models charge different prices per token?

Accepted Answer

Larger models (more parameters) require more computation per token, so providers generally charge more for them. GPT-4 costs more than GPT-3.5 because it is larger and more capable. Input tokens (your prompt) are usually cheaper than output tokens (the model's response) because the prompt can be processed in parallel, while each output token is generated one at a time and needs its own pass through the model.
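Because input and output tokens are priced separately, a cost estimate is a weighted sum of the two counts. The sketch below assumes prices are quoted per million tokens; the function name `estimate_cost` and the prices in the usage example are placeholders for illustration, not current rates for any provider.

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float,
) -> float:
    """Estimated request cost, in the same currency as the prices given."""
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


if __name__ == "__main__":
    # Placeholder prices: 10.00 per million input tokens,
    # 30.00 per million output tokens.
    cost = estimate_cost(1_000, 500, 10.0, 30.0)
    print(f"estimated cost: {cost:.4f}")
```

With these placeholder prices, 1,000 input tokens and 500 output tokens come to about 0.025, of which the shorter response accounts for the larger share.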

LLM Token Counter & API Cost Estimator

What Is a Token in LLMs

Why Token Counting Matters

Frequently Asked Questions
