Question 1

How is the cost calculated?

Accepted Answer

Monthly cost = (input tokens per request × input price + output tokens per request × output price) ÷ 1,000,000 × requests per month. The division by 1,000,000 is there because prices are quoted per million tokens. When prompt caching is enabled, the cached share of your input tokens is billed at the cache-read rate (~10% of the normal input price for Anthropic models), which is where most agent savings come from.
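As a rough illustration of the formula above, here is a minimal Python sketch. The function name, parameter names, and the example prices are assumptions made for illustration, not the calculator's actual code, and the sketch ignores any cache-write charges a provider may apply.

```python
def monthly_cost(
    input_tokens: float,            # input tokens per request
    output_tokens: float,           # output tokens per request
    input_price: float,             # price per million input tokens
    output_price: float,            # price per million output tokens
    requests_per_month: float,
    cached_share: float = 0.0,      # fraction of input tokens read from cache (0..1)
    cache_read_ratio: float = 0.10, # assumed cache-read rate relative to the input price
) -> float:
    """Estimate monthly spend; a sketch of the FAQ formula, not a billing tool."""
    cached = input_tokens * cached_share
    uncached = input_tokens - cached
    # Cached input tokens are billed at a fraction of the normal input price.
    input_cost = uncached * input_price + cached * input_price * cache_read_ratio
    output_cost = output_tokens * output_price
    per_request = (input_cost + output_cost) / 1_000_000
    return per_request * requests_per_month


if __name__ == "__main__":
    # Placeholder prices (3.00 in, 15.00 out per million tokens), not real quotes.
    no_cache = monthly_cost(2000, 500, 3.00, 15.00, 10_000)
    with_cache = monthly_cost(2000, 500, 3.00, 15.00, 10_000, cached_share=0.8)
    print(f"no caching:   {no_cache:.2f}")
    print(f"80% cached:   {with_cache:.2f}")
```

With these placeholder numbers, caching 80% of the input lowers the monthly estimate from roughly 135 to roughly 92, and the whole saving comes from the input side of the bill.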

Question 2

How many tokens is one word?

Accepted Answer

About 1.3 tokens per word in English (roughly 0.75 words per token). A 500-word prompt is ~650 tokens; a dense 2,000-word document is ~2,700 tokens. Non-English text and code usually need more tokens per word.
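The rule of thumb above can be turned into a quick estimator. This is only a sketch: the function names are hypothetical, the 1.3 ratio is the English-prose approximation from this answer, and a real tokenizer will give different counts.

```python
TOKENS_PER_WORD = 1.3  # rule-of-thumb ratio for English prose; an approximation


def tokens_from_word_count(words: int, tokens_per_word: float = TOKENS_PER_WORD) -> int:
    """Approximate token count for a given number of words."""
    return round(words * tokens_per_word)


def estimate_tokens(text: str, tokens_per_word: float = TOKENS_PER_WORD) -> int:
    """Approximate token count for a text, splitting words on whitespace."""
    return tokens_from_word_count(len(text.split()), tokens_per_word)


if __name__ == "__main__":
    print(tokens_from_word_count(500))
    print(estimate_tokens("How many tokens is one word?"))
```

Dividing by 0.75 instead of multiplying by 1.3 gives slightly higher figures (2,000 words comes out near 2,667 rather than 2,600), which is why the two example numbers in the answer do not follow exactly the same ratio. For code or non-English text, pass a larger `tokens_per_word`.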

Question 3

Are these prices current?

Accepted Answer

Prices are hard-coded as of 2026-06-29 and shown per million tokens. Model pricing changes — always confirm against the provider's live pricing page before committing a budget. You can also edit the input/output price fields directly to model a custom or negotiated rate.

Question 4

When does Haiku beat Sonnet, or Sonnet beat Opus?

Accepted Answer

For high-volume, well-scoped steps (classification, extraction, routing, short replies) the cheapest capable model usually wins on cost-per-outcome. Reserve the frontier model for the few steps that genuinely need its reasoning. The calculator makes that trade-off visible — model the same workload across all three tiers and compare the monthly totals.
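To make the comparison concrete, the sketch below prices one workload across three tiers. The tier names and all prices are placeholders invented for illustration, not real rates for any model, so substitute current figures from the provider's pricing page before drawing conclusions.

```python
# Placeholder (input_price, output_price) per million tokens.
# These are illustrative values, NOT real model prices.
PLACEHOLDER_TIERS = {
    "small": (1.00, 5.00),
    "mid": (3.00, 15.00),
    "frontier": (15.00, 75.00),
}


def monthly_cost(input_tokens, output_tokens, input_price, output_price, requests):
    """Monthly spend for one workload, with prices quoted per million tokens."""
    per_request = (input_tokens * input_price + output_tokens * output_price) / 1_000_000
    return per_request * requests


def compare_tiers(input_tokens, output_tokens, requests, tiers=PLACEHOLDER_TIERS):
    """Return {tier name: monthly cost} for the same workload on every tier."""
    return {
        name: monthly_cost(input_tokens, output_tokens, in_price, out_price, requests)
        for name, (in_price, out_price) in tiers.items()
    }


if __name__ == "__main__":
    # Hypothetical classification step: 1,500 tokens in, 300 out, 50,000 requests.
    totals = compare_tiers(1500, 300, 50_000)
    for name, cost in sorted(totals.items(), key=lambda item: item[1]):
        print(f"{name:<10}{cost:>10.2f}")
```

The totals show only the price side of the trade-off. Whether the cheapest tier is actually capable enough for a given step still has to be checked against your own quality measurements.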

What will your AI agent actually cost?

