Question 1

Why does a multi-turn conversation cost more than the sum of its messages?

Accepted Answer

Because LLMs are stateless: the model remembers nothing between calls, so to keep context you must resend the entire conversation history on every turn. Turn 5 pays to process turns 1–4 all over again, plus the new message. The input you are billed for grows with each turn, so total cost rises faster than linearly — roughly quadratically with conversation length. With the defaults, one 5-turn conversation costs $0.039.

Question 2

What exactly grows each turn?

Accepted Answer

Only the input side grows. Each turn you resend the system prompt plus every prior user and assistant message, then add the new user message — so billed input tokens climb turn after turn. Output stays roughly constant per turn (one fresh answer). That is why long conversations are dominated by re-processing old context, not by generating new replies.

Question 3

How can I reduce conversation cost?

Accepted Answer

Three proven levers: cap the history (keep only the last N turns or a running summary instead of the full transcript), cache the system prompt so the fixed part of the context is billed at a fraction of its price, and shorten answers where you can. Summarising old turns into a compact memory is the single biggest win for long sessions — it breaks the quadratic growth.

Question 4

How do I get from one conversation to a monthly bill?

Accepted Answer

Multiply the per-conversation cost by your conversations per day, then by ~30. With the defaults — $0.039 per conversation × 1,000 conversations/day — that is about $39.00/day, or roughly $1,170/month. Change the conversations-per-day input to size your own traffic.

Question 5

Does prompt caching help with conversations?

Accepted Answer

Yes, especially for the fixed system prompt and any long, unchanging context (instructions, knowledge base). Caching re-prices that repeated input at a small fraction, which directly attacks the part of the conversation cost that grows. Model the effect with the cached & batch discount calculator.

Question 6

How current are these prices?

Accepted Answer

The bundled defaults are publicly listed prices verified on Jun 25, 2026, linked to source below. Every field — token sizes, turns, prices, conversations per day — is editable, so the calculator stays correct even if a default goes stale. Always confirm current pricing with the provider.

Turn	Input billed	Output billed	Cumulative input
1	300	300	300
2	700	300	1,000
3	1,100	300	2,100
4	1,500	300	3,600
5	1,900	300	5,500

LLM Cost Per Conversation Calculator

How it works

A worked example

Frequently asked questions