Use-case cost estimate
How Much Does an AI Chatbot Cost?
This example models one language-model response per conversation. Production chatbots often add conversation history, retrieval, moderation, retries, and observability.
Default workload assumptions
These values make the example reproducible. They are planning assumptions, not measured usage from your application.
- Input tokens per conversation
- 1,000
- Output tokens per conversation
- 300
- Conversations per month
- 1,000
- Example model
- GPT-4.1 mini
Calculator-style cost example
Estimate- Chat completion usage
- $0.881,000 conversations per month
Estimated monthly cost
$0.88
Estimated yearly cost
$10.56
GPT-4.1 mini
Last verified Jun 21, 2026 · OpenAI model pricing
Formula
Main cost drivers
- Conversation history included on later turns
- Response length and retry frequency
- Retrieved context and tool definitions
- Active conversations rather than registered users
Ways to reduce cost
- Summarize or trim old conversation history
- Cap response length where the product permits
- Route simple tasks to a lower-cost evaluated model
- Cache stable prompt prefixes when eligible
Frequently asked questions
Does this include every message in a conversation?
No. The example treats one conversation as one model request. Multiply by average turns or use the calculator when a conversation contains several calls.
Does it include hosting and vector search?
No. The displayed provider estimate covers the listed model usage only.
Related pricing pages
Related glossary terms
Input tokens
Input tokens are the tokenized units sent to a model, including instructions, user content, conversation history, retrieved context, and tool definitions.
OpenRequests per day
Requests per day is the number of billable API calls made during a day. TokenMath commonly derives it from requests per active user multiplied by active users.
OpenCost per request
Cost per request is the sum of all billable usage generated by one API call, commonly input token cost plus output token cost for a text model.
Open