Skip to main content

AI API cost glossary

Cost per request

Cost per request is the sum of all billable usage generated by one API call, commonly input token cost plus output token cost for a text model.

Why it matters for API cost

It is the bridge between a model's unit rates and product economics. Once measured, it can drive budget limits, pricing tiers, and gross-margin analysis.

Formula

cost per request = input cost + output cost + other billable units

Example

A $0.001 request used 100,000 times per month produces roughly $100 in modeled provider usage.

Frequently asked questions

Should failed requests be included?

Include any failed or retried call that can still create provider usage. Billing behavior depends on the failure stage and provider.