No monthly fee. No minimums. No commitment. Drop in a card, run one call, pay for one call. Cancel any time. The free sandbox tier never expires.
Each text tier has its own input and output rate. Pick the tier that matches the job; you're billed for the tokens that move through the API.
| Model | Input | Output | Best for |
|---|---|---|---|
| qai-hello-world | Free | Free | Free sandbox tier (100/key/day cap) |
| qai-flash | $0.20 | $0.50 | Cheap and fast (per 1M tokens) |
| qai-pro | $0.80 | $2.00 | Balanced default (per 1M tokens) |
| qai-max | $1.50 | $5.00 | Top capability (per 1M tokens) |
| qai-think | $1.80 | $8.00 | Reasoning (per 1M tokens) |
Imagine you are running a chatbot at 100,000 calls per month, with an average of 400 input tokens and 200 output tokens per call. That is 40M input tokens and 20M output tokens monthly. Here is the bill on each text tier:
Both image tiers bill per image returned. If you set n: 4, you get four images and you are billed for four. Permanent CDN hosting included when you pass hostMedia: true.
| Model | Price | Best for |
|---|---|---|
| qai-imagine-turbo | $0.04 | Real-time, batch, preview workflows |
| qai-imagine-quality | $0.08 | Studio-grade detail and photorealism |
If you generate a thousand images monthly, here is what that runs:
Billed on the duration returned, not the duration requested. A 6-second clip bills 6 seconds; a 4-second clip bills 4. Hosted on the Qai CDN out of the box.
| Model | Price | Best for |
|---|---|---|
| qai-motion | $0.18 | Up to 6-second clips, 832x480 |
Roughly 16 six-second clips, or 33 three-second clips.
Stateless helpers like Clean JSON and Humanize Text are free for every account. They do not consume LLM tokens. Rate-limited per key on the same account-wide allowance as your other endpoints.
POST /v1/utilities/clean-json — Free. Strips markdown fences, fixes trailing commas, converts Python booleans, recovers valid JSON from messy LLM output.
POST /v1/utilities/humanize-text — Free in default mode (typography cleanup). Optional naturalizeWording mode uses a small text model and is billed at qai-flash rates.
Three layers of cost control to make sure you never get a surprise invoice.
Each API key can have a daily and a monthly spend cap configured from the dashboard. When the cap is hit, that key starts returning 402 Payment Required. Your other keys keep working. You get an email at 50%, 80%, and 100% of the cap.
Your dashboard shows running spend in real time, broken down by model, by API key, and by time range. Spot a runaway cron job an hour after it starts, not at the end of the billing cycle.
qai-hello-world is unlimited in development and capped at 100 calls per key per UTC day in production. Use it during integration work so your real spend only kicks in when you go to production.
Sign up free. The sandbox tier lets you test your integration without spending a cent.
Get your API key