Pick a plan. Pay for tokens.
Three plans, one of them free. Each comes with a monthly pack of tokens. Need more? You roll straight onto pay-as-you-go — billed per token, with spending limits so there are no surprises.
Try Faustine on real work — no card required.
- 1 live assistant
- Prompt caching built in
- Spending limits
- Community support
Ship an assistant your users rely on.
- Unlimited assistants
- Prompt caching + Batch API
- Pay-as-you-go past your pack
- Spending limits per assistant
- Email support
High volume, with controls and support.
- Everything in Pro
- Higher rate limits
- SSO & team roles
- Usage analytics
- Priority support
Custom volume, controls and compliance.
- Everything in Scale
- Custom token volume & limits
- SSO / SAML & SCIM
- Dedicated support & SLA
- VPC or on-prem option
Token packs cover input + output across any model · unused tokens reset monthly · cancel or switch anytime.
Past your pack, just pay per token.
Go over your monthly pack — or skip the plan entirely — and you only pay for the tokens you use, metered by the model. No seats, no minimums.
| Model | Input | Output | Cache write5-min | Cache readhit | Batchin / out |
|---|---|---|---|---|---|
| Faustine Opus 4.8DefaultMost capable · default | $5.75 | $28.75 | $7.19 | $0.58 | $2.88 / $14.38 |
| Faustine Sonnet 4.6Balanced speed & smarts | $3.45 | $17.25 | $4.31 | $0.35 | $1.73 / $8.63 |
| Faustine Haiku 4.5Fastest & most efficient | $1.15 | $5.75 | $1.44 | $0.12 | $0.58 / $2.88 |
| Faustine Fable 5Frontier reasoning | $11.50 | $57.50 | $14.38 | $1.15 | $5.75 / $28.75 |
All prices in USD per 1M tokens. Cache reads are billed at ~10% of input; the Batch API runs at 50% off for non-urgent work.
Token compression, on by default.
Before you’re billed for a single token, Faustine squeezes the request — reusing context that repeats, pruning stale tool output, and summarizing long conversations on the fly. Same answers, a fraction of the tokens, with nothing to configure.
- Prompt caching. Repeated context is reused across requests instead of re-sent and re-billed.
- Context pruning. Stale tool results and dead branches are dropped before they cost you.
- Live compaction. Long histories are summarized automatically as the conversation grows.
The pricing, plainly.
Start free. Today.
Connect your app, switch on the actions you want, and ship the assistant on the free plan. Upgrade only when you outgrow it.