OpenAI/Anthropic API bill is running up unexpectedly
Token prices look low until someone builds a loop or sends a huge document with every request. A few checks catch 90% of it.
Try this first
- 1Set usage limits in the provider dashboard. Both have spend alerts and monthly hard caps at organisation and API-key level, but the granularity differs and changes often, verify in the current Console before relying on it.
- 2Track which app uses which key. One key per app/team helps you find the spiker.
- 3Long context (RAG with large documents) is more expensive than people expect. Cache prompts where possible and trim context to what is needed.
- 4With agents: a loop without a limit can burn a few hundred euros in an hour. Hard-code max iterations or max tokens.
- 5Calculate dollars and FX, that is where a surprise hides if USD moves.
When to bring us in
Suddenly 10x normal usage and you cannot see why: revoke the key, rotate it, and call us for log analysis.
See also
- Can I paste a customer file or email into ChatGPT?Depends on the account and settings. Free ChatGPT and a Team tenant behave very differently from what most people assume.
- I want a one-page AI policy for my teamA real one-pager beats a thick document nobody reads. Four headers and concrete examples.
- How do I tell if an AI answer is made up?Models sound confident even when they are wrong. A few habits catch most mistakes.
None of the above fits?
Describe your situation below. We pass your input plus the steps you already saw to our AI and return tailored next-step advice. If it's too risky to DIY, we'll say so.
Or skip the DIY entirely
Our Managed IT clients do not look these things up. One point of contact, a fixed monthly price, resolved within working hours.