429 with the standard error envelope until the next second’s budget opens up.
What the limit actually means
- It’s per key, not per IP or per workspace. Two integrations sharing one key share one budget, which is the practical argument for one key per integration.
- There is no daily quota. The limit is a rate. Stay under 10 in any given second and you can run at that pace all day.
- Parallel workers count together. Five workers each making 3 requests per second on the same key will hit the limit constantly.
Staying under it
- Ask for more per request. Lists return 10 items by default but will give you up to 50.
limit=50cuts a bulk read’s request count by 5x. See pagination. - Cache reads that rarely change. Your user list doesn’t change every second. Fetch it on startup or on an interval measured in minutes, not in a loop.
- Smooth your bursts. A client-side throttle (most HTTP libraries have one) turns a thundering herd into a queue, and a queue never sees a
429. - Back off on
429. When you do hit the limit, retry with exponential backoff and jitter. Immediate retries from several workers arrive together and fail together.