To ensure the stability and fair use of our platform for all users, the Gloo AI API enforces rate limits on incoming requests. These limits define the number of API calls that can be made within a certain period.Documentation Index
Fetch the complete documentation index at: https://docs.gloo.com/llms.txt
Use this file to discover all available pages before exploring further.
How Rate Limiting Works
Rate limiting is applied on a per-API-key basis. The specific limits depend on your organization’s subscription plan. When you exceed the number of allowed requests in a given time window, the API will respond with an HTTP429 Too Many Requests error.
Handling Rate Limits Gracefully
When you receive a429 Too Many Requests error, you should pause your requests until the time specified in the X-RateLimit-Reset header.
A common strategy for handling rate limits is to implement an exponential backoff mechanism. This involves waiting for a progressively longer period between retries.

