
What are the API rate limits?

Agenta applies rate limits per organization to ensure fair usage and system stability. Limits vary by plan and endpoint type.

Each limit has two values, listed in the table below as burst / rate. Burst is the maximum number of requests that can be served at once. Rate is the sustained number of requests allowed per minute.

| Endpoint Type | Example | Free | Pro | Business | Enterprise |
| --- | --- | --- | --- | --- | --- |
| Data retrieval | POST */retrieve | 1,200 / 1,200 per min | 3,600 / 3,600 per min | 36,000 / 36,000 per min | Custom |
| Trace ingestion | POST /otlp/v1/traces | 1,200 / 1,200 per min | 3,600 / 3,600 per min | 36,000 / 36,000 per min | Custom |
| Queries and analytics | POST /tracing/*/query | 120 / 1 per min | 180 / 1 per min | 1,800 / 1 per min | Custom |
| Other endpoints | General API calls | 120 / 120 per min | 360 / 360 per min | 3,600 / 3,600 per min | Custom |
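
A burst value paired with a sustained rate is commonly modeled as a token bucket. Whether Agenta enforces its limits this way is an assumption, not something stated here, but a client-side throttle built on that model can help you stay under a limit. The sketch below is illustrative only: the `TokenBucket` class and its method names are hypothetical and not part of any Agenta SDK.

```python
import time


class TokenBucket:
    """Client-side throttle: holds up to `burst` tokens, refilled at `rate_per_min`.

    Assumption: the server-side limit behaves roughly like a token bucket.
    """

    def __init__(self, burst, rate_per_min, clock=time.monotonic):
        self.capacity = float(burst)
        self.refill_per_sec = rate_per_min / 60.0
        self.clock = clock
        self.tokens = float(burst)  # start full, so a whole burst is available
        self.updated = clock()

    def _refill(self):
        # Add tokens for the time elapsed since the last update, capped at capacity.
        now = self.clock()
        elapsed = now - self.updated
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.updated = now

    def try_acquire(self):
        """Take one token if available; return True when a request may be sent."""
        self._refill()
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

    def seconds_until_next(self):
        """Estimate how long to wait until one token is available."""
        self._refill()
        if self.tokens >= 1.0:
            return 0.0
        return (1.0 - self.tokens) / self.refill_per_sec
```

For example, `TokenBucket(burst=120, rate_per_min=1)` mirrors the Free plan's queries and analytics limit: 120 calls can go out immediately, after which roughly one more becomes available each minute.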

Rate limit response

When you exceed the rate limit, the API returns 429 Too Many Requests with a JSON body like this:

{
  "detail": "Rate limit exceeded. Please retry after 5 seconds."
}

The response includes headers that help you back off:

| Header | Description |
| --- | --- |
| X-RateLimit-Limit | Maximum requests allowed (burst capacity) |
| X-RateLimit-Remaining | Remaining requests in the current window |
| Retry-After | Seconds to wait before retrying. Only returned on 429 responses. |

Successful responses also include X-RateLimit-Remaining.
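
One way to act on these headers is to wait for the number of seconds given in Retry-After before sending the request again. The following is a minimal sketch, not an official client: `retry_delay` and `call_with_retry` are hypothetical helpers, `send` stands in for whatever function performs your HTTP call and returns `(status, headers, body)`, and the one-second fallback delay is an arbitrary assumption for the case where the header is missing or not a number.

```python
import time


def retry_delay(status, headers, default=1.0):
    """Return seconds to wait before retrying, or None when no retry is needed."""
    if status != 429:
        return None
    # Normalize header names so "Retry-After" matches regardless of case.
    lowered = {k.lower(): v for k, v in headers.items()}
    try:
        return max(0.0, float(lowered["retry-after"]))
    except (KeyError, ValueError):
        # Header missing or not a plain number: fall back to an assumed default.
        return default


def call_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status, headers, body).
    The last response is returned even if it is still a 429.
    """
    for attempt in range(1, max_attempts + 1):
        status, headers, body = send()
        delay = retry_delay(status, headers)
        if delay is None or attempt == max_attempts:
            return status, headers, body
        sleep(delay)
```

Capping the number of attempts keeps a client from retrying forever when the limit stays exhausted; the caller can then inspect the final 429 and decide what to do.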