Rate limits
To ensure consistent performance and fair use of the LawyerGPT API during the Beta phase, we enforce strict rate limits per API key. These limits may be adjusted as we monitor performance
Current Rate Limits (Beta Tier)
/ask
60 requests
per minute
/legal-summary
30 requests
per minute
/generate-document
15 requests
per minute
/status
120 requests
per minute
Note: Exceeding these limits will result in a 429 Too Many Requests
response. Please implement retry logic in your application.
Response Headers
To help you manage your usage, each API response includes rate-limit headers:
X-RateLimit-Limit: 60
X-RateLimit-Remaining: 12
X-RateLimit-Reset: 1685583492
X-RateLimit-Limit — Max requests allowed in the window
X-RateLimit-Remaining — Requests left in the current window
X-RateLimit-Reset — Timestamp (in UNIX format) when the limit resets
Retry Strategy (Recommended)
When your app hits a rate limit, use exponential backoff with jitter to prevent overwhelming the API. Avoid making aggressive retries immediately after failure.
Example (Pseudocode):
if response.status_code == 429:
wait_time = get_retry_after(response)
sleep(wait_time + random_jitter)
Upgraded Plans
Higher rate limits will be available in the following:
✅ Pro Tier (Coming Soon) — Higher burst capacity & concurrent sessions
✅ Enterprise Tier — Custom SLAs and priority support
Join the waitlist to request priority access when tiered plans go live.
Last updated