Our approach to rate limits for the Claude API

Your rate limit depends on your usage tier and is currently measured by three metrics:

  1. Requests per minute (RPM)
  2. Input tokens per minute (ITPM)
  3. Output tokens per minute (OTPM)

If you exceed any of these rate limits, you will get a 429 error describing which rate limit was exceeded, along with a retry-after header indicating how long to wait.
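
As a rough illustration of handling a 429, the sketch below waits for the time given in the retry-after header and then tries again. It is not tied to any particular SDK: `send` is a hypothetical callable you supply that performs one request and returns a `(status, headers, body)` tuple, and `parse_retry_after` and `call_with_retry` are names made up for this example. It also assumes the retry-after value is a number of seconds, and falls back to a default wait if it cannot be parsed.

```python
import time
from typing import Callable, Mapping, Tuple

# Hypothetical response shape for this sketch: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], str]


def parse_retry_after(headers: Mapping[str, str], default: float = 1.0) -> float:
    """Return the retry-after wait in seconds, or `default` if absent or unparseable."""
    for name, value in headers.items():
        if name.lower() == "retry-after":  # header names are case-insensitive
            try:
                return max(0.0, float(value))
            except ValueError:
                return default
    return default


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call `send` until it returns a non-429 status or attempts run out."""
    response = send()
    for _ in range(max_attempts - 1):
        status, headers, _body = response
        if status != 429:
            break
        # Rate limited: wait as long as the server asked, then try again.
        sleep(parse_retry_after(headers))
        response = send()
    return response
```

If every attempt is rate limited, the function returns the last 429 response so the caller can decide what to do next. The `sleep` parameter exists only so the wait can be replaced in tests.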

Rate limits are set at the organization level and are defined by usage tiers. Each tier has different spend and rate limits, with automatic tier advancement based on usage thresholds up to Tier 4.

You can view your organization's current tier and limits in the Claude Console.

More information on usage tiers and rate limits can be found in our Claude docs.

