Requesting RPM Increase (40 to 200 RPM) for Multi-Agent Workflows and RAG Development

Hello NVIDIA Support Team,

I would like to request a rate limit increase for my NVIDIA NIM API account.

Account Details

Use Case
I am using the NVIDIA NIM API for personal development, security auditing, and non-production testing of advanced multi-agent AI workflows. My setup focuses on local automation, system architecture, and specialized data engineering tasks.

A typical workflow involves:

Models evaluated in this workflow:

Why 40 RPM is Insufficient
Even with client-side throttling and exponential backoff in place, a single agentic execution loop (plan, test, read files, self-correct) issues dozens of concurrent API calls within a few seconds. At the default global limit of 40 RPM, these bursts immediately return 429 “Too Many Requests” errors, which break the autonomous agent loops mid-execution.
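
For context, here is a minimal sketch of the kind of client-side throttling and exponential backoff I mean. It is illustrative only and not my exact code: the names SlidingWindowLimiter, backoff_delay, and call_with_retry are hypothetical, and `send` stands in for whatever function performs the actual request and returns an HTTP status code. It makes no real API calls.

```python
import random
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` within any `window` seconds (client-side only)."""

    def __init__(self, max_calls=40, window=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.window = window
        self.clock = clock
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait_time(self):
        """Seconds to wait before the next call is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.calls and now - self.calls[0] >= self.window:
            self.calls.popleft()
        if len(self.calls) < self.max_calls:
            return 0.0
        return self.window - (now - self.calls[0])

    def record(self):
        """Register that a call is being made now."""
        self.calls.append(self.clock())


def backoff_delay(attempt, base=1.0, cap=60.0, jitter=True, rng=random.random):
    """Exponential backoff: base * 2**attempt, capped, with optional full jitter."""
    delay = min(cap, base * (2 ** attempt))
    return delay * rng() if jitter else delay


def call_with_retry(send, limiter, max_attempts=5, sleep=time.sleep):
    """Throttle, call `send` (hypothetical; returns a status code), retry on 429."""
    for attempt in range(max_attempts):
        wait = limiter.wait_time()
        if wait > 0:
            sleep(wait)
        limiter.record()
        status = send()
        if status != 429:
            return status
        sleep(backoff_delay(attempt))
    return 429
```

With max_calls=40 and window=60.0, the 41st call in a burst has to wait until the oldest call leaves the 60-second window, so an agent loop that fans out dozens of calls in a few seconds stalls for most of a minute even when no 429 is returned.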

Commitment

Thank you for considering my request.

Best regards,
Miguel