I got rate limit error every time I use Nvidia NIM API (original) (raw)

Every time I call Nvidia NIM API through my hermes agent it just give me this

⚠️ API call failed (attempt 1/3): RateLimitError [HTTP 429]
🔌 Provider: nvidia Model: moonshotai/kimi-k2.6
🌐 Endpoint: https://integrate.api.nvidia.com/v1
📝 Error: HTTP 429: Error code: 429 - {‘status’: 429, ‘title’: ‘Too Many Requests’}
📋 Details: {‘status’: 429, ‘title’: ‘Too Many Requests’}
⏱️ Elapsed: 1.31s Context: 73 msgs, ~91,856 tokens
⏱️ Rate limited. Waiting 2.7s (attempt 2/3)…
⚠️ API call failed (attempt 2/3): RateLimitError [HTTP 429]
🔌 Provider: nvidia Model: moonshotai/kimi-k2.6
🌐 Endpoint: https://integrate.api.nvidia.com/v1
📝 Error: HTTP 429: Error code: 429 - {‘status’: 429, ‘title’: ‘Too Many Requests’}
📋 Details: {‘status’: 429, ‘title’: ‘Too Many Requests’}
⏱️ Elapsed: 4.44s Context: 73 msgs, ~91,856 tokens
⏱️ Rate limited. Waiting 4.7s (attempt 3/3)…
⚠️ API call failed (attempt 3/3): RateLimitError [HTTP 429]
🔌 Provider: nvidia Model: moonshotai/kimi-k2.6
🌐 Endpoint: https://integrate.api.nvidia.com/v1
📝 Error: HTTP 429: Error code: 429 - {‘status’: 429, ‘title’: ‘Too Many Requests’}
📋 Details: {‘status’: 429, ‘title’: ‘Too Many Requests’}
⏱️ Elapsed: 9.51s Context: 73 msgs, ~91,856 tokens
❌ Rate limited after 3 retries — HTTP 429: Error code: 429 - {‘status’: 429, ‘title’: ‘Too Many Requests’}
💀 Final error: HTTP 429: Error code: 429 - {‘status’: 429, ‘title’: ‘Too Many Requests’}

I tried using it after few minutes, the next day and after two days but still I am facing the issue. I did not even use it.

Please fix the issue