Requests in GitHub Copilot - GitHub Docs (original) (raw)

Learn about requests in Copilot, including premium requests, how they work, and how to manage your usage effectively.

Important

Premium requests for Spark and Copilot coding agent are tracked in dedicated SKUs from November 1, 2025. This provides better cost visibility and budget control for each AI product.
Billing for premium requests began on June 18, 2025, for all paid Copilot plans on GitHub.com, and on August 1, 2025, on GHE.com. The request counters were only set to zero for paid plans.
Premium request counters reset on the 1st of each month at 00:00:00 UTC. See Monitoring your GitHub Copilot usage and entitlements.
Certain requests may experience rate limits to accommodate high demand. Rate limits restrict the number of requests that can be made within a specific time period.

What is a request?

A request is any interaction where you ask Copilot to do something for you—whether it’s generating code, answering a question, or helping you through an extension. Each time you send a prompt in a chat window or trigger a response from Copilot, you’re making a request.

What are premium requests?

Some Copilot features use more advanced processing power and count as premium requests. The number of premium requests a feature consumes can vary depending on the feature and the AI model used.

Premium features

The following Copilot features can use premium requests.

Feature	Premium request consumption	SKU Attribution
Copilot Chat	Copilot Chat uses one premium request per user prompt, multiplied by the model's rate. This includes ask, edit, agent, and plan modes in Copilot Chat in an IDE.	Copilot premium requests
Copilot CLI	Each prompt to Copilot CLI uses one premium request with the default model. For other models, this is multiplied by the model's rate.	Copilot premium requests
Copilot code review	Each time Copilot reviews a pull request (when assigned as a reviewer) or reviews code in your IDE, one premium request is consumed.	Copilot premium requests
Copilot coding agent	Copilot coding agent uses one premium request per session, multiplied by the model's rate. A session begins when you ask Copilot to create a pull request or make one or more changes to an existing pull request. In addition, each real-time steering comment made during an active session uses one premium request per session, multiplied by the model's rate.	Copilot coding agent premium requests
Copilot Spaces	Copilot Spaces uses one premium request per user prompt, multiplied by the model's rate.	Copilot premium requests
Spark	Each prompt to Spark uses a fixed rate of four premium requests.	Spark premium requests
OpenAI Codex integration	While in preview, each prompt to OpenAI Codex uses one premium request multiplied by the model multiplier rates.	Copilot premium requests

How do request allowances work per plan?

If you use Copilot Free, your plan comes with up to 2,000 inline suggestion requests and up to 50 premium requests per month. All chat interactions count as premium requests.

If you're on a paid plan, you get unlimited inline suggestions and unlimited chat interactions using the included models (GPT-5 mini, GPT-4.1 and GPT-4o). Rate limiting is in place to accommodate for high demand. See Rate limits for GitHub Copilot.

Paid plans also receive a monthly allowance of premium requests, which can be used for advanced chat interactions, inline suggestions using premium models, and other premium features. For an overview of the amount of premium requests included in each plan, see Plans for GitHub Copilot.

Note

If a user has licenses from multiple enterprises, or standalone organizations, they must make a selection using the "Usage billed to" drop down in order to utilize premium requests. The billing entity selected will be billed for any premium requests they make. See Monitoring your GitHub Copilot usage and entitlements.

What happens to unused requests at the end of the month?

Unused requests for the previous month do not carry over to the following month.

What if I run out of premium requests?

Note

Additional premium requests are not available to:

Users on Copilot Free. To access more premium requests, upgrade to a paid plan.
Users who subscribe, or have subscribed, to Copilot Pro or Copilot Pro+ through GitHub Mobile on iOS or Android.

If you're on a paid plan and use all of your premium requests, you can still use Copilot with one of the included models for the rest of the month. This is subject to change. Response times for the included models may vary during periods of high usage. Requests to the included models may be subject to rate limiting. See Rate limits for GitHub Copilot.

If you need more premium requests beyond your monthly allowance:

For an individual subscription, set a budget for additional premium requests or upgrade to a higher plan. See Setting up budgets to control spending on metered products.
If you're an enterprise or organization owner, ensure that the "Premium request paid usage" policy is enabled and that extra spending is not prevented by a budget. See Managing the premium request allowance for your organization or enterprise.

Accounts created before August 22, 2025 have a default $0 budget for Copilot premium requests. Premium requests over the allowance are rejected unless you edit or delete this budget.

Model multipliers

The available models vary depending on your Copilot plan. See Plans for GitHub Copilot.

Note

The models included with Copilot plans are subject to change.
Discounted multipliers are available for using Copilot auto model selection in Copilot Chat in VS Code. See About Copilot auto model selection.
- If you are on a paid Copilot plan and use auto model selection, models qualify for a 10% multiplier discount. For example, Sonnet 4 would be billed at .9x rather than 1x when using auto model selection.
- Discounted multipliers are not available for Copilot Free.

Each model has a premium request multiplier, based on its complexity and resource usage. If you are on a paid Copilot plan, your premium request allowance is deducted according to this multiplier.

GPT-5 mini, GPT-4.1 and GPT-4o are the included models, and do not consume any premium requests if you are on a paid plan.

If you use Copilot Free, you have access to a limited number of models, and each model will consume one premium request when used.

Examples of premium request usage

Premium request usage is based on the model’s multiplier and the feature you’re using. For example:

Using Claude Opus 4.1 in Copilot Chat: With a 10× multiplier, one interaction counts as 10 premium requests.
Using GPT-5 mini on Copilot Free: Each interaction counts as 1 premium request.
Using GPT-5 mini on a paid plan: No premium requests are consumed.