Requests in GitHub Copilot - GitHub Docs (original) (raw)

Learn about requests in Copilot, including premium requests, how they work, and how to manage your usage effectively.

Important

What is a request?

A request is any interaction where you ask Copilot to do something for you—whether it’s generating code, answering a question, or helping you through an extension. Each time you send a prompt in a chat window or trigger a response from Copilot, you’re making a request.

What are premium requests?

Some Copilot features use more advanced processing power and count as premium requests. The number of premium requests a feature consumes can vary depending on the feature and the AI model used.

Premium features

The following Copilot features can use premium requests.

Feature Premium request consumption SKU Attribution
Copilot Chat Copilot Chat uses one premium request per user prompt, multiplied by the model's rate. This includes ask, edit, agent, and plan modes in Copilot Chat in an IDE. Copilot premium requests
Copilot CLI Each prompt to Copilot CLI uses one premium request with the default model. For other models, this is multiplied by the model's rate. Copilot premium requests
Copilot code review Each time Copilot reviews a pull request (when assigned as a reviewer) or reviews code in your IDE, one premium request is consumed. Copilot premium requests
Copilot coding agent Copilot coding agent uses one premium request per session, multiplied by the model's rate. A session begins when you ask Copilot to create a pull request or make one or more changes to an existing pull request. In addition, each real-time steering comment made during an active session uses one premium request per session, multiplied by the model's rate. Copilot coding agent premium requests
Copilot Spaces Copilot Spaces uses one premium request per user prompt, multiplied by the model's rate. Copilot premium requests
Spark Each prompt to Spark uses a fixed rate of four premium requests. Spark premium requests
OpenAI Codex integration While in preview, each prompt to OpenAI Codex uses one premium request multiplied by the model multiplier rates. Copilot premium requests

How do request allowances work per plan?

If you use Copilot Free, your plan comes with up to 2,000 inline suggestion requests and up to 50 premium requests per month. All chat interactions count as premium requests.

If you're on a paid plan, you get unlimited inline suggestions and unlimited chat interactions using the included models (GPT-5 mini, GPT-4.1 and GPT-4o). Rate limiting is in place to accommodate for high demand. See Rate limits for GitHub Copilot.

Paid plans also receive a monthly allowance of premium requests, which can be used for advanced chat interactions, inline suggestions using premium models, and other premium features. For an overview of the amount of premium requests included in each plan, see Plans for GitHub Copilot.

Note

If a user has licenses from multiple enterprises, or standalone organizations, they must make a selection using the "Usage billed to" drop down in order to utilize premium requests. The billing entity selected will be billed for any premium requests they make. See Monitoring your GitHub Copilot usage and entitlements.

What happens to unused requests at the end of the month?

Unused requests for the previous month do not carry over to the following month.

What if I run out of premium requests?

Note

Additional premium requests are not available to:

If you're on a paid plan and use all of your premium requests, you can still use Copilot with one of the included models for the rest of the month. This is subject to change. Response times for the included models may vary during periods of high usage. Requests to the included models may be subject to rate limiting. See Rate limits for GitHub Copilot.

If you need more premium requests beyond your monthly allowance:

Accounts created before August 22, 2025 have a default $0 budget for Copilot premium requests. Premium requests over the allowance are rejected unless you edit or delete this budget.

Model multipliers

The available models vary depending on your Copilot plan. See Plans for GitHub Copilot.

Note

Each model has a premium request multiplier, based on its complexity and resource usage. If you are on a paid Copilot plan, your premium request allowance is deducted according to this multiplier.

GPT-5 mini, GPT-4.1 and GPT-4o are the included models, and do not consume any premium requests if you are on a paid plan.

If you use Copilot Free, you have access to a limited number of models, and each model will consume one premium request when used.

Examples of premium request usage

Premium request usage is based on the model’s multiplier and the feature you’re using. For example: