Reserve capacity through your account team

This document explains how to obtain capacity when you want to use the future reservations in AI Hypercomputer consumption option. To learn about all the capacity options in AI Hypercomputer, see Capacity overview.

For a very high assurance that your workloads have the resources they need, request a future reservation from Google. This action lets you reserve blocks of capacity for a defined duration, starting on a specific date and time that you choose. Based on your request, Google creates a draft future reservation request. After you review and submit this draft request, and Google Cloud approves it, Compute Engine automatically creates (auto-creates) an empty reservation. Then, at your chosen start time, Compute Engine provisions your requested capacity into the auto-created reservation. You can then use the reservation to create Compute Engine instances until the reservation period ends.

Limitations

This section describes the limitations for future reservation requests, and for the auto-created reservation for a request.

Limitations for future reservation requests

After Google creates a draft future reservation request for you, the following limitations apply:

Limitations for auto-created reservations

After Compute Engine creates an on-demand reservation to fulfill your requested capacity, the following limitations apply:

Before you begin

Console

When you use the Google Cloud console to access Google Cloud services and APIs, you don't need to set up authentication.

gcloud

In the Google Cloud console, activate Cloud Shell.
At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

REST

To use the REST API samples on this page in a local development environment, you use the credentials you provide to the gcloud CLI.
Install the Google Cloud CLI.
If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
For more information, see Authenticate for using REST in the Google Cloud authentication documentation.

Required roles

To get the permissions that you need to create a future reservation request, ask your administrator to grant you the Compute Future Reservation User (roles/compute.futureReservationUser) IAM role on the project. For more information about granting roles, see Manage access to projects, folders, and organizations.

This predefined role contains the permissions required to create a future reservation request. To see the exact permissions that are required, expand the Required permissions section:

Required permissions

The following permissions are required to create a future reservation request:

You might also be able to get these permissions with custom roles or other predefined roles.
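
For an administrator who grants the role from the command line, one common route is the gcloud projects add-iam-policy-binding command. The following Python sketch only assembles that command as an argument list and prints it; it runs nothing. The helper name grant_role_command, the project ID, and the member are placeholders chosen for illustration and are not part of this page.

```python
import shlex
from typing import List

# Role named in the "Required roles" section of this page.
ROLE = "roles/compute.futureReservationUser"


def grant_role_command(project_id: str, member: str) -> List[str]:
    """Build (but do not run) a gcloud command that grants ROLE on a project.

    `member` uses IAM member syntax, for example "user:alex@example.com".
    """
    return [
        "gcloud", "projects", "add-iam-policy-binding", project_id,
        f"--member={member}",
        f"--role={ROLE}",
    ]


if __name__ == "__main__":
    # Placeholder values; replace them with a real project ID and principal.
    cmd = grant_role_command("example-project", "user:alex@example.com")
    print(shlex.join(cmd))
```

Returning a list instead of a single string keeps each argument intact if the command is later passed to subprocess.run.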

Quota

As part of the future reservation request process, Google manages quota for your reserved resources. You don't need to request quota. At the start time of your approved future reservation, Google automatically increases your quota if your project doesn't have enough for the reserved resources.

Overview

To reserve blocks of capacity, complete the following steps:

  1. Request capacity through your account team. Contact your account team to specify the type and number of resources that you want to reserve.
  2. Review and submit a draft reservation request. After Google creates a draft future reservation request, review it. If it looks correct, then submit the request for review. Google Cloud approves it within a few minutes.

Request capacity through your account team

Contact your account team and provide the following information for Google to create a draft future reservation request:

After Google creates the draft future reservation request, your account team contacts you.

Review and submit a draft reservation request

After you provide the type and amount of resources to reserve to your account team, Google creates a draft future reservation request. You can review the draft request and, if correct, submit it for review. You must submit the request before the request start time.
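
The submission deadline can be checked from the timestamps in the draft request. The following Python sketch assumes RFC 3339 timestamps like the timeWindow.startTime and timeWindow.endTime values in the sample REST output on this page. The function names are hypothetical, and the check covers only the "before the request start time" rule stated above.

```python
from datetime import datetime, timedelta, timezone


def parse_rfc3339(value: str) -> datetime:
    """Parse a timestamp such as "2026-01-27T19:20:00Z" into an aware datetime."""
    # Older Python versions of datetime.fromisoformat() reject a trailing "Z",
    # so it is rewritten as an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def can_still_submit(start_time: str, now: datetime) -> bool:
    """Return True if `now` is strictly before the request start time."""
    return now < parse_rfc3339(start_time)


def reservation_period(start_time: str, end_time: str) -> timedelta:
    """Return the length of the reservation period."""
    return parse_rfc3339(end_time) - parse_rfc3339(start_time)


if __name__ == "__main__":
    start, end = "2026-01-27T19:20:00Z", "2026-02-10T19:20:00Z"
    print("period:", reservation_period(start, end))
    print("can submit now:", can_still_submit(start, datetime.now(timezone.utc)))
```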

To review and submit a draft future reservation request, select one of the following options:

Console

  1. In the Google Cloud console, go to the Reservations page.
    Go to Reservations
  2. Click the Future reservations tab. The Future Reservations table lists each future reservation request in your project, and each table column describes a property.
  3. In the Name column, click the name of the draft request that Google created for you. A page that gives the details of the future reservation request opens.
  4. In the Basic information section, verify that the request details, such as Dates and Share type, are correct. Also, if you requested a commitment, verify that it's specified. If any of these details are incorrect, then contact your account team.
  5. If everything looks accurate, then submit your request:
    1. Click Edit draft. A page to modify the draft request appears.
    2. Click Create. The Reservation page appears. Google Cloud approves your request within a few minutes, and then Compute Engine creates an empty reservation with your requested resources.

gcloud

  1. To view a list of future reservation requests in your project, use the gcloud compute future-reservations list command with the --filter flag set to PROCUREMENT_STATUS=DRAFTING:
    gcloud compute future-reservations list --filter=PROCUREMENT_STATUS=DRAFTING
  2. In the command output, look for the reservation request that has the name that you provided to your account team.
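  3. To view the details of the draft request, use the gcloud compute future-reservations describe command:
    gcloud compute future-reservations describe FUTURE_RESERVATION_NAME \
    --zone=ZONE
    Replace the following:
    • FUTURE_RESERVATION_NAME: the name of the draft future reservation request.
    • ZONE: the zone where the request exists.
    The output is similar to the following:
    ...
    specificSkuProperties:
      instanceProperties:
        guestAccelerators:
        - acceleratorCount: 8
          acceleratorType: nvidia-h200-141gb
        localSsds:
        - diskSizeGb: '375'
          interface: NVME
        ...
        machineType: a3-ultragpu-8g
      totalCount: '2'
    status:
      autoCreatedReservations:
      ...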
  4. In the command output, verify that the request details, such as the reservation period and share type, are correct. Additionally, if you purchased a commitment, verify that it's specified. If the details are incorrect, then contact your account team.
  5. To submit the draft request for review, use the gcloud compute future-reservations update command with the --planning-status flag set to SUBMITTED:
    gcloud compute future-reservations update FUTURE_RESERVATION_NAME \
    --planning-status=SUBMITTED \
    --zone=ZONE
    Within a few minutes, Google Cloud approves your request, and then Compute Engine creates an empty reservation with your requested resources.
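
The gcloud steps above can be sketched as a small Python wrapper. This is a minimal illustration, not an official client. The function names are hypothetical, and the --format=json flag (a general gcloud output flag) is added so that the output can be parsed. The commands are built as argument lists and run only when run_json is called explicitly.

```python
import json
import shlex
import subprocess
from typing import Any, List

BASE = ["gcloud", "compute", "future-reservations"]


def list_drafts_command() -> List[str]:
    """Step 1: list future reservation requests that are still drafts."""
    return BASE + ["list", "--filter=PROCUREMENT_STATUS=DRAFTING", "--format=json"]


def describe_command(name: str, zone: str) -> List[str]:
    """Step 3: show the details of one draft request."""
    return BASE + ["describe", name, f"--zone={zone}", "--format=json"]


def submit_command(name: str, zone: str) -> List[str]:
    """Step 5: submit the draft request for review."""
    return BASE + ["update", name, "--planning-status=SUBMITTED", f"--zone={zone}"]


def run_json(cmd: List[str]) -> Any:
    """Run a command that prints JSON and return the parsed result.

    Requires an installed and authenticated gcloud CLI.
    """
    done = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return json.loads(done.stdout)


if __name__ == "__main__":
    # Dry run: print the commands instead of running them.
    # The request name and zone are placeholders.
    for cmd in (
        list_drafts_command(),
        describe_command("example-draft-request", "europe-west1-b"),
        submit_command("example-draft-request", "europe-west1-b"),
    ):
        print(shlex.join(cmd))
```

Keeping the submit step as a separate function makes it easy to place a manual review of the describe output between steps 3 and 5.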

REST

  1. To view a list of future reservation requests in your project, make a GET request to the futureReservations.list method. In the request URL, include the filter query parameter and set it to status.procurementStatus=DRAFTING:
    GET https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/futureReservations?filter=status.procurementStatus=DRAFTING
    Replace the following:
    • PROJECT_ID: the ID of the project where Google created the draft future reservation request.
    • ZONE: the zone where the request exists.
  2. In the request output, look for the reservation request that has the name that you provided to your account team.
  3. To view the details of the draft request, make a GET request to the beta futureReservations.get method:
    GET https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/futureReservations/FUTURE_RESERVATION_NAME
    Replace FUTURE_RESERVATION_NAME with the name of the draft future reservation request.
    The output is similar to the following:
    {
      "specificSkuProperties": {
        "instanceProperties": {
          "machineType": "a3-ultragpu-8g",
          "guestAccelerators": [
            {
              "acceleratorType": "nvidia-h200-141gb",
              "acceleratorCount": 8
            }
          ],
          "localSsds": [
            {
              "diskSizeGb": "375",
              "interface": "NVME"
            },
            ...
          ]
        },
        "totalCount": "2"
      },
      "kind": "compute#futureReservation",
      "id": "7979651787097007552",
      "creationTimestamp": "2025-11-27T11:14:58.305-08:00",
      "selfLink": "https://www.googleapis.com/compute/v1/projects/example-project/zones/europe-west1-b/futureReservations/example-draft-request",
      "selfLinkWithId": "https://www.googleapis.com/compute/v1/projects/example-project/zones/europe-west1-b/futureReservations/7979651787097007552",
      "zone": "https://www.googleapis.com/compute/v1/projects/example-project/zones/europe-west1-b",
      "name": "example-draft-request",
      "timeWindow": {
        "startTime": "2026-01-27T19:20:00Z",
        "endTime": "2026-02-10T19:20:00Z"
      },
      "status": {
        "procurementStatus": "DRAFTING",
        "lockTime": "2026-01-27T19:15:00Z"
      },
      "planningStatus": "DRAFT",
      "specificReservationRequired": true,
      "reservationName": "example-reservation",
      "deploymentType": "DENSE",
      "schedulingType": "INDEPENDENT",
      "autoCreatedReservationsDeleteTime": "2026-02-10T19:20:00Z"
    }
  4. In the output, verify that the request details, such as the reservation period and share type, are correct. Additionally, if you requested a commitment, verify that it's specified. If the details are incorrect, then contact your account team.
  5. To submit the draft request for review, make a PATCH request to the beta futureReservations.update method. In the request URL, include the updateMask query parameter and set it to planningStatus:
    PATCH https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/futureReservations/FUTURE_RESERVATION_NAME?updateMask=planningStatus
    {
      "name": "FUTURE_RESERVATION_NAME",
      "planningStatus": "SUBMITTED"
    }
    Within a few minutes, Google Cloud approves your request, and then Compute Engine creates an empty reservation with your requested resources.
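
The REST steps above can be sketched in Python with only the standard library. This is an illustrative outline under stated assumptions: the function names are hypothetical, the access token is assumed to come from the gcloud CLI (for example, from gcloud auth print-access-token), and the code builds the requests without sending them. The review_summary helper extracts a few fields shown in the sample output so that they can be compared against what you asked your account team for.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode

BASE = "https://compute.googleapis.com/compute/v1"


def collection_url(project_id: str, zone: str) -> str:
    """URL of the futureReservations collection for a project and zone."""
    return (f"{BASE}/projects/{quote(project_id, safe='')}"
            f"/zones/{quote(zone, safe='')}/futureReservations")


def list_drafts_url(project_id: str, zone: str) -> str:
    """Step 1: URL that lists requests whose procurement status is DRAFTING."""
    query = urlencode({"filter": "status.procurementStatus=DRAFTING"})
    return f"{collection_url(project_id, zone)}?{query}"


def review_summary(fr: dict) -> dict:
    """Step 4: collect the fields that are most useful to double-check."""
    props = fr["specificSkuProperties"]
    return {
        "machineType": props["instanceProperties"]["machineType"],
        "totalCount": int(props["totalCount"]),  # returned as a string
        "startTime": fr["timeWindow"]["startTime"],
        "endTime": fr["timeWindow"]["endTime"],
        "planningStatus": fr["planningStatus"],
    }


def build_submit_request(project_id: str, zone: str, name: str,
                         access_token: str) -> urllib.request.Request:
    """Step 5: build (but do not send) the PATCH request that submits a draft."""
    url = (f"{collection_url(project_id, zone)}/{quote(name, safe='')}"
           "?updateMask=planningStatus")
    body = json.dumps({"name": name, "planningStatus": "SUBMITTED"}).encode("utf-8")
    return urllib.request.Request(
        url, data=body, method="PATCH",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )
```

To send the request, pass the returned object to urllib.request.urlopen. Building the request separately from sending it keeps the review in step 4 ahead of any change to the draft.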

Last updated 2026-06-15 UTC.