Automate web and desktop apps with computer use - Microsoft Copilot Studio (original) (raw)

Computer use is a tool in Copilot Studio that lets your agent interact with and automate tasks on a Windows computer. It works with websites and desktop apps by selecting buttons, choosing menus, and entering text into fields on the screen. Describe in natural language what you want computer use to do, and it performs the task on a computer you set up by using a virtual mouse and keyboard. By using computer use, agents can complete tasks even when there's no API to connect directly to the system. If a person can use an app or website, computer use can too. You can use computer use for tasks like automated data entry, invoice processing, and data extraction.

Computer use is powered by Computer-Using Agents (CUA), an AI model that combines vision capabilities with advanced reasoning to interact with graphical user interfaces (GUIs). Because it's AI-powered, it adapts to interface changes. For example, when buttons or screens change, the tool continues working without breaking your flow. It's simple to use. Just describe what you want in natural language. You don't need to write code.

Watch this video to learn how the computer use tool lets an agent interact with a web application.

Add computer use to your agent

In Copilot Studio, add computer use as a tool by following these steps:

  1. Go to the Tools page for your agent and select Add tool.
  2. In Add tool, select New tool.
  3. Select Computer use.
  4. Provide the instructions that describe the task the tool should perform on the computer. Instruction templates are available to help you get started. Follow best practices for instructions for computer use.
  5. Select Add and configure. On the configuration page, configure these four fields:
    • Name: Enter the display name for the computer use tool. This name helps you differentiate it from other tools you add to your agent.
    • Description: Enter a short description of what this tool does and when to use it. This text lets your agent know when to use this tool.
    • Model: Select the model used to execute the computer use tool. The following models are available:
      Provider Model Tier Status
      OpenAI Computer-Using Agent (CUA) Standard Generally available
      Anthropic Claude Sonnet 4.5 Standard Generally available
      Anthropic Claude Sonnet 4.6 Standard Experimental
      Anthropic Claude Opus 4.6 Premium Experimental
    • Instructions: List the steps the tool should perform, including URLs and application names. Follow best practices for instructions for computer use.
  6. Optionally, review and adjust the other fields available on the configuration page.
  7. Select Save.

Configure optional settings

The configuration page includes optional fields beyond the four required fields (Name, Description, Model, Instructions). Review and configure these options as needed for your scenario.

Inputs

Use inputs to define dynamic values that change each time computer use runs. For example, if you want to fill out a form with a different value on every run, create an input for that field. At execution time, computer use combines your instructions with the input values to complete the task.

Machine

Select the target machine that the agent uses to run computer use. Learn more in Configure where computer use runs.

Connection

Connection used for this tool. Update or create a new connection to change the credentials used.

Credentials to use

Specify how computer use authenticates during execution:

Human supervision

Specify who should be contacted via email (Outlook) if the computer-use agent detects potentially harmful instructions that could alter model behavior. Each agent run and its activity is tied to the user who initiated it. If you choose a reviewer other than the person running the computer-use agent, they likely don't see the activity because they didn't initiate the run. Therefore, they can't properly verify or act on the request. Also, ensure the recipient is authorized and has the necessary context to handle such requests. The response time limit defines how long the request remains active. After this period, the request expires, and the computer-use run stops if no response is received. Learn more in Human supervision.

Stored credentials

Define the credentials that computer use uses to sign in to websites and applications. During execution, if a sign-in prompt appears, computer use securely uses any credentials you defined in this section for that site or application. Passwords are either stored in Power Platform internal storage (no configuration required) or in an Azure Key Vault that you provide. Learn more in Create a key vault using the Azure portal.

Access control

By default, computer use can operate on any website or application. If you want to restrict this access, enable access control to define the specific URLs and desktop applications that computer use should be limited to. You can configure both websites and applications:

Note

Access control only prevents the model from taking actions on websites or applications that aren't in the allow list. It doesn't stop the model from opening them. For example, if only microsoft.com and Microsoft Edge are in the allow list, the model can still use the Microsoft Edge search bar to open Bing. However, once Bing is opened, any attempt to interact with it fails because it isn't in the allow list.

Test computer use

Testing computer use is a key step in the authoring journey. After you enter a name, description, and instructions, and save the tool, select Test to start the test experience.

After a short loading period, the test experience appears:

When the task finishes, a Test completed message appears. While the test is in progress, you can select Stop testing to immediately stop all actions on the machine.

If the result isn't what you expect, go to the configuration page and refine your instructions. Add more detail to improve accuracy. Follow best practices for writing effective instructions.

Publish an agent with computer use

Set up computer use, and then publish your agent. How your agent runs depends on the scenario. It can be autonomous or conversational:

Computer use works best for autonomous agents, performing tasks in the background without user interaction.

You can also apply computer use in conversational experiences, but keep these considerations in mind:

Best practices

To stay productive and safe in today's digital environments, follow best practices for computer use, especially when writing clear instructions and securing machines.

Best practices for securing machines

When setting up machines for computer use that lets AI perform tasks by using natural language, consider these security recommendations:

Security recommendation Additional information
Use dedicated machines for computer use Assign specific, isolated machines exclusively for tasks involving computer use. This approach reduces the risk of cross-contamination from unrelated software, malware, or unauthorized access. It lets you control configurations, updates, and monitoring more effectively.
Limit permissions to the user account that you're using for computer use Set up the user account for computer use to follow the principle of least privilege—grant only the permissions needed to run the required tools.
Limit web access to an allow list of specific trusted websites only Allow web access only to a predefined list of vetted and trusted domains. For example, you can configure Microsoft Edge policy settings with Microsoft Intune that target machines used for computer use.
Limit specific desktop apps to be available Only install and allow execution of applications that are essential for the intended AI workflows. Remove or disable access to unnecessary software. For example, you can configure application control to limit what applications are allowed to run on the machine.

Best practices for instructions for computer use

The instructions you write determine how well computer use works. Specific, detailed instructions help computer use complete tasks accurately. Think of it as explaining a task to a colleague. Clear, step-by-step guidance helps ensure success.

Tips for writing effective instructions:

Sample instructions

Explore these sample instructions to try out computer use or use them as a reference for writing your own.

Scenario Name Description Instructions
Invoice processing Transfer and submit invoice details Transfer invoice data from a PDF and submit it to another form. 1. Go to https://computerusedemos.blob.core.windows.net/web/Contoso/invoice-manager.html, set the Date filter to Last 24 hours, and open the invoice PDF.2. In a new tab, open https://computerusedemos.blob.core.windows.net/web/Contoso/index.html and fill out the form with the data from that PDF. Submit the invoice form, no confirmation needed.
Data entry Submit inventory items Add products to the inventory system. 1. Go to https://computerusedemos.blob.core.windows.net/web/Adventure/index.html.2. Submit a new entry for each of the following items:Rear Derailleur, RD-4821, 50, 42.75, Tailspin ToysPedal Set, PD-1738, 80, 19.99, Northwind TradersBrake Lever, BL-2975, 35, 14.50, Trey ResearchChainring Bolt Set, CB-6640, 100, 5.25, VanArsdel, Ltd.Bottom Bracket, BB-9320, 60, 24.90, Tailwind Traders
Data extraction Look up portfolio manager and value Get the manager name and value for a portfolio. 1. Go to https://computerusedemos.blob.core.windows.net/web/Portfolio/index.html.2. Find the row for Fourth Coffee and record the Portfolio Manager name and the current Portfolio Value exactly as shown.3. Return those two values as the final output.

Use computer use to extract data from websites or applications. Then, pass that data to your agent or to other tools in the agent. Describe what information you want to extract directly in the instructions for computer use.

If you want to use the extracted data with another tool (for example, sending it by email), specify that need in the agent instructions and make sure you add both tools to the agent. Example: Use computer use to extract the client information for the financial portfolio dashboard and email the extracted data. In this case, your agent must include both the computer use tool and the email tool.

Instruct computer use to extract specific values and return them as plain text. For example, the following prompt extracts the portfolio manager and portfolio value for the client, Fourth Coffee:

1. Go to https://computerusedemos.blob.core.windows.net/web/Portfolio/index.html.
1. Find the row for Fourth Coffee and record the Portfolio Manager name and the current Portfolio Value exactly as shown.
1. Return those two values as the final output.

The agent returns only the two requested values as text.

You can also ask computer use to return extracted information in JSON format. This format is useful when you want to pass structured data to another tool. The following example extracts all portfolio rows related to Contoso and returns them as properly formatted JSON:

Navigate to https://computerusedemos.blob.core.windows.net/web/Portfolio/index.html, retrieve the portfolio details for all Contoso entities, and return the results as a valid JSON object.

Structure the output so that:
*    Each top-level key is the client name
*    Each value contains the client's portfolio ID, portfolio value, portfolio manager, and last updated date (format: YYYY-MM-DD)
*    Return only the JSON, with no additional text.

This approach ensures the output is structured.

Licensing

Computer use bills usage through the Agent action feature with a billing rate of five Copilot Credits. Learn more in Billing rate and management.

Each computer use run relies on an AI model that executes a sequence of steps. A step might involve one or more low-level actions, such as clicking, typing, or navigating. Each step consumes 5 Copilot Credits, or 15 Copilot Credits when using a premium model.

The following models are available for computer use:

Provider Model Tier Credits per step
OpenAI Computer-Using Agent (CUA) Standard 5
Anthropic Claude Sonnet 4.5 Standard 5
Anthropic Claude Sonnet 4.6 Standard 5
Anthropic Claude Opus 4.6 Premium 15

For example, if you configure computer use to fill out a web-based time-sheet form, the run might perform the following steps:

  1. Launch the browser and go to the time-sheet portal.
  2. Select Create new time sheet.
  3. Fill in the Start Time, End Time, and Project Code fields.
  4. Select Submit.

In this example, the computer use run executes four steps. If it uses a standard model, this process consumes a total of 20 Copilot Credits. If it uses a premium model, it consumes a total of 60 Copilot Credits.

Do you have feedback about computer use? Let us know at computeruse-feedback@microsoft.com.