playwright-mcp/README.md

## Playwright MCP

A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

### Key Features

- **Fast and lightweight**: Uses Playwright's accessibility tree, not pixel-based input.
- **LLM-friendly**: No vision models needed, operates purely on structured data.
- **Deterministic tool application**: Avoids ambiguity common with screenshot-based approaches.

### Use Cases

- Web navigation and form-filling
- Data extraction from structured content
- Automated testing driven by LLMs
- General-purpose browser interaction for agents

### Example config

```js
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": [
        "@playwright/mcp@latest"
      ]
    }
  }
}
```


#### Installation in VS Code

Install the Playwright MCP server in VS Code using one of these buttons:

<!--
// Generate using?:
const config = JSON.stringify({ name: 'playwright', command: 'npx', args: ["-y", "@playwright/mcp@latest"] });
const urlForWebsites = `vscode:mcp/install?${encodeURIComponent(config)}`;
// Github markdown does not allow linking to `vscode:` directly, so you can use our redirect:
const urlForGithub = `https://insiders.vscode.dev/redirect?url=${encodeURIComponent(urlForWebsites)}`;
-->

[<img alt="Install in VS Code Insiders" src="https://img.shields.io/badge/VS_Code_Insiders-VS_Code_Insiders?style=flat-square&label=Install%20Server&color=24bfa5">](https://insiders.vscode.dev/redirect?url=vscode-insiders%3Amcp%2Finstall%3F%257B%2522name%2522%253A%2522playwright%2522%252C%2522command%2522%253A%2522npx%2522%252C%2522args%2522%253A%255B%2522-y%2522%252C%2522%2540playwright%252Fmcp%2540latest%2522%255D%257D)

Alternatively, you can install the Playwright MCP server using the VS Code CLI:

```bash
# For VS Code
code --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'
```

```bash
# For VS Code Insiders
code-insiders --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'
```

After installation, the Playwright MCP server will be available for use with your GitHub Copilot agent in VS Code.

### User data directory

Playwright MCP will launch Chrome browser with the new profile, located at

```
- `%USERPROFILE%\AppData\Local\ms-playwright\mcp-chrome-profile` on Windows
- `~/Library/Caches/ms-playwright/mcp-chrome-profile` on macOS
- `~/.cache/ms-playwright/mcp-chrome-profile` on Linux
```

All the logged in information will be stored in that profile, you can delete it between sessions if you'dlike to clear the offline state.


### Running headless browser (Browser without GUI).

This mode is useful for background or batch operations.

```js
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": [
        "@playwright/mcp@latest",
        "--headless"
      ]
    }
  }
}
```

### Running headed browser on Linux w/o DISPLAY

When running headed browser on system w/o display or from worker processes of the IDEs,
you can run Playwright in a client-server manner. You'll run the Playwright server
from environment with the DISPLAY

```sh
npx playwright run-server
```

And then in MCP config, add following to the `env`:

```js
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": [
        "@playwright/mcp@latest"
      ],
      "env": {
        // Use the endpoint from the output of the server above.
        "PLAYWRIGHT_WS_ENDPOINT": "ws://localhost:<port>/"
      }
    }
  }
}
```

### Tool Modes

The tools are available in two modes:

1. **Snapshot Mode** (default): Uses accessibility snapshots for better performance and reliability
2. **Vision Mode**: Uses screenshots for visual-based interactions

To use Vision Mode, add the `--vision` flag when starting the server:

```js
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": [
        "@playwright/mcp@latest",
        "--vision"
      ]
    }
  }
}
```

Vision Mode works best with the computer use models that are able to interact with elements using
X Y coordinate space, based on the provided screenshot.

### Programmatic usage with custom transports

```js
import { createServer } from '@playwright/mcp';

// ...

const server = createServer({
  launchOptions: { headless: true }
});
transport = new SSEServerTransport("/messages", res);
server.connect(transport);
```

### Snapshot Mode

The Playwright MCP provides a set of tools for browser automation. Here are all available tools:

- **browser_navigate**
  - Description: Navigate to a URL
  - Parameters:
    - `url` (string): The URL to navigate to

- **browser_go_back**
  - Description: Go back to the previous page
  - Parameters: None

- **browser_go_forward**
  - Description: Go forward to the next page
  - Parameters: None

- **browser_click**
  - Description: Perform click on a web page
  - Parameters:
    - `element` (string): Human-readable element description used to obtain permission to interact with the element
    - `ref` (string): Exact target element reference from the page snapshot

- **browser_hover**
  - Description: Hover over element on page
  - Parameters:
    - `element` (string): Human-readable element description used to obtain permission to interact with the element
    - `ref` (string): Exact target element reference from the page snapshot

- **browser_drag**
  - Description: Perform drag and drop between two elements
  - Parameters:
    - `startElement` (string): Human-readable source element description used to obtain permission to interact with the element
    - `startRef` (string): Exact source element reference from the page snapshot
    - `endElement` (string): Human-readable target element description used to obtain permission to interact with the element
    - `endRef` (string): Exact target element reference from the page snapshot

- **browser_type**
  - Description: Type text into editable element
  - Parameters:
    - `element` (string): Human-readable element description used to obtain permission to interact with the element
    - `ref` (string): Exact target element reference from the page snapshot
    - `text` (string): Text to type into the element
    - `submit` (boolean): Whether to submit entered text (press Enter after)

- **browser_select_option**
  - Description: Select option in a dropdown
  - Parameters:
    - `element` (string): Human-readable element description used to obtain permission to interact with the element
    - `ref` (string): Exact target element reference from the page snapshot
    - `values` (array): Array of values to select in the dropdown.

- **browser_press_key**
  - Description: Press a key on the keyboard
  - Parameters:
    - `key` (string): Name of the key to press or a character to generate, such as `ArrowLeft` or `a`

- **browser_snapshot**
  - Description: Capture accessibility snapshot of the current page (better than screenshot)
  - Parameters: None

- **browser_save_as_pdf**
  - Description: Save page as PDF
  - Parameters: None

- **browser_take_screenshot**
  - Description: Capture screenshot of the page
  - Parameters:
    - `raw` (string): Optionally returns lossless PNG screenshot. JPEG by default.

- **browser_wait**
  - Description: Wait for a specified time in seconds
  - Parameters:
    - `time` (number): The time to wait in seconds (capped at 10 seconds)

- **browser_close**
  - Description: Close the page
  - Parameters: None


### Vision Mode

Vision Mode provides tools for visual-based interactions using screenshots. Here are all available tools:

- **browser_navigate**
  - Description: Navigate to a URL
  - Parameters:
    - `url` (string): The URL to navigate to

- **browser_go_back**
  - Description: Go back to the previous page
  - Parameters: None

- **browser_go_forward**
  - Description: Go forward to the next page
  - Parameters: None

- **browser_screenshot**
  - Description: Capture screenshot of the current page
  - Parameters: None

- **browser_move_mouse**
  - Description: Move mouse to specified coordinates
  - Parameters:
    - `x` (number): X coordinate
    - `y` (number): Y coordinate

- **browser_click**
  - Description: Click at specified coordinates
  - Parameters:
    - `x` (number): X coordinate to click at
    - `y` (number): Y coordinate to click at

- **browser_drag**
  - Description: Perform drag and drop operation
  - Parameters:
    - `startX` (number): Start X coordinate
    - `startY` (number): Start Y coordinate
    - `endX` (number): End X coordinate
    - `endY` (number): End Y coordinate

- **browser_type**
  - Description: Type text at specified coordinates
  - Parameters:
    - `text` (string): Text to type
    - `submit` (boolean): Whether to submit entered text (press Enter after)

- **browser_press_key**
  - Description: Press a key on the keyboard
  - Parameters:
    - `key` (string): Name of the key to press or a character to generate, such as `ArrowLeft` or `a`

- **browser_save_as_pdf**
  - Description: Save page as PDF
  - Parameters: None

- **browser_wait**
  - Description: Wait for a specified time in seconds
  - Parameters:
    - `time` (number): The time to wait in seconds (capped at 10 seconds)

- **browser_close**
  - Description: Close the page
  - Parameters: None
chore: initial code commit 2025-03-21 10:58:58 -07:00			`## Playwright MCP`

chore: update readme, add workflow 2025-03-21 13:16:30 -07:00			`A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.`

			`### Key Features`

Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00			`- Fast and lightweight: Uses Playwright's accessibility tree, not pixel-based input.`
chore: update readme, add workflow 2025-03-21 13:16:30 -07:00			`- LLM-friendly: No vision models needed, operates purely on structured data.`
			`- Deterministic tool application: Avoids ambiguity common with screenshot-based approaches.`

			`### Use Cases`

			`- Web navigation and form-filling`
			`- Data extraction from structured content`
			`- Automated testing driven by LLMs`
			`- General-purpose browser interaction for agents`

chore: initial code commit 2025-03-21 10:58:58 -07:00			`### Example config`

			```js
			`{`
			`"mcpServers": {`
			`"playwright": {`
			`"command": "npx",`
			`"args": [`
docs: remove trailing , from .json config Otherwise it's highlighted as syntax error. 2025-03-24 11:36:28 -07:00			`"@playwright/mcp@latest"`
chore: initial code commit 2025-03-21 10:58:58 -07:00			`]`
			`}`
			`}`
			`}`
			```

Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00
			`#### Installation in VS Code`

docs: add VS Code install buttons (#42) 2025-03-26 15:37:51 -07:00			`Install the Playwright MCP server in VS Code using one of these buttons:`

			`<!--`
			`// Generate using?:`
			`const config = JSON.stringify({ name: 'playwright', command: 'npx', args: ["-y", "@playwright/mcp@latest"] });`
			const urlForWebsites = `vscode:mcp/install?${encodeURIComponent(config)}`;
			// Github markdown does not allow linking to `vscode:` directly, so you can use our redirect:
			const urlForGithub = `https://insiders.vscode.dev/redirect?url=${encodeURIComponent(urlForWebsites)}`;
			`-->`

			`[<img alt="Install in VS Code Insiders" src="https://img.shields.io/badge/VS_Code_Insiders-VS_Code_Insiders?style=flat-square&label=Install%20Server&color=24bfa5">](https://insiders.vscode.dev/redirect?url=vscode-insiders%3Amcp%2Finstall%3F%257B%2522name%2522%253A%2522playwright%2522%252C%2522command%2522%253A%2522npx%2522%252C%2522args%2522%253A%255B%2522-y%2522%252C%2522%2540playwright%252Fmcp%2540latest%2522%255D%257D)`

			`Alternatively, you can install the Playwright MCP server using the VS Code CLI:`
Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00
			```bash
			`# For VS Code`
			`code --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'`
make it two codeframes for easy copying 2025-03-25 18:46:19 +01:00			```
Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00
make it two codeframes for easy copying 2025-03-25 18:46:19 +01:00			```bash
Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00			`# For VS Code Insiders`
			`code-insiders --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'`
			```

			`After installation, the Playwright MCP server will be available for use with your GitHub Copilot agent in VS Code.`

chore: document user-data-dir 2025-03-26 16:03:46 -07:00			`### User data directory`

			`Playwright MCP will launch Chrome browser with the new profile, located at`

			```
			- `%USERPROFILE%\AppData\Local\ms-playwright\mcp-chrome-profile` on Windows
			- `~/Library/Caches/ms-playwright/mcp-chrome-profile` on macOS
			- `~/.cache/ms-playwright/mcp-chrome-profile` on Linux
			```

			`All the logged in information will be stored in that profile, you can delete it between sessions if you'dlike to clear the offline state.`

Add VS Code installation links (#17) First take on formatting install steps, please review and improve as needed. --------- Co-authored-by: Simon Knott <info@simonknott.de> 2025-03-25 10:36:16 -07:00
chore: update readme, add workflow 2025-03-21 13:16:30 -07:00			`### Running headless browser (Browser without GUI).`

			`This mode is useful for background or batch operations.`
chore: initial code commit 2025-03-21 10:58:58 -07:00
			```js
			`{`
			`"mcpServers": {`
			`"playwright": {`
			`"command": "npx",`
			`"args": [`
chore: add @latest to the recommended version in config 2025-03-21 13:33:24 -07:00			`"@playwright/mcp@latest",`
chore: update readme, add workflow 2025-03-21 13:16:30 -07:00			`"--headless"`
chore: initial code commit 2025-03-21 10:58:58 -07:00			`]`
			`}`
			`}`
			`}`
			```

chore: update readme, add workflow 2025-03-21 13:16:30 -07:00			`### Running headed browser on Linux w/o DISPLAY`
chore: initial code commit 2025-03-21 10:58:58 -07:00
			`When running headed browser on system w/o display or from worker processes of the IDEs,`
			`you can run Playwright in a client-server manner. You'll run the Playwright server`
			`from environment with the DISPLAY`

			```sh
			`npx playwright run-server`
			```

			And then in MCP config, add following to the `env`:

			```js
			`{`
			`"mcpServers": {`
			`"playwright": {`
			`"command": "npx",`
			`"args": [`
chore: add @latest to the recommended version in config 2025-03-21 13:33:24 -07:00			`"@playwright/mcp@latest"`
chore: initial code commit 2025-03-21 10:58:58 -07:00			`],`
			`"env": {`
			`// Use the endpoint from the output of the server above.`
			`"PLAYWRIGHT_WS_ENDPOINT": "ws://localhost:<port>/"`
			`}`
			`}`
			`}`
			`}`
			```

			`### Tool Modes`

			`The tools are available in two modes:`

			`1. Snapshot Mode (default): Uses accessibility snapshots for better performance and reliability`
			`2. Vision Mode: Uses screenshots for visual-based interactions`

			To use Vision Mode, add the `--vision` flag when starting the server:

			```js
			`{`
			`"mcpServers": {`
			`"playwright": {`
			`"command": "npx",`
			`"args": [`
chore: add @latest to the recommended version in config 2025-03-21 13:33:24 -07:00			`"@playwright/mcp@latest",`
chore: initial code commit 2025-03-21 10:58:58 -07:00			`"--vision"`
			`]`
			`}`
			`}`
			`}`
			```

			`Vision Mode works best with the computer use models that are able to interact with elements using`
			`X Y coordinate space, based on the provided screenshot.`

chore: export server for custom transports (#20) Fixes https://github.com/microsoft/playwright-mcp/issues/11 2025-03-25 14:46:39 -07:00			`### Programmatic usage with custom transports`

			```js
			`import { createServer } from '@playwright/mcp';`

			`// ...`

			`const server = createServer({`
			`launchOptions: { headless: true }`
			`});`
			`transport = new SSEServerTransport("/messages", res);`
			`server.connect(transport);`
			```

chore: initial code commit 2025-03-21 10:58:58 -07:00			`### Snapshot Mode`

			`The Playwright MCP provides a set of tools for browser automation. Here are all available tools:`

			`- browser_navigate`
			`- Description: Navigate to a URL`
			`- Parameters:`
			- `url` (string): The URL to navigate to

			`- browser_go_back`
			`- Description: Go back to the previous page`
			`- Parameters: None`

			`- browser_go_forward`
			`- Description: Go forward to the next page`
			`- Parameters: None`

			`- browser_click`
			`- Description: Perform click on a web page`
			`- Parameters:`
			- `element` (string): Human-readable element description used to obtain permission to interact with the element
			- `ref` (string): Exact target element reference from the page snapshot

			`- browser_hover`
			`- Description: Hover over element on page`
			`- Parameters:`
			- `element` (string): Human-readable element description used to obtain permission to interact with the element
			- `ref` (string): Exact target element reference from the page snapshot

			`- browser_drag`
			`- Description: Perform drag and drop between two elements`
			`- Parameters:`
			- `startElement` (string): Human-readable source element description used to obtain permission to interact with the element
			- `startRef` (string): Exact source element reference from the page snapshot
			- `endElement` (string): Human-readable target element description used to obtain permission to interact with the element
			- `endRef` (string): Exact target element reference from the page snapshot

			`- browser_type`
			`- Description: Type text into editable element`
			`- Parameters:`
			- `element` (string): Human-readable element description used to obtain permission to interact with the element
			- `ref` (string): Exact target element reference from the page snapshot
			- `text` (string): Text to type into the element
			- `submit` (boolean): Whether to submit entered text (press Enter after)

feat(tool): add `locator.selectOption()` action (#25) Implemented `locator.selectOption` 2025-03-26 13:53:56 +09:00			`- browser_select_option`
			`- Description: Select option in a dropdown`
			`- Parameters:`
			- `element` (string): Human-readable element description used to obtain permission to interact with the element
			- `ref` (string): Exact target element reference from the page snapshot
			- `values` (array): Array of values to select in the dropdown.

chore: initial code commit 2025-03-21 10:58:58 -07:00			`- browser_press_key`
			`- Description: Press a key on the keyboard`
			`- Parameters:`
			- `key` (string): Name of the key to press or a character to generate, such as `ArrowLeft` or `a`

			`- browser_snapshot`
			`- Description: Capture accessibility snapshot of the current page (better than screenshot)`
			`- Parameters: None`

			`- browser_save_as_pdf`
			`- Description: Save page as PDF`
			`- Parameters: None`

chore: allow taking pixel screenshots in snapshot mode (#44) Ref: https://github.com/microsoft/playwright-mcp/issues/39 2025-03-27 07:27:34 -07:00			`- browser_take_screenshot`
			`- Description: Capture screenshot of the page`
			`- Parameters:`
			- `raw` (string): Optionally returns lossless PNG screenshot. JPEG by default.

chore: initial code commit 2025-03-21 10:58:58 -07:00			`- browser_wait`
			`- Description: Wait for a specified time in seconds`
			`- Parameters:`
			- `time` (number): The time to wait in seconds (capped at 10 seconds)

			`- browser_close`
			`- Description: Close the page`
			`- Parameters: None`


			`### Vision Mode`

			`Vision Mode provides tools for visual-based interactions using screenshots. Here are all available tools:`

			`- browser_navigate`
			`- Description: Navigate to a URL`
			`- Parameters:`
			- `url` (string): The URL to navigate to

			`- browser_go_back`
			`- Description: Go back to the previous page`
			`- Parameters: None`

			`- browser_go_forward`
			`- Description: Go forward to the next page`
			`- Parameters: None`

			`- browser_screenshot`
			`- Description: Capture screenshot of the current page`
			`- Parameters: None`

			`- browser_move_mouse`
			`- Description: Move mouse to specified coordinates`
			`- Parameters:`
			- `x` (number): X coordinate
			- `y` (number): Y coordinate

			`- browser_click`
			`- Description: Click at specified coordinates`
			`- Parameters:`
			- `x` (number): X coordinate to click at
			- `y` (number): Y coordinate to click at

			`- browser_drag`
			`- Description: Perform drag and drop operation`
			`- Parameters:`
			- `startX` (number): Start X coordinate
			- `startY` (number): Start Y coordinate
			- `endX` (number): End X coordinate
			- `endY` (number): End Y coordinate

			`- browser_type`
			`- Description: Type text at specified coordinates`
			`- Parameters:`
			- `text` (string): Text to type
			- `submit` (boolean): Whether to submit entered text (press Enter after)

			`- browser_press_key`
			`- Description: Press a key on the keyboard`
			`- Parameters:`
			- `key` (string): Name of the key to press or a character to generate, such as `ArrowLeft` or `a`

			`- browser_save_as_pdf`
			`- Description: Save page as PDF`
			`- Parameters: None`

			`- browser_wait`
			`- Description: Wait for a specified time in seconds`
			`- Parameters:`
			- `time` (number): The time to wait in seconds (capped at 10 seconds)

			`- browser_close`
			`- Description: Close the page`
			`- Parameters: None`