NEW Browse AI tools across categories — updated daily. See what's new →

Remote Browser

Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes...

Version1.0.0
LicenseMIT
Token count~2,417
UpdatedJun 5, 2026

Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes cloud session management for running autonomous browser agents in parallel, with task monitoring and structured output options Provides local dev server tunneling via Cloudflare to expose ports from the remote machine to the cloud browser Maintains session state across commands, enabling...

Install

Quick install

via npx skills · works with 57+ agents
npx skills add https://github.com/browser-use/browser-use/tree/HEAD/skills/remote-browser
Or pick agent:
npx skills add browser-use/browser-use --skill remote-browser --agent claude-code
npx skills add browser-use/browser-use --skill remote-browser --agent cursor
npx skills add browser-use/browser-use --skill remote-browser --agent codex
npx skills add browser-use/browser-use --skill remote-browser --agent opencode
npx skills add browser-use/browser-use --skill remote-browser --agent github-copilot
npx skills add browser-use/browser-use --skill remote-browser --agent windsurf
More install options

Shorthand — useful for multi-skill repos:

npx skills add browser-use/browser-use --skill remote-browser

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/browser-use/browser-use.git
cp -r browser-use/skills/remote-browser ~/.claude/skills/
How to use: Once installed, ask your agent to "use the remote-browser skill" or describe what you want (e.g. "Cloud browser automation for sandboxed agents without local GUI access. Supports"). Requires Node.js 18+.

remote-browser

Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes cloud session management for running autonomous browser agents in parallel, with task monitoring and structured output options Provides local dev server tunneling via Cloudflare to expose ports from the remote machine to the cloud browser Maintains session state across commands, enabling...

remote-browserby browser-use

Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes cloud session management for running autonomous browser agents in parallel, with task monitoring and structured output options Provides local dev server tunneling via Cloudflare to expose ports from the remote machine to the cloud browser Maintains session state across commands, enabling...

npx skills add https://github.com/browser-use/browser-use --skill remote-browserDownload ZIPGitHub

Browser Automation for Sandboxed Agents

This skill is for agents running on sandboxed remote machines (cloud VMs, CI, coding agents) that need to control a headless browser.

Prerequisites

`browser-use doctor # Verify installation
`

For setup details, see https://github.com/browser-use/browser-use/blob/main/browser_use/skill_cli/README.md

Core Workflow

  • Navigate: browser-use open <url> — starts headless browser if needed
  • Inspect: browser-use state — returns clickable elements with indices
  • Interact: use indices from state (browser-use click 5, browser-use input 3 "text")
  • Verify: browser-use state or browser-use screenshot to confirm
  • Repeat: browser stays open between commands
  • Cleanup: browser-use close when done

Browser Modes

`browser-use open <url> # Default: headless Chromium
browser-use cloud connect # Provision cloud browser and connect
browser-use --connect open <url> # Auto-discover running Chrome via CDP
browser-use --cdp-url ws://localhost:9222/... open <url> # Connect via CDP URL
`

Commands

`# Navigation
browser-use open <url> # Navigate to URL
browser-use back # Go back in history
browser-use scroll down # Scroll down (--amount N for pixels)
browser-use scroll up # Scroll up
browser-use tab list # List all tabs with lock status
browser-use tab new [url] # Open a new tab (blank or with URL)
browser-use tab switch <index> # Switch to tab by index
browser-use tab close <index> [index...] # Close one or more tabs

# Page State — always run state first to get element indices
browser-use state # URL, title, clickable elements with indices
browser-use screenshot [path.png] # Screenshot (base64 if no path, --full for full page)

# Interactions — use indices from state
browser-use click <index> # Click element by index
browser-use click <x> <y> # Click at pixel coordinates
browser-use type "text" # Type into focused element
browser-use input <index> "text" # Click element, then type
browser-use keys "Enter" # Send keyboard keys (also "Control+a", etc.)
browser-use select <index> "option" # Select dropdown option
browser-use upload <index> <path> # Upload file to file input
browser-use hover <index> # Hover over element
browser-use dblclick <index> # Double-click element
browser-use rightclick <index> # Right-click element

# Data Extraction
browser-use eval "js code" # Execute JavaScript, return result
browser-use get title # Page title
browser-use get html [--selector "h1"] # Page HTML (or scoped to selector)
browser-use get text <index> # Element text content
browser-use get value <index> # Input/textarea value
browser-use get attributes <index> # Element attributes
browser-use get bbox <index> # Bounding box (x, y, width, height)

# Wait
browser-use wait selector "css" # Wait for element (--state visible|hidden|attached|detached, --timeout ms)
browser-use wait text "text" # Wait for text to appear

# Cookies
browser-use cookies get [--url <url>] # Get cookies (optionally filtered)
browser-use cookies set <name> <value> # Set cookie (--domain, --secure, --http-only, --same-site, --expires)
browser-use cookies clear [--url <url>] # Clear cookies
browser-use cookies export <file> # Export to JSON
browser-use cookies import <file> # Import from JSON

# Python — persistent session with browser access
browser-use python "code" # Execute Python (variables persist across calls)
browser-use python --file script.py # Run file
browser-use python --vars # Show defined variables
browser-use python --reset # Clear namespace

# Session
browser-use close # Close browser and stop daemon
browser-use sessions # List active sessions
browser-use close --all # Close all sessions
`

The Python browser object provides: browser.url, browser.title, browser.html, browser.goto(url), browser.back(), browser.click(index), browser.type(text), browser.input(index, text), browser.keys(keys), browser.upload(index, path), browser.screenshot(path), browser.scroll(direction, amount), browser.wait(seconds).

Tunnels

Expose local dev servers to the browser via Cloudflare tunnels.

`browser-use tunnel <port> # Start tunnel (idempotent)
browser-use tunnel list # Show active tunnels
browser-use tunnel stop <port> # Stop tunnel
browser-use tunnel stop --all # Stop all tunnels
`

Command Chaining

Commands can be chained with &&. The browser persists via the daemon, so chaining is safe and efficient.

`browser-use open https://example.com && browser-use state
browser-use input 5 "[email&#160;protected]" && browser-use input 6 "password" && browser-use click 7
`

Chain when you don't need intermediate output. Run separately when you need to parse state to discover indices first.

Common Workflows

Exposing Local Dev Servers

`python -m http.server 3000 & # Start dev server
browser-use tunnel 3000 # → https://abc.trycloudflare.com
browser-use open https://abc.trycloudflare.com # Browse the tunnel
`

Tunnels are independent of browser sessions and persist across browser-use close.

Multi-Agent (--connect mode)

Multiple agents can share one browser via --connect. Each agent gets its own tab — other agents can't interfere.

Setup: Register once, then pass the index with every --connect command:

`INDEX=$(browser-use register) # → prints "1"
browser-use --connect $INDEX open <url> # Navigate in agent's own tab
browser-use --connect $INDEX state # Get state from agent's tab
browser-use --connect $INDEX click <element> # Click in agent's tab
`
  • Tab locking: When an agent mutates a tab (click, type, navigate), that tab is locked to it. Other agents get an error if they try to mutate the same tab.
  • Read-only access: state, screenshot, get, and wait commands work on any tab regardless of locks.
  • Agent sessions expire after 5 minutes of inactivity. Run browser-use register again to get a new index.

Global Options

OptionDescription--headedShow browser window--connectAuto-discover running Chrome via CDP--cdp-url <url>Connect via CDP URL (http:// or ws://)--session NAMETarget a named session (default: "default")--jsonOutput as JSON

Tips

  • Always run state first to see available elements and their indices
  • Sessions persist — browser stays open between commands until you close it
  • Tunnels are independent — they persist across browser-use close
  • tunnel is idempotent — calling again for the same port returns the existing URL

Troubleshooting

  • Browser won't start? browser-use close then retry. Run browser-use doctor to check.
  • Element not found? browser-use scroll down then browser-use state
  • Tunnel not working? which cloudflared to check, browser-use tunnel list to see active tunnels

Cleanup

`browser-use close # Close browser session
browser-use tunnel stop --all # Stop tunnels (if any)
`

More skills from browser-use

browser-useby browser-useAutomates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, or extract information from web pages.cdpby browser-useDrive Chrome via the DevTools Protocol from JavaScript. Run JS snippets through the browser-harness-js CLI — it auto-spawns a long-lived bun HTTP server…browserby browser-useDirect browser control via CDP. Use when the user wants to automate, scrape, test, or interact with web pages. Connects to the user's already-running Chrome.browser-harnessby browser-useDirect browser control via CDP. Use when the user wants to automate, scrape, test, or interact with web pages. Connects to the user's already-running Chrome.cloudby browser-useReference docs for the Cloud REST API, SDKs, and integration patterns. Read the relevant file based on what the user needs.open-sourceby browser-useReference docs for writing Python code against the browser-use library. Read the relevant file based on what the user needs.manim-videoby browser-useProduction pipeline for mathematical and technical animations using Manim Community Edition. Creates 3Blue1Brown-style explainer videos, algorithm…video-useby browser-useEdit any video by conversation. Transcribe, cut, color grade, generate overlay animations, burn subtitles — for talking heads, montages, tutorials, travel,…

---

Source: https://github.com/browser-use/browser-use/tree/HEAD/skills/remote-browser
Author: browser-use
Discovered via: mcpservers.org

SKILL.md source

---
name: remote-browser
description: Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes...
---

# remote-browser

Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes cloud session management for running autonomous browser agents in parallel, with task monitoring and structured output options Provides local dev server tunneling via Cloudflare to expose ports from the remote machine to the cloud browser Maintains session state across commands, enabling...

# remote-browserby browser-use
Cloud browser automation for sandboxed agents without local GUI access. Supports navigation, page inspection, interactions, JavaScript execution, and cookie management via 30+ CLI commands Includes cloud session management for running autonomous browser agents in parallel, with task monitoring and structured output options Provides local dev server tunneling via Cloudflare to expose ports from the remote machine to the cloud browser Maintains session state across commands, enabling...

`npx skills add https://github.com/browser-use/browser-use --skill remote-browser`Download ZIPGitHub

## Browser Automation for Sandboxed Agents

This skill is for agents running on sandboxed remote machines (cloud VMs, CI, coding agents) that need to control a headless browser.

## Prerequisites

```
`browser-use doctor # Verify installation
`
```

For setup details, see https://github.com/browser-use/browser-use/blob/main/browser_use/skill_cli/README.md

## Core Workflow

* Navigate: `browser-use open <url>` — starts headless browser if needed

* Inspect: `browser-use state` — returns clickable elements with indices

* Interact: use indices from state (`browser-use click 5`, `browser-use input 3 "text"`)

* Verify: `browser-use state` or `browser-use screenshot` to confirm

* Repeat: browser stays open between commands

* Cleanup: `browser-use close` when done

## Browser Modes

```
`browser-use open <url> # Default: headless Chromium
browser-use cloud connect # Provision cloud browser and connect
browser-use --connect open <url> # Auto-discover running Chrome via CDP
browser-use --cdp-url ws://localhost:9222/... open <url> # Connect via CDP URL
`
```

## Commands

```
`# Navigation
browser-use open <url> # Navigate to URL
browser-use back # Go back in history
browser-use scroll down # Scroll down (--amount N for pixels)
browser-use scroll up # Scroll up
browser-use tab list # List all tabs with lock status
browser-use tab new [url] # Open a new tab (blank or with URL)
browser-use tab switch <index> # Switch to tab by index
browser-use tab close <index> [index...] # Close one or more tabs

# Page State — always run state first to get element indices
browser-use state # URL, title, clickable elements with indices
browser-use screenshot [path.png] # Screenshot (base64 if no path, --full for full page)

# Interactions — use indices from state
browser-use click <index> # Click element by index
browser-use click <x> <y> # Click at pixel coordinates
browser-use type "text" # Type into focused element
browser-use input <index> "text" # Click element, then type
browser-use keys "Enter" # Send keyboard keys (also "Control+a", etc.)
browser-use select <index> "option" # Select dropdown option
browser-use upload <index> <path> # Upload file to file input
browser-use hover <index> # Hover over element
browser-use dblclick <index> # Double-click element
browser-use rightclick <index> # Right-click element

# Data Extraction
browser-use eval "js code" # Execute JavaScript, return result
browser-use get title # Page title
browser-use get html [--selector "h1"] # Page HTML (or scoped to selector)
browser-use get text <index> # Element text content
browser-use get value <index> # Input/textarea value
browser-use get attributes <index> # Element attributes
browser-use get bbox <index> # Bounding box (x, y, width, height)

# Wait
browser-use wait selector "css" # Wait for element (--state visible|hidden|attached|detached, --timeout ms)
browser-use wait text "text" # Wait for text to appear

# Cookies
browser-use cookies get [--url <url>] # Get cookies (optionally filtered)
browser-use cookies set <name> <value> # Set cookie (--domain, --secure, --http-only, --same-site, --expires)
browser-use cookies clear [--url <url>] # Clear cookies
browser-use cookies export <file> # Export to JSON
browser-use cookies import <file> # Import from JSON

# Python — persistent session with browser access
browser-use python "code" # Execute Python (variables persist across calls)
browser-use python --file script.py # Run file
browser-use python --vars # Show defined variables
browser-use python --reset # Clear namespace

# Session
browser-use close # Close browser and stop daemon
browser-use sessions # List active sessions
browser-use close --all # Close all sessions
`
```

The Python `browser` object provides: `browser.url`, `browser.title`, `browser.html`, `browser.goto(url)`, `browser.back()`, `browser.click(index)`, `browser.type(text)`, `browser.input(index, text)`, `browser.keys(keys)`, `browser.upload(index, path)`, `browser.screenshot(path)`, `browser.scroll(direction, amount)`, `browser.wait(seconds)`.

## Tunnels

Expose local dev servers to the browser via Cloudflare tunnels.

```
`browser-use tunnel <port> # Start tunnel (idempotent)
browser-use tunnel list # Show active tunnels
browser-use tunnel stop <port> # Stop tunnel
browser-use tunnel stop --all # Stop all tunnels
`
```

## Command Chaining

Commands can be chained with `&&`. The browser persists via the daemon, so chaining is safe and efficient.

```
`browser-use open https://example.com && browser-use state
browser-use input 5 "[email&#160;protected]" && browser-use input 6 "password" && browser-use click 7
`
```

Chain when you don't need intermediate output. Run separately when you need to parse `state` to discover indices first.

## Common Workflows

### Exposing Local Dev Servers

```
`python -m http.server 3000 & # Start dev server
browser-use tunnel 3000 # → https://abc.trycloudflare.com
browser-use open https://abc.trycloudflare.com # Browse the tunnel
`
```

Tunnels are independent of browser sessions and persist across `browser-use close`.

## Multi-Agent (--connect mode)

Multiple agents can share one browser via `--connect`. Each agent gets its own tab — other agents can't interfere.

Setup: Register once, then pass the index with every `--connect` command:

```
`INDEX=$(browser-use register) # → prints "1"
browser-use --connect $INDEX open <url> # Navigate in agent's own tab
browser-use --connect $INDEX state # Get state from agent's tab
browser-use --connect $INDEX click <element> # Click in agent's tab
`
```

* Tab locking: When an agent mutates a tab (click, type, navigate), that tab is locked to it. Other agents get an error if they try to mutate the same tab.

* Read-only access: `state`, `screenshot`, `get`, and `wait` commands work on any tab regardless of locks.

* Agent sessions expire after 5 minutes of inactivity. Run `browser-use register` again to get a new index.

## Global Options

OptionDescription`--headed`Show browser window`--connect`Auto-discover running Chrome via CDP`--cdp-url <url>`Connect via CDP URL (`http://` or `ws://`)`--session NAME`Target a named session (default: "default")`--json`Output as JSON

## Tips

* Always run `state` first to see available elements and their indices

* Sessions persist — browser stays open between commands until you close it

* Tunnels are independent — they persist across `browser-use close`

* `tunnel` is idempotent — calling again for the same port returns the existing URL

## Troubleshooting

* Browser won't start? `browser-use close` then retry. Run `browser-use doctor` to check.

* Element not found? `browser-use scroll down` then `browser-use state`

* Tunnel not working? `which cloudflared` to check, `browser-use tunnel list` to see active tunnels

## Cleanup

```
`browser-use close # Close browser session
browser-use tunnel stop --all # Stop tunnels (if any)
`
```

## More skills from browser-use
browser-useby browser-useAutomates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, or extract information from web pages.cdpby browser-useDrive Chrome via the DevTools Protocol from JavaScript. Run JS snippets through the `browser-harness-js` CLI — it auto-spawns a long-lived bun HTTP server…browserby browser-useDirect browser control via CDP. Use when the user wants to automate, scrape, test, or interact with web pages. Connects to the user's already-running Chrome.browser-harnessby browser-useDirect browser control via CDP. Use when the user wants to automate, scrape, test, or interact with web pages. Connects to the user's already-running Chrome.cloudby browser-useReference docs for the Cloud REST API, SDKs, and integration patterns. Read the relevant file based on what the user needs.open-sourceby browser-useReference docs for writing Python code against the browser-use library. Read the relevant file based on what the user needs.manim-videoby browser-useProduction pipeline for mathematical and technical animations using Manim Community Edition. Creates 3Blue1Brown-style explainer videos, algorithm…video-useby browser-useEdit any video by conversation. Transcribe, cut, color grade, generate overlay animations, burn subtitles — for talking heads, montages, tutorials, travel,…

---

**Source**: https://github.com/browser-use/browser-use/tree/HEAD/skills/remote-browser
**Author**: browser-use
**Discovered via**: mcpservers.org

Related skills 6

caveman

★ Featured

Ultra-compressed communication mode. Cuts token usage ~75% by speaking like caveman while keeping full technical accuracy. Supports intensity levels: lite, full (default), ultra, wenyan-lite, wenyan-full, wenyan-ultra. Use when user says "caveman mode", "talk like caveman", "use caveman", "less tokens", "be brief", or invokes /caveman. Also auto-triggers when token efficiency is requested.

juliusbrussee 167k
Development

secure-linux-web-hosting

★ Featured

Use when setting up, hardening, or reviewing a cloud server for self-hosting, including DNS, SSH, firewalls, Nginx, static-site hosting, reverse-proxying an app, HTTPS with Let's Encrypt or ACME clients, safe HTTP-to-HTTPS redirects, or optional post-launch network tuning such as BBR.

xixu-me 155k
Development

readme-i18n

★ Featured

Use when the user wants to translate a repository README, make a repo multilingual, localize docs, add a language switcher, internationalize the README, or update localized README variants in a GitHub-style repository.

xixu-me 155k
Development

lark-shared

★ Featured

Use when first setting up lark-cli, running auth login, switching user/bot identity (--as), handling permission denied or scope errors, needing to update lark-cli, or seeing _notice in JSON output.

larksuite 155k
Development

improve-codebase-architecture

★ Featured

Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable.

mattpocock 151k
Development

paper-context-resolver

★ Featured

Optional RigorPilot helper for README-first deep learning repo reproduction. Use only when the README and repository files leave a narrow reproduction-critical gap and the task is to resolve a specific paper detail such as dataset split, preprocessing, evaluation protocol, checkpoint mapping, or runtime assumption from primary paper sources while recording conflicts. Do not use for general paper summary, repo scanning, environment setup, command execution, title-only paper lookup, or replacin...

lllllllama 127k
Development