AgentSkillsCN

web-browser

通过点击按钮、填写表单、导航链接等操作与网页进行交互。该技能利用 Chrome DevTools 协议(CDP)对 Google Chrome 进行远程控制。当代理需要浏览网页时,便可借助此技能完成相关操作。

SKILL.md
--- frontmatter
name: web-browser
description: "Allows to interact with web pages by performing actions such as clicking buttons, filling out forms, and navigating links. It works by remote controlling Google Chrome using the Chrome DevTools Protocol (CDP). When an agent needs to browse the web, it can use this skill to do so."
license: Stolen from Mario

Web Browser Skill

Minimal CDP tools for collaborative site exploration.

Note: This skill is configured for WSL2 on Ubuntu using Windows Chrome.

Start Chrome

bash
./scripts/start.js              # Fresh profile
./scripts/start.js --profile    # Copy your Windows Chrome profile (cookies, logins)

Start Windows Chrome on :9222 with remote debugging. The script automatically detects your Windows username and Chrome installation.

Navigate

bash
./scripts/nav.js https://example.com
./scripts/nav.js https://example.com --new

Navigate current tab or open new tab.

Evaluate JavaScript

bash
./scripts/eval.js 'document.title'
./scripts/eval.js 'document.querySelectorAll("a").length'
./scripts/eval.js 'JSON.stringify(Array.from(document.querySelectorAll("a")).map(a => ({ text: a.textContent.trim(), href: a.href })).filter(link => !link.href.startsWith("https://")))'

Execute JavaScript in active tab (async context). Be careful with string escaping, best to use single quotes.

Screenshot

bash
./scripts/screenshot.js

Screenshot current viewport, returns temp file path

Pick Elements

bash
./scripts/pick.js "Click the submit button"

Interactive element picker. Click to select, Cmd/Ctrl+Click for multi-select, Enter to finish.

Dismiss Cookie Dialogs

bash
./scripts/dismiss-cookies.js          # Accept cookies
./scripts/dismiss-cookies.js --reject # Reject cookies (where possible)

Automatically dismisses EU cookie consent dialogs.

Run after navigating to a page:

bash
./scripts/nav.js https://example.com && ./scripts/dismiss-cookies.js

Background Logging (Console + Errors + Network)

Automatically started by start.js and writes JSONL logs to:

code
~/.cache/agent-web/logs/YYYY-MM-DD/<targetId>.jsonl

Manually start:

bash
./scripts/watch.js

Tail latest log:

bash
./scripts/logs-tail.js           # dump current log and exit
./scripts/logs-tail.js --follow  # keep following

Summarize network responses:

bash
./scripts/net-summary.js