AgentSkillsCN

browser-fetch

将浏览器自动化任务委托给轻量级的子智能体(Haiku),以减少上下文消耗。同时通过剪贴工具提供网页剪辑功能(HTML→Markdown)。

SKILL.md
--- frontmatter
name: browser-fetch
description: Delegate browser automation to a lightweight subagent (Haiku) to reduce context consumption. Also provides web clipping (HTML→Markdown) via clipper.

browser-fetch

Delegate browser automation to a cost-effective subagent, preserving the main agent's context window.

Why This Skill?

  • Problem: agent-browser snapshot can consume 10,000+ tokens on complex pages
  • Solution: Haiku subagent processes snapshots and returns only summaries (~100 tokens)
  • Result: 90%+ context reduction while maintaining full browser capabilities

Prerequisites

Install agent-browser

bash
npm install -g agent-browser
agent-browser install

Install clipper

bash
npm install -g @philschmid/clipper

Uses Mozilla Readability + Turndown internally — same stack as Obsidian Web Clipper.

Windows Workaround

On Windows (Git Bash), the daemon may not auto-start. Run manually:

bash
node "$(npm root -g)/agent-browser/dist/daemon.js" &
sleep 3

Basic Flow

code
Main Agent                        Haiku Subagent
    │                                   │
    │  Task: "Open URL, summarize"      │
    ├──────────────────────────────────►│
    │                                   │  1. agent-browser open
    │                                   │  2. snapshot → save to file
    │                                   │  3. build summary
    │   "Page summary + key refs"       │
    │◄──────────────────────────────────┤
    │                                   │
    │  Task: "Click @e5, summarize"     │
    ├──────────────────────────────────►│
    │                                  ...

Snapshot Options

OptionUse Case
-iInteractive elements only (default)
-i -cCompact output (recommended for complex pages)
-i -d 2Limit depth (deeply nested pages)
-i -s "main"Scope to CSS selector

Summary Format (Critical)

Subagent must return structured summaries, not raw element lists.

Good Example

code
Product listing page (127 items displayed)

Structure:
- Filters: category (@e12), price range (@e18), search (@e24)
- Sort: date (@e30), popularity (@e31)

Sample items:
- "Product A" $29.99 (@e45)
- "Product B" $49.99 (@e52)
- "Product C" $19.99 (@e58)

Ready for instructions.
(debug: 001.txt)

Bad Example

code
- @e1 Logo
- @e2 Login button
- @e3 Menu
- @e4 Search
- @e5 Footer

→ No context. Main agent cannot make informed decisions.

Invocation Template

Opening a Page

code
Task(
  model: haiku,
  subagent_type: general-purpose,
  prompt: """
  Use agent-browser to open {URL}.

  Steps:
  1. agent-browser open {URL}
  2. agent-browser snapshot -i -c > scratchpad/browser-session/001.txt
  3. Append to index.log: 001 | {time} | {URL} | {summary}

  Return ONLY a structured summary:
  - Page type and overview (e.g., "Login page", "Product list with 50 items")
  - Key interactive elements with @refs
  - Do NOT include raw snapshot data

  Format:
  {Page description}

  Structure:
  - {feature}: {description} (@ref)

  Key elements:
  - {element description} (@ref)

  Ready for instructions.
  (debug: 001.txt)
  """
)

Performing Actions

code
Task(
  model: haiku,
  subagent_type: general-purpose,
  prompt: """
  Use agent-browser:
  1. agent-browser fill @e1 "value"
  2. agent-browser click @e3
  3. agent-browser snapshot -i -c > scratchpad/browser-session/002.txt
  4. Update index.log

  Return structured summary only.
  """
)

Complex Pages

If snapshot exceeds limits, use additional constraints:

code
agent-browser snapshot -i -c -d 2 > file.txt   # Depth limit
agent-browser snapshot -i -c -s "#content"     # Scope to selector

Rules

Main Agent

  • Trust subagent summaries
  • Do NOT read raw files (scratchpad/browser-session/*.txt) unless debugging
  • Raw data is for troubleshooting only

Subagent

  • Always save snapshots to files (never return raw data)
  • Append to index.log for history
  • Use --session flag to maintain browser state
  • Return concise, structured summaries

File Structure

code
scratchpad/browser-session/
├── index.log    # History (debug reference only)
├── 001.txt      # Raw snapshot data
├── 002.txt
└── ...

index.log Format

code
# browser-session index (debug only - do not read unless troubleshooting)

001 | 14:23:45 | https://example.com/login     | Login form
002 | 14:24:12 | https://example.com/dashboard | Dashboard with 5 widgets

Ending Session

code
Task(
  model: haiku,
  subagent_type: general-purpose,
  prompt: "Run: agent-browser close"
)

Web Clip (HTML → Markdown)

Convert web pages to clean Markdown using clipper. No subagent needed — runs directly.

Basic Usage

bash
clipper clip -u "https://example.com/article" -o scratchpad/browser-session/clip.md

This is the preferred method. Quality is equivalent to Obsidian Web Clipper (same Readability + Turndown stack).

Output Formats

FlagFormat
-f mdMarkdown (default)
-f jsonJSON (title, content, url)

Batch Clipping

bash
clipper crawl -u "https://docs.example.com" -g "https://docs.example.com/**/*" -o dataset.jsonl

When to Use What

ScenarioTool
Read an article → Markdownclipper clip -u
Interact with a page (click, fill, navigate)agent-browser + Haiku subagent