AgentSkillsCN

browser-use-2

通过Browser Use API实现云端浏览器自动化。当您需要在不依赖本地浏览器的情况下,借助AI驱动的网页浏览、数据抓取、表单填充,或完成多步骤的网络任务时,此工具将助您一臂之力。适用于“浏览器使用”、“云端浏览器”、“网站爬取”、“自动化网络任务”,或当本地浏览器无法使用或不适合时。

SKILL.md
--- frontmatter
name: browser-use-2
version: 1.0.0
description: Cloud browser automation via Browser Use API. Use when you need AI-driven web browsing, scraping, form filling, or multi-step web tasks without local browser control. Triggers on "browser use", "cloud browser", "scrape website", "automate web task", or when local browser isn't available/suitable.
metadata: {"clawdbot":{"emoji":"🌐","requires":{"env":["BROWSER_USE_API_KEY"]}}}

Browser Use

Cloud-based AI browser automation. Send a task in plain English, get structured results.

Quick Start

bash
# Submit task
curl -s -X POST https://api.browser-use.com/api/v2/tasks \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"task": "Go to example.com and extract the main heading"}'

# Poll for result (replace TASK_ID)
curl -s "https://api.browser-use.com/api/v2/tasks/TASK_ID" \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"

Helper Script

Use scripts/browser-use.sh for simpler execution:

bash
# Run task and wait for result
./scripts/browser-use.sh "Go to hacker news and get the top 3 stories"

# Just submit (don't wait)
./scripts/browser-use.sh --no-wait "Search Google for AI news"

API Reference

Create Task

code
POST https://api.browser-use.com/api/v2/tasks

Body:

json
{
  "task": "Plain English description of what to do",
  "llm": "gemini-3-flash-preview"  // optional, default is fast model
}

Response:

json
{
  "id": "task-uuid",
  "sessionId": "session-uuid"
}

Get Task Status

code
GET https://api.browser-use.com/api/v2/tasks/{taskId}

Response fields:

  • status: pending | started | finished | failed
  • output: Result text when finished
  • steps: Array of actions taken (with screenshots)
  • cost: Cost in dollars (e.g., "0.02")
  • isSuccess: Boolean result

Stop Task

code
POST https://api.browser-use.com/api/v2/tasks/{taskId}/stop

Pricing

~$0.01-0.05 per task depending on complexity. Check balance:

bash
curl -s https://api.browser-use.com/api/v2/credits \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"

When to Use

  • Complex multi-step web workflows
  • Sites that block simple scraping
  • Form filling and submissions
  • When you need screenshots of steps
  • When local browser control isn't available

When NOT to Use

  • Simple page fetches (use web_fetch instead)
  • When you have local browser access (use browser tool)
  • Rapid/high-volume scraping (use Code Use or local scraping)