Web Browser Skill

Build, preview, and verify frontend code with real browser interaction. Inspired by ios-simulator-skill's semantic navigation philosophy, adapted for web browsers.

Quick Start

bash

# 1. Launch dev server + browser for current project
python scripts/dev_server.py

# 2. Capture what's on screen
python scripts/screen_capture.py --output screenshot.png

# 3. Analyze page structure
python scripts/page_analyzer.py

# 4. Research a design
python scripts/web_researcher.py --url "https://example.com" --capture

Core Workflow

Building Frontend

•Create project from assets/starter-template/ if none exists
•Write code in React/TypeScript (or HTML/CSS if simpler)
•Launch with dev_server.py to see live preview
•Capture screenshot to verify visual output
•Iterate based on user feedback

Design Research

•User provides inspiration URL or description
•Use web_researcher.py to capture reference designs
•Analyze layout, colors, typography
•Adapt patterns to user's project

Scripts Reference

Script	Purpose	Default Output
`dev_server.py`	Launch Vite dev server + open browser	1 line status
`screen_capture.py`	Screenshot current page	File path
`page_analyzer.py`	Semantic element analysis	5-10 lines
`web_researcher.py`	Browse URL + capture	Screenshot + summary
`visual_diff.py`	Compare two screenshots	Diff highlights

All scripts support --verbose for details, --json for parsing.

Token Efficiency

Default output is minimal:

code

✓ Dev server running at http://localhost:5173
✓ Screenshot saved: ./screenshots/page-2024-01-15-143022.png

Use --verbose only when debugging.

Working with Beginners

When user doesn't understand code:

•Show screenshots of changes
•Use plain language: "I moved the button to the right"
•Offer correction phrases: "Say 'make it bigger' or 'change the color'"
•Reference references/MINIMUM_KNOWLEDGE.md for teaching moments

Playwright MCP Integration (Recommended)

Prefer Playwright MCP for direct browser control - it's faster and uses less context than writing scripts:

bash

# Install once:
claude mcp add playwright -- npx @playwright/mcp@latest

Playwright MCP handles:

•Direct page interaction (clicks, typing, navigation)
•Form testing and validation
•Screenshots and visual verification
•Multi-page workflows

Use this skill's scripts when:

•MCP is unavailable or not configured
•You need dev server lifecycle management
•Complex custom automation logic is required

Project Structure

For new projects, copy assets/starter-template/: