AgentSkillsCN

headless-web-viewer

PDF 文档离线读写与表单处理:提取文本/表格、合并拆分、生成 PDF、填写表单。适用于“本地处理/读取/生成 PDF 文件”(依赖安装可能需要网络连接)。

SKILL.md
--- frontmatter
name: headless-web-viewer
version: 0.1.0
description: Render and view webpages using a headless browser (Playwright) to fetch JS-rendered HTML, extract visible text, and optionally save full-page screenshots. Use when a user asks to “无头浏览器打开/查看网页”, needs the rendered DOM instead of raw curl HTML, or wants a screenshot of a page.

Headless Web Viewer

Run

Render + save artifacts

bash
node agent/skills/headless-web-viewer/scripts/render_url_playwright.mjs '<URL>' \
  --out-html /tmp/page.html \
  --out-text /tmp/page.txt \
  --out-screenshot /tmp/page.png

Print to stdout (pipe-friendly)

bash
node agent/skills/headless-web-viewer/scripts/render_url_playwright.mjs '<URL>' --print text

Dependencies

This skill requires Playwright in the environment where it runs.

Option A (recommended for global use, no browser download)

Install Playwright Core and use system Chrome:

bash
npm i -D playwright-core

Run with --channel chrome.

Option B (bundled browsers)

bash
npm i -D playwright
npx playwright install

Do not auto-install dependencies unless the user asks.

Tips

  • If a page hangs on networkidle, retry with --wait-until domcontentloaded.
  • If a page blocks headless Chromium, try setting --user-agent to a realistic UA.