fetch-content

Name: fetch-content
Rating: 87
Author: dcuplover

抓取 URL 网页内容并转为 Markdown 存储。

执行步骤

注意：下面的 <SKILL_DIR> 指本 SKILL.md 所在的目录，请根据实际路径替换。

•

运行抓取脚本（默认处理当天）：

bash

python <SKILL_DIR>/scripts/fetch_page.py

或指定日期：

bash

python <SKILL_DIR>/scripts/fetch_page.py --date 2026-02-14

每个成功抓取的 URL 生成一个 data/raw-docs/{hash}.md 文件，格式：

markdown

---
source_url: https://example.com/article
fetch_time: 2026-02-14T10:30:00
hash: a1b2c3d4e5f67890
title: 文章标题
---

（网页正文内容，已转为 Markdown）