Name: quiz-crawler
Rating: 76
Author: deepInTheData

Workflow (reproducible pipeline)

This skill builds a swipe database: it stores the raw crawl artifacts and publishes a clean Notion page.

Run:

Popup handling:

•Auto-accepts JS dialogs (alert/confirm/prompt).
•Attempts to click modal CTAs like Continue/Submit/OK when a popup blocks progress.
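
The dialog handling described above could look roughly like the sketch below. It assumes a browser-automation page object that emits a `dialog` event whose payload exposes `type()`, `message()` and an async `accept()`, which is the shape Playwright and Puppeteer pages use. The source does not say which library the skill relies on, and `autoAcceptDialogs` is a hypothetical name.

```javascript
// Hypothetical helper: auto-accept alert/confirm/prompt dialogs.
// Assumes `page` emits "dialog" events with type(), message() and accept()
// (the Playwright/Puppeteer shape); the skill's real code may differ.
function autoAcceptDialogs(page, log = []) {
  page.on("dialog", async (dialog) => {
    // Record what was accepted so the crawl log shows why progress resumed.
    log.push({ type: dialog.type(), message: dialog.message() });
    await dialog.accept();
  });
  return log;
}
```

Registering the listener once, before navigation, means a dialog raised on any later step is accepted without blocking the crawl. Clicking modal CTAs (Continue/Submit/OK) is a separate concern and is not covered here.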

Optional:

Artifacts written to OUT_DIR:

Run:

Required env:

Optional env:

Output:

•OCR uses the system tesseract binary (no JS OCR deps).
•If tesseract is missing, install it:
  •Ubuntu/Debian: sudo apt-get update && sudo apt-get install -y tesseract-ocr
  •macOS (brew): brew install tesseract
•R2 sync requires these env vars. Recommended: store them in a skill-local .env file at ~/skills/quiz-crawler/.env (the scripts auto-load it).
  •R2_ENDPOINT
  •R2_BUCKET
  •R2_PUBLIC_BASE
  •R2_ACCESS_KEY_ID
  •R2_SECRET_ACCESS_KEY
•Notion publish requires:
  •NOTION_API_KEY
  •NOTION_PARENT_PAGE_ID
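
A fail-fast check for the variables listed above might look like the following sketch. The variable names come from the lists above; the `REQUIRED_ENV` grouping and the `missingEnv` function are illustrative and not part of the skill's scripts.

```javascript
// Env var names are from the lists above; grouping and names are illustrative.
const REQUIRED_ENV = {
  r2: ["R2_ENDPOINT", "R2_BUCKET", "R2_PUBLIC_BASE", "R2_ACCESS_KEY_ID", "R2_SECRET_ACCESS_KEY"],
  notion: ["NOTION_API_KEY", "NOTION_PARENT_PAGE_ID"],
};

// Return the names in a group that are unset or empty in `env`.
function missingEnv(group, env = process.env) {
  return REQUIRED_ENV[group].filter((name) => !env[name]);
}

// Example: stop before attempting a Notion publish.
// const missing = missingEnv("notion");
// if (missing.length) throw new Error(`Missing env: ${missing.join(", ")}`);
```

Reporting every missing name at once avoids fixing one variable per failed run.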

.env notes:

•Lines like KEY=value or KEY="value".
•Do not commit .env (secrets). If you package/share the skill, exclude .env.
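
For illustration, a loader that accepts the two line shapes mentioned above could be sketched as follows. This is not the skill's actual loader: `parseDotEnv` is a hypothetical name, and skipping blank lines and `#` comments is an assumption.

```javascript
// Minimal .env parser sketch for KEY=value and KEY="value" lines.
// Assumption: blank lines and lines starting with # are ignored.
function parseDotEnv(text) {
  const vars = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (!line || line.startsWith("#")) continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // skip lines without a KEY= prefix
    const key = line.slice(0, eq).trim();
    let value = line.slice(eq + 1).trim();
    // Strip one pair of surrounding double quotes, as in KEY="value".
    if (value.length >= 2 && value.startsWith('"') && value.endsWith('"')) {
      value = value.slice(1, -1);
    }
    vars[key] = value;
  }
  return vars;
}
```

The result can be merged into `process.env` before the scripts read their configuration.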

Run:

•SRC_DIR=<OUT_DIR> node scripts/ocr_extract.js (writes ocr.json)
•SRC_DIR=<OUT_DIR> node scripts/qa_extract.js (writes qa.json: OCR question + DOM options)
•SRC_DIR=<OUT_DIR> BRAND=<BrandName> NOTION_PARENT_PAGE_ID=<page_id> node scripts/publish_notion.js
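
The three commands above can also be chained from a small Node wrapper, sketched below. Script paths and env var names are taken from the list above; `buildSteps`, `runPipeline` and the options object are hypothetical, and stopping at the first failing step is a design choice of this sketch, not documented behaviour.

```javascript
const { spawnSync } = require("node:child_process");

// Build the three steps listed above, in order. Script paths and env names
// are from the source; function names and argument shape are illustrative.
function buildSteps({ outDir, brand, parentPageId }) {
  return [
    { script: "scripts/ocr_extract.js", env: { SRC_DIR: outDir } },
    { script: "scripts/qa_extract.js", env: { SRC_DIR: outDir } },
    {
      script: "scripts/publish_notion.js",
      env: { SRC_DIR: outDir, BRAND: brand, NOTION_PARENT_PAGE_ID: parentPageId },
    },
  ];
}

// Run each step with the current node binary, stopping at the first failure.
function runPipeline(options) {
  for (const step of buildSteps(options)) {
    const result = spawnSync(process.execPath, [step.script], {
      env: { ...process.env, ...step.env },
      stdio: "inherit",
    });
    if (result.status !== 0) {
      throw new Error(`${step.script} failed with status ${result.status}`);
    }
  }
}
```

Usage would be along the lines of `runPipeline({ outDir: "<OUT_DIR>", brand: "<BrandName>", parentPageId: "<page_id>" })`, run from the skill directory so the relative script paths resolve.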

Fast defaults:

Required env:

Output:

•Creates 1 Notion child page under the parent with the format specified in assets/output.md:
  •Question: ...
  •Answer: ...
  •Screenshot: <external image>
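
As a rough sketch of how one quiz step could map onto that page layout, the code below builds Notion block objects for a Question line, an Answer line, and an external image. The block shapes follow the public Notion API; the `step` fields (`question`, `answer`, `screenshotUrl`) and both function names are assumptions, since neither the qa.json schema nor assets/output.md is shown here.

```javascript
// Build a Notion paragraph block containing plain text.
function paragraph(text) {
  return {
    object: "block",
    type: "paragraph",
    paragraph: { rich_text: [{ type: "text", text: { content: text } }] },
  };
}

// Assumption: `step` looks like { question, answer, screenshotUrl }.
// The real qa.json fields may be named differently.
function stepToBlocks(step) {
  return [
    paragraph(`Question: ${step.question}`),
    paragraph(`Answer: ${step.answer}`),
    {
      object: "block",
      type: "image",
      image: { type: "external", external: { url: step.screenshotUrl } },
    },
  ];
}
```

An external image block needs a publicly reachable URL, which is presumably why the screenshots are synced to R2 before publishing.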

Notes:

•This skill currently captures a single deterministic path. Different answers can lead to different pages; add branching only when requested.