
alma-scraper

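An intelligent scraper for Australian youth justice sources. It discovers, extracts, and learns from government, Indigenous, research, and media sources.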

SKILL.md
---
name: alma-scraper
description: Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.
---

ALMA Intelligent Scraper

When to Use

  • Finding new youth justice information
  • Updating ALMA intelligence
  • Discovering new sources
  • Analyzing coverage gaps
  • Checking what's new in youth justice

Commands

| Command | Purpose | Duration |
|---|---|---|
| quick | Top 10 high-value sources | 5 min |
| deep | All 50+ sources with discovery | 30-60 min |
| discover | Follow discovered links | Variable |
| source "QLD" | Deep dive specific jurisdiction | 15 min |
| gaps | Show coverage gaps | 2 min |
| status | Current knowledge state | Instant |

Learning Cycle

SCRAPE → EXTRACT → EVALUATE → LEARN → STORE
         (Claude)   (Quality)  (Patterns)

Quality Signals

| Signal | Weight |
|---|---|
| Relevance (AU youth justice?) | 30% |
| Novelty (new info?) | 25% |
| Specificity (concrete details?) | 20% |
| Evidence (research backed?) | 15% |
| Actionability (useful?) | 10% |
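
The weights above sum to 100%, so they can be combined as a weighted sum. The following is a minimal sketch only, assuming each signal is scored on a 0 to 1 scale; the names `QUALITY_WEIGHTS` and `quality_score` are hypothetical and are not taken from the skill's implementation.

```python
# Weights copied from the Quality Signals table (they sum to 1.0).
# The dictionary keys are illustrative names, not identifiers from the skill.
QUALITY_WEIGHTS = {
    "relevance": 0.30,
    "novelty": 0.25,
    "specificity": 0.20,
    "evidence": 0.15,
    "actionability": 0.10,
}


def quality_score(signals: dict[str, float]) -> float:
    """Return the weighted sum of per-signal scores.

    Assumption: every signal is a float in [0, 1].
    """
    missing = QUALITY_WEIGHTS.keys() - signals.keys()
    if missing:
        raise ValueError(f"missing signals: {sorted(missing)}")
    for name in QUALITY_WEIGHTS:
        if not 0.0 <= signals[name] <= 1.0:
            raise ValueError(f"signal {name!r} must be in [0, 1]")
    return sum(weight * signals[name] for name, weight in QUALITY_WEIGHTS.items())
```

Under these assumptions, a page that is fully relevant and actionable, half novel, half specific, and has no research backing scores 0.30 + 0.125 + 0.10 + 0 + 0.10 = 0.625.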

Priority Formula

priority = (quality × 0.4) + (freshness_need × 0.3) + (coverage_gap × 0.3)
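
As an illustration of how the formula could be used to order sources, here is a small sketch. It assumes all three inputs are normalised to the 0 to 1 range, which the skill does not state; `priority` and `rank_sources` are hypothetical helper names.

```python
def priority(quality: float, freshness_need: float, coverage_gap: float) -> float:
    """Apply the priority formula to one source.

    Assumption: each input is a float in [0, 1].
    """
    for name, value in (
        ("quality", quality),
        ("freshness_need", freshness_need),
        ("coverage_gap", coverage_gap),
    ):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1]")
    return quality * 0.4 + freshness_need * 0.3 + coverage_gap * 0.3


def rank_sources(candidates: dict[str, tuple[float, float, float]]) -> list[str]:
    """Return source names ordered from highest to lowest priority.

    `candidates` maps a source name to (quality, freshness_need, coverage_gap).
    """
    return sorted(candidates, key=lambda name: priority(*candidates[name]), reverse=True)
```

With these assumptions, a high-quality source that is already fresh and well covered can rank below a mid-quality source with a large coverage gap, because freshness need and coverage gap together carry 60% of the weight.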

Sacred Boundaries

  • Never scrape: Private info, court records, social media, paywalled content
  • Always mark: Community Controlled, Indigenous orgs, cultural knowledge
  • Always check: Consent level, cultural authority, data sovereignty
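
One way these boundaries might be enforced before a source is fetched is sketched below. Everything here is an assumption made for illustration: the category labels, the `Source` and `BoundaryDecision` types, and the rule that an unrecorded check blocks scraping are hypothetical and do not come from the skill's implementation.

```python
from dataclasses import dataclass, field

# Hypothetical labels derived from the three boundary rules.
NEVER_SCRAPE = {"private_info", "court_records", "social_media", "paywalled"}
ALWAYS_MARK = {"community_controlled", "indigenous_org", "cultural_knowledge"}
REQUIRED_CHECKS = ("consent_level", "cultural_authority", "data_sovereignty")


@dataclass
class Source:
    url: str
    categories: set[str] = field(default_factory=set)
    # Maps a check name to True once a person has confirmed it.
    checks: dict[str, bool] = field(default_factory=dict)


@dataclass
class BoundaryDecision:
    allowed: bool
    marks: set[str]
    reasons: list[str]


def check_boundaries(source: Source) -> BoundaryDecision:
    """Decide whether a source may be scraped and which marks it carries."""
    # Any never-scrape category blocks the source outright.
    reasons = [f"never scrape: {c}" for c in sorted(source.categories & NEVER_SCRAPE)]
    # Marks are recorded even when the source ends up blocked.
    marks = source.categories & ALWAYS_MARK
    # Assumption: a check that is missing or False blocks scraping.
    for name in REQUIRED_CHECKS:
        if not source.checks.get(name, False):
            reasons.append(f"unconfirmed check: {name}")
    return BoundaryDecision(allowed=not reasons, marks=marks, reasons=reasons)
```

Failing closed on an unconfirmed check is a deliberately conservative choice in this sketch; the reference files may define a different policy.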

File References

| Need | Reference |
|---|---|
| Database schema | references/database-schema.md |
| Extraction patterns | references/extraction-patterns.md |
| Coverage tracking | references/coverage-tracking.md |
| Implementation code | references/implementation.md |