arXiv Pattern Discovery
Search arXiv for academic papers describing agentic AI patterns and score them using the Pattern Quality Rubric.
Quick Start
Invoke this skill when the user asks to:
- •Search arXiv for new agent patterns
- •Find academic papers about multi-agent orchestration
- •Discover patterns from academic literature
- •Review the latest papers for extractable patterns
- •What new patterns are emerging from recent AI research?
Workflow
This skill implements a 3-phase discovery workflow:
Phase 1: Discovery
bash
# Search for recent papers (last 30 days, max 50 results) python scripts/arxiv_scanner.py --days=30 --max-results=50 --export-md results.md # Search for specific topics python scripts/arxiv_scanner.py --query="multi-agent systems" --max-results=100 # Search with minimum quality threshold python scripts/arxiv_scanner.py --min-score=7.0 --export-md high-quality.md
Phase 2: Review
- •Read the exported Markdown report
- •Identify high-quality papers (score >= 7.0)
- •Check for potential duplicates flagged by the scanner
- •Select candidates for pattern extraction
Phase 3: Next Steps
- •Use the
create-patternskill to extract patterns from selected papers - •Or manually create patterns using
patterns/TEMPLATE.md - •Run the similarity checker before committing to avoid duplicates
- •Run the validator to ensure pattern quality
Script Reference
arxiv_scanner.py
Main script for querying arXiv API and scoring papers.
Usage:
bash
python scripts/arxiv_scanner.py [OPTIONS]
Options:
- •
--query, -q: arXiv search query (default: agent/agentic/multi-agent papers) - •
--max-results, -n: Maximum results to fetch (default: 100) - •
--days, -d: Only include papers from last N days (default: 365) - •
--min-score, -m: Minimum quality score to include (default: 5.0) - •
--export-json: Export results to JSON file - •
--export-md: Export results to Markdown file - •
--patterns-dir: Path to patterns directory (default: patterns) - •
--verbose, -v: Print detailed output for each paper
Examples:
bash
# Recent high-quality papers python scripts/arxiv_scanner.py --days=7 --min-score=7.0 # Search specific topic python scripts/arxiv_scanner.py --query="multi-agent orchestration" --max-results=50 # Full scan with export python scripts/arxiv_scanner.py --days=30 --export-md arxiv_report.md --verbose
pattern_similarity_checker.py
Detect potentially duplicate or very similar patterns.
Usage:
bash
# Check a single pattern against existing patterns python scripts/pattern_similarity_checker.py patterns/new-pattern.md # Check all patterns against each other python scripts/pattern_similarity_checker.py --all # Custom threshold and export python scripts/pattern_similarity_checker.py --all --threshold=0.7 --export report.md
Options:
- •
--all, -a: Check all patterns against each other - •
--patterns-dir, -d: Path to patterns directory (default: patterns) - •
--threshold, -t: Similarity threshold for reporting (default: 0.5) - •
--export, -e: Export report to Markdown file
pattern_validator.py
Validate pattern files for completeness and quality.
Usage:
bash
# Validate a single pattern python scripts/pattern_validator.py patterns/new-pattern.md # Validate all patterns python scripts/pattern_validator.py --all --verbose # Export validation report python scripts/pattern_validator.py --all --export validation_report.md
Options:
- •
--all, -a: Validate all pattern files - •
--patterns-dir, -d: Path to patterns directory (default: patterns) - •
--verbose, -v: Print detailed output for each issue - •
--strict, -s: Treat warnings as errors - •
--export, -e: Export report to Markdown file
Next Steps After Discovery
After identifying candidate papers from arXiv:
- •Extract the Pattern: Use the
create-patternskill with the paper URL or PDF - •Validate: Run
pattern_validator.pyon the new pattern file - •Check for Duplicates: Run
pattern_similarity_checker.pyon the new pattern - •Review: Ensure the pattern meets the quality threshold (score >= 5.0)
- •Commit: Add the pattern to the repository
Quality Scoring
Papers are scored using the Pattern Quality Rubric (see RUBRIC.md for details):
- •Reusability (30%): Domain-specific → Multi-domain → Universal
- •Novelty (25%): Existing → Incremental → Fundamentally new
- •Clarity (20%): Vague → Clear → Crystal clear
- •Evidence (15%): No eval → Some eval → Strong empirical
- •Completeness (10%): Idea only → Partial details → Production-ready
Threshold: Score >= 5.0 qualifies for pattern extraction