AgentSkillsCN

github-trending

按语言、主题或关键词追踪 GitHub 趋势仓库。可用于发现热门项目、监测数据工程趋势(Databricks、Spark),或寻找优质学习资源。

SKILL.md
--- frontmatter
name: github-trending
description: Track GitHub trending repositories by language, topic, or keyword. Use for discovering popular projects, monitoring data engineering trends (Databricks, Spark), or finding learning resources.

GitHub Trending

Discover trending repositories on GitHub filtered by language, date range, or topic.

Quick Start

Method 1: Web Scraping (No API needed)

bash
# Fetch trending page
web_fetch "https://github.com/trending/python?since=daily"

# Parse with browser tool for better extraction
browser.open("https://github.com/trending")
browser.snapshot()

Method 2: GitHub CLI (if authenticated)

bash
# Search trending repos
gh search repos --sort stars --order desc --limit 10

# Filter by topic
gh search repos --topic databricks --sort stars --order desc

# Filter by language and recent updates
gh search repos --language python --sort updated --limit 10

Method 3: GitHub API (requires token)

bash
# Search by stars (proxy for trending)
curl "https://api.github.com/search/repositories?q=created:>$(date -v-7d +%Y-%m-%d)&sort=stars&order=desc"

Trending Time Ranges

  • ?since=daily - Today's trending
  • ?since=weekly - This week's trending
  • ?since=monthly - This month's trending

Popular Languages

Data Engineering:

  • python - Data science, ML, ETL
  • scala - Spark, big data
  • jupyter-notebook - Analysis notebooks

General:

  • javascript, typescript, go, rust

Filtering Strategies

For Databricks/Data Engineering

bash
# GitHub search query
site:github.com databricks OR "delta lake" OR spark stars:>100

# Topic-based search (gh CLI)
gh search repos --topic databricks --sort stars
gh search repos --topic spark --sort stars
gh search repos --topic mlflow --sort stars

Extract Key Info

From trending page, capture:

  • Repository name (owner/repo)
  • Description
  • Stars (total)
  • Stars today/week (if shown)
  • Primary language
  • Topics/tags

Output Format

markdown
## 🔥 GitHub Trending - [Language/Topic] - [Date]

1. **owner/repo-name** (⭐ 12.3K | +234 today)
   Description of the project
   Topics: #databricks #spark #python
   https://github.com/owner/repo-name

2. **owner/another-repo** (⭐ 5.6K | +89 today)
   Brief description
   Topics: #data-engineering #etl
   https://github.com/owner/another-repo

Common Use Cases

Daily Data Engineering Trends

Track Databricks, Spark, Delta Lake projects:

code
1. Fetch https://github.com/trending/python?since=daily
2. Filter for keywords: databricks, spark, delta, mlflow
3. Return top 3-5 matches

Weekly Learning Resources

Find popular educational repos:

code
1. Search repos with topic "data-engineering" or "databricks"
2. Sort by stars (last 7 days)
3. Filter for tutorials, examples, courses

Interview Prep Projects

Discover real-world implementations:

code
1. Search: "databricks" OR "lakehouse" language:python
2. Filter: stars >100, updated recently
3. Look for: example projects, best practices

Tips

  • Browser tool preferred for visual parsing (stars, trends)
  • GitHub CLI fastest for authenticated searches
  • Web scraping simplest for trending pages (no auth)
  • Cache results - trending changes slowly (safe to cache 1-2 hours)

Example Script

See scripts/github-trending.sh for automated trending fetch with keyword filtering.