Web Scraping with Firecrawl

Name: web-scraping
Rating: 92
Author: firecrawl

This skill enables intelligent web scraping and data extraction using Firecrawl.

When to Use

Use Firecrawl when you need to:

Scrape a single URL and return clean content.

Scrape multiple URLs in a single request.

Crawl an entire website starting from a URL.

•Best for: Comprehensive site extraction, documentation sites
•Returns: Content from multiple discovered pages
•Note: Async operation - use firecrawl_check_crawl_status to poll until complete

Check the status of an ongoing crawl operation.

Discover all URLs on a website.

Search the web and return scraped results.

Autonomous AI agent for complex web data gathering.

•Best for: Multi-step research, finding data across multiple sources, complex queries
•Just describe what data you need - the agent searches, navigates, and extracts automatically
•No URLs required - agent finds the information autonomously
•Returns: Structured data matching your request

Check the status of an agent job.

When scraping, you can request different output formats:

•
Start with map: For large sites, use firecrawl_map first to understand the site structure
•
Use appropriate tool:
- •Known single URL → scrape
- •Known multiple URLs → batch_scrape
- •Unknown pages on a site → crawl
- •Need to find pages → search
- •Complex multi-source research → agent
•
Handle crawls automatically: When crawling, always poll firecrawl_check_crawl_status until complete - don't ask users to check manually
•
Choose the right format: Use summary for quick overviews, markdown for full content, links for navigation analysis
•
Handle rate limits: Be mindful of API credits and rate limits on large operations
•
JavaScript-rendered content: Firecrawl automatically handles dynamic content - no special configuration needed