Firecrawl
Managed scraping and crawling API that returns clean markdown for LLMs.
Pros
- LLM-ready markdown out of the box
- Crawl, scrape, search, and extract APIs
- Generous SDKs
Cons
- Per-page cost adds up
- Closed-source core
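A rough sketch of what calling a managed scrape API like this looks like from Python, using only the standard library. The endpoint path, the `formats` field, and the response shape are assumptions based on Firecrawl's public REST API and may differ between versions, so check them against the current docs. The function names are hypothetical.

```python
import json
import urllib.request

# Assumed endpoint; verify against the current Firecrawl API reference.
FIRECRAWL_SCRAPE_URL = "https://api.firecrawl.dev/v1/scrape"


def build_scrape_request(page_url: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for markdown output."""
    # Assumed request body: the target URL plus the desired output formats.
    payload = {"url": page_url, "formats": ["markdown"]}
    return urllib.request.Request(
        FIRECRAWL_SCRAPE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def scrape_markdown(page_url: str, api_key: str, timeout: float = 30.0) -> str:
    """Send the request and return the markdown field of the response."""
    req = build_scrape_request(page_url, api_key)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        body = json.load(resp)
    # Assumed response shape: {"success": true, "data": {"markdown": "..."}}
    return body["data"]["markdown"]
```

Every call to `scrape_markdown` is one billed page, which is where the per-page cost comes from on large crawls.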
Modern web scraping is less about HTML parsing and more about JS rendering, anti-bot evasion, and turning pages into LLM-ready markdown. Pick based on whether you control the runtime or want a managed API.
Managed scraping and crawling API that returns clean markdown for LLMs.
Marketplace and platform for hosted scrapers (Actors) with proxies built in.
Battle-tested Python framework for large-scale crawls.
Browser automation library you can drive for scraping JS-heavy sites.
Headless browser-as-a-service tuned for AI agents and scraping at scale.
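If you control the runtime instead, the "turn pages into LLM-ready markdown" step is yours to write. The sketch below is a deliberately tiny illustration built on Python's standard `html.parser`. The class and function names are hypothetical, it handles only headings, paragraphs, links, and list items, and it is not how any of the tools above implement conversion.

```python
from html.parser import HTMLParser

HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class MarkdownExtractor(HTMLParser):
    """Toy HTML-to-markdown converter (illustrative only)."""

    SKIP = {"script", "style", "nav", "footer"}  # tags whose content is dropped

    def __init__(self):
        super().__init__()
        self.blocks = []     # finished markdown blocks
        self.buf = []        # text pieces of the block being built
        self.prefix = ""     # markdown prefix for the current block
        self.skip_depth = 0  # > 0 while inside a skipped tag
        self.href = None     # link target while inside <a>

    def flush(self):
        """Close the current block, collapsing runs of whitespace."""
        text = " ".join("".join(self.buf).split())
        if text:
            self.blocks.append(self.prefix + text)
        self.buf, self.prefix = [], ""

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
            return
        if self.skip_depth:
            return
        if tag in HEADINGS:
            self.flush()
            self.prefix = "#" * int(tag[1]) + " "
        elif tag == "li":
            self.flush()
            self.prefix = "- "
        elif tag == "p":
            self.flush()
        elif tag == "a":
            self.href = dict(attrs).get("href")
            self.buf.append("[")

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth:
            return
        if tag == "a":
            self.buf.append(f"]({self.href or ''})")
            self.href = None
        elif tag in HEADINGS or tag in {"p", "li"}:
            self.flush()

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.buf.append(data)


def html_to_markdown(html: str) -> str:
    """Convert an HTML string to a minimal markdown string."""
    parser = MarkdownExtractor()
    parser.feed(html)
    parser.close()
    parser.flush()  # emit any trailing text
    return "\n\n".join(parser.blocks)
```

This only works on HTML that is already rendered. JS-heavy pages still need a browser step first, and real pages need far more handling (tables, code blocks, nested lists), which is much of what the managed options are selling.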
Concepts you will run into when working with web scraping tools.
