Skip to main content

Crawl4AI

Crawl4AI is an open-source LLM-friendly web crawler that simplifies extracting structured data from unstructured web pages.

Why Crawl4AI?

Instead of writing complex CSS selectors that break when the site updates, you pass a Pydantic schema and an LLM extracts the data for you.

Example

python
from crawl4ai import WebCrawler
from pydantic import BaseModel

class Product(BaseModel):
name: str
price: float

crawler = WebCrawler()
result = crawler.run("https://store.example.com", schema=Product)
print(result.extracted_data)