Crawl4AI v0.6.0 Release Notes
We're excited to announce the release of Crawl4AI v0.6.0, our biggest and most feature-rich update yet. This version introduces major architectural upgrades, brand-new capabilities for geo-aware crawling, high-efficiency scraping, and real-time streaming support for scalable deployments.
Highlights
1. World-Aware Crawlers
Crawl as if youโre anywhere in the world. With v0.6.0, each crawl can simulate: - Specific GPS coordinates - Browser locale - Timezone
Example:
CrawlerRunConfig(
url="https://browserleaks.com/geo",
locale="en-US",
timezone_id="America/Los_Angeles",
geolocation=GeolocationConfig(
latitude=34.0522,
longitude=-118.2437,
accuracy=10.0
)
)
2. Native Table Extraction
Extract HTML tables directly into usable formats like Pandas DataFrames or CSV with zero parsing hassle. All table data is available under result.media["tables"]
.
Example:
raw_df = pd.DataFrame(
result.media["tables"][0]["rows"],
columns=result.media["tables"][0]["headers"]
)
3. Browser Pooling & Pre-Warming
We've overhauled browser management. Now, multiple browser instances can be pooled and pages pre-warmed for ultra-fast launches: - Reduces cold-start latency - Lowers memory spikes - Enhances parallel crawling stability
This powers the new Docker Playground experience and streamlines heavy-load crawling.
4. Traffic & Snapshot Capture
Need full visibility? You can now capture: - Full network traffic logs - Console output - MHTML page snapshots for post-crawl audits and debugging
No more guesswork on what happened during your crawl.
5. MCP API and Streaming Support
Weโre exposing MCP socket and SSE endpoints, allowing: - Live streaming of crawl results - Real-time integration with agents or frontends - A new Playground UI for interactive crawling
This is a major step towards making Crawl4AI real-time ready.
6. Stress-Test Framework
Want to test performance under heavy load? v0.6.0 includes a new memory stress-test suite that supports 1,000+ URL workloads. Ideal for: - Load testing - Performance benchmarking - Validating memory efficiency
Core Improvements
- Robots.txt compliance
- Proxy rotation support
- Improved URL normalization and session reuse
- Shared data across crawler hooks
- New page routing logic
Breaking Changes & Deprecations
- Legacy
crawl4ai/browser/*
modules are removed. Update imports accordingly. AsyncPlaywrightCrawlerStrategy.get_page
now uses a new function signature.- Deprecated markdown generator aliases now point to
DefaultMarkdownGenerator
with warning.
Miscellaneous Updates
- FastAPI validators replaced custom validation logic
- Docker build now based on a Chromium layer
- Repo-wide cleanup: ~36,000 insertions, ~5,000 deletions
New Examples Included
- Geo-location crawling
- Network + console log capture
- Docker MCP API usage
- Markdown selector usage
- Crypto project data extraction
Watch the Release Video
Want a visual walkthrough of all these updates? Watch the video: ๐ https://youtu.be/9x7nVcjOZks
If you're new to Crawl4AI, start here: ๐ https://www.youtube.com/watch?v=xo3qK6Hg9AA&t=15s
Join the Community
Weโve just opened up our Discord for the public. Join us to: - Ask questions - Share your projects - Get help or contribute
๐ฌ https://discord.gg/wpYFACrHR4
Install or Upgrade
Live long and import crawl4ai. ๐