Crawl4AI Blog

Welcome to the Crawl4AI blog! Here you'll find detailed release notes, technical insights, and updates about the project. Whether you're looking for the latest improvements or want to dive deep into web crawling techniques, this is the place.

When to Stop Crawling: The Art of Knowing "Enough"

January 29, 2025

Traditional crawlers are like tourists with unlimited timeβ€”they'll visit every street, every alley, every dead end. But what if your crawler could think like a researcher with a deadline? Discover how Adaptive Crawling revolutionizes web scraping by knowing when to stop. Learn about the three-layer intelligence system that evaluates coverage, consistency, and saturation to build focused knowledge bases instead of endless page collections.

Read the full article β†’

The LLM Context Protocol: Why Your AI Assistant Needs Memory, Reasoning, and Examples

January 24, 2025

Ever wondered why your AI coding assistant struggles with your library despite comprehensive documentation? This article introduces the three-dimensional context protocol that transforms how AI understands code. Learn why memory, reasoning, and examples together create wisdomβ€”not just information.

Read the full article β†’

Latest Release

Crawl4AI v0.8.5 – Anti-Bot Detection, Shadow DOM & 60+ Bug Fixes

March 2026

Crawl4AI v0.8.5 is the biggest release since v0.8.0, bringing automatic anti-bot detection with proxy escalation, Shadow DOM flattening, deep crawl cancellation, and over 60 bug fixes.

Key highlights: - πŸ›‘οΈ Anti-Bot Detection & Proxy Escalation: 3-tier detection with automatic retry, proxy chain, and fallback - πŸŒ‘ Shadow DOM Flattening: Extract content hidden inside shadow DOM components - πŸ›‘ Deep Crawl Cancellation: Stop long crawls gracefully with cancel() or should_cancel callback - πŸ”’ Critical Security Fixes: RCE via deserialization patched, Redis CVE-2025-49844 fixed

Read full release notes β†’

Recent Releases

Crawl4AI v0.8.0 – Crash Recovery & Prefetch Mode

January 2026

Crawl4AI v0.8.0 introduces crash recovery for deep crawls, a new prefetch mode for fast URL discovery, and critical security fixes for Docker deployments.

Key highlights: - πŸ”„ Deep Crawl Crash Recovery: on_state_change callback for real-time state persistence, resume_state to continue from checkpoints - ⚑ Prefetch Mode: prefetch=True for 5-10x faster URL discovery, perfect for two-phase crawling patterns - πŸ”’ Security Fixes: Hooks disabled by default, file:// URLs blocked on Docker API, __import__ removed from sandbox

Read full release notes β†’

Crawl4AI v0.7.8 – Stability & Bug Fix Release

December 2025

Crawl4AI v0.7.8 is a focused stability release addressing 11 bugs reported by the community. Fixes for Docker deployments, LLM extraction, URL handling, and dependency compatibility.

Key highlights: - 🐳 Docker API Fixes: ContentRelevanceFilter deserialization, ProxyConfig serialization, cache folder permissions - πŸ€– LLM Improvements: Configurable rate limiter backoff, HTML input format support - πŸ“¦ Dependencies: Replaced deprecated PyPDF2 with pypdf, Pydantic v2 ConfigDict compatibility

Read full release notes β†’


Older Releases

Version Date Highlights
v0.7.7 November 2025 Self-hosting platform, real-time monitoring, smart browser pool
v0.7.6 October 2025 Webhook infrastructure, reliable delivery, custom auth
v0.7.5 September 2025 Docker Hooks System, enhanced LLM integration, HTTPS preservation
v0.7.4 August 2025 LLM-powered table extraction, performance improvements
v0.7.3 July 2025 Undetected browser, multi-URL config, memory monitoring
v0.7.1 June 2025 Bug fixes and stability improvements
v0.7.0 May 2025 Adaptive crawling, virtual scroll, link analysis

Project History

Curious about how Crawl4AI has evolved? Check out our complete changelog for a detailed history of all versions and updates.

Stay Updated

  • Star us on GitHub
  • Follow @unclecode on Twitter
  • Join our community discussions on GitHub

> Feedback