Web Scraping APIs

What Is Scrapling?

What Is Scrapling? — conceptual illustration
On this page

Scrapling is an all-in-one Python scraping framework that bundles fetching, parsing, anti-detection, and crawling behind one API — it is a layer above the other tools, not a competitor. Its three-tier fetching system uses curl_cffi for TLS-impersonated HTTP (fastest), standard Playwright for JS rendering, and a StealthyFetcher that wraps Patchright or Camoufox for maximum stealth with Cloudflare auto-solving. Its standout feature is adaptive element tracking that re-locates selectors when a site changes its DOM.

Quick facts

TypeAll-in-one scraping framework (fetch + parse + crawl + stealth)
LanguagePython
Three tiersFetcher (curl_cffi) / DynamicFetcher (Playwright) / StealthyFetcher (Patchright or Camoufox)
Unique featureAdaptive selectors auto-relocate when the DOM changes
Also includesScrapy-like spider, fast parser, MCP server for AI workflows

The three-tier fetching system

Tier 1 — Fetcher uses curl_cffi to send HTTP requests with a real browser TLS fingerprint (JA3/JA4) for Chrome/Firefox/Safari/Edge, plus browserforge-generated headers. No JavaScript runs, so there is no browser-fingerprinting surface at all — the fastest option (~10MB footprint) and enough for ~90% of pages. Tier 2 — DynamicFetcher is vanilla Playwright for JS-rendered content with wait_selector and network-idle helpers, but no special stealth. Tier 3 — StealthyFetcher wraps Patchright (default) or Camoufox (use_camoufox=True) and exposes flags like solve_cloudflare=True, block_webrtc=True, hide_canvas=True, and disable_webgl=True.

Cloudflare handling and adaptive selectors

solve_cloudflare=True detects the Turnstile/interstitial page, waits for the challenge iframe, interacts, and waits for the redirect — without an external CAPTCHA API. As the analysis stresses, it is not solving the CAPTCHA; Patchright/Camoufox present a browser environment Cloudflare trusts enough to auto-pass.

The unique feature is adaptive element tracking: Scrapling can store the structural context of a matched element and, when a CSS selector breaks after a site redesign, re-locate the same element by fuzzy structural matching. No other tool in this comparison offers this. It pairs with a fast lxml-based parser, a Scrapy-like spider with pause/resume checkpoints, and an MCP server for Claude/Cursor workflows.

When to use Scrapling

Use it when: you want one framework for an entire pipeline rather than stitching tools together, you have mixed protection levels (cheap HTTP for most pages, browser stealth for the hard ones), you scrape sites that change their DOM often, or you want AI-integrated scraping. Note: it does not add new stealth of its own — Tier 3 inherits whatever Patchright or Camoufox provides — so for the hardest enterprise targets you are still bound by those engines' limits. It is a Python orchestration layer, not a novel evasion technique.

Code example

python
from scrapling import Fetcher, StealthyFetcher

# Tier 1: fast TLS-impersonated HTTP for easy pages
html = Fetcher().get("https://example.com", stealthy_headers=True)

# Tier 3: Patchright/Camoufox under the hood for protected pages
page = StealthyFetcher().get(
    "https://cloudflare-protected.com",
    solve_cloudflare=True,
    block_webrtc=True,
    hide_canvas=True,
)
print(page.css_first("h1::text"))

Related terms

Concept map

How Scrapling connects

The terms most directly tied to this one. Hover a node to see its neighbours, click to preview, drag to rearrange.

0 terms · 0 connections
You are here · Web Scraping APIs
Building map…

Frequently asked questions

Is Scrapling a stealth tool or a framework?

A framework. It does not invent new evasion — it orchestrates other tools (curl_cffi for TLS, Patchright/Camoufox for browser stealth) behind one API, and adds parsing, crawling, and adaptive selectors on top. Think of it as a layer above Camoufox and Patchright, not a competitor to them.

What are the three fetching tiers?

Fetcher (curl_cffi HTTP with TLS impersonation — fastest, no browser), DynamicFetcher (vanilla Playwright for JS rendering, no special stealth), and StealthyFetcher (Patchright or Camoufox for maximum stealth with Cloudflare auto-solving). You pick the tier per target by protection level.

How does solve_cloudflare work?

It detects the Turnstile/interstitial page, waits for the challenge iframe, interacts, and waits for the redirect. It does not solve the CAPTCHA cryptographically — the underlying Patchright/Camoufox browser is trusted enough that Cloudflare auto-passes it.

What is adaptive element tracking?

Scrapling can remember the structural fingerprint of an element it matched. When a site redesign breaks your CSS selector, it re-finds the same element by fuzzy structural matching instead of failing. It is Scrapling's most distinctive feature among these tools.

Last updated: 2026-05-28