What Is a CAPTCHA Solver?

A CAPTCHA solver is software that automatically completes CAPTCHA challenges on behalf of an automated client. It receives the challenge from a target site, processes it using AI models, browser automation, or a human-in-the-loop service, and returns a token the site accepts as proof of being human — allowing a scraper, bot, or test runner to continue past the gate without manual intervention.

Quick facts

Also known as: CAPTCHA bypass, CAPTCHA automation, anti-CAPTCHA
Common types solved: reCAPTCHA v2/v3, hCaptcha, Cloudflare Turnstile, FunCaptcha, image CAPTCHAs
Primary use case: Keeping scrapers, automated tests, and account workflows running
Typical pricing: $1–$3 per 1,000 solves (machine), $1–$2 per 1,000 (human)
Risk level: Medium. Must respect site terms; widely used for public-data scraping and QA

How CAPTCHA solvers work

Most solvers follow the same three-step pattern. First, the scraper detects a CAPTCHA on the page (or is told to expect one) and extracts the parameters the challenge needs: the site key, page URL, and challenge type. Second, it forwards those parameters to a solving backend: an in-house AI model trained on millions of past challenges, a network of low-cost human workers, or a hybrid that escalates hard cases to humans. Third, the backend returns a token (a long opaque string) that the scraper drops into the page's form or sends with the next request. The target site validates the token with its CAPTCHA provider, gets a passing result, and lets the request through. For invisible CAPTCHAs like reCAPTCHA v3 or Turnstile, the solver often runs the challenge inside a real browser session with a realistic fingerprint, so the resulting token is backed by trusted behavioral and TLS signals.
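The three steps can be sketched in a few lines of Python. This is a minimal illustration, not a production detector: it assumes the widget is embedded as a tag whose class attribute (g-recaptcha, h-captcha, or cf-turnstile) appears before its data-sitekey attribute, it uses a regular expression where a real scraper would use an HTML parser, and the `backend` callable is a hypothetical stand-in for whichever solving service you use.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional

# Widget CSS class -> (challenge type, form field that carries the token).
WIDGETS = {
    "g-recaptcha": ("recaptcha_v2", "g-recaptcha-response"),
    "h-captcha": ("hcaptcha", "h-captcha-response"),
    "cf-turnstile": ("turnstile", "cf-turnstile-response"),
}


@dataclass
class Challenge:
    kind: str
    site_key: str
    page_url: str
    response_field: str


def detect_challenge(html: str, page_url: str) -> Optional[Challenge]:
    """Step 1: find a CAPTCHA widget and extract its parameters."""
    for css_class, (kind, field) in WIDGETS.items():
        # Assumption: class="..." comes before data-sitekey="..." in the tag.
        pattern = (
            r'<[^>]*class="[^"]*\b' + re.escape(css_class)
            + r'\b[^"]*"[^>]*data-sitekey="([^"]+)"'
        )
        match = re.search(pattern, html)
        if match:
            return Challenge(kind, match.group(1), page_url, field)
    return None


def solve_and_build_form(
    html: str,
    page_url: str,
    backend: Callable[[Challenge], str],  # hypothetical solver interface
    form_data: dict,
) -> dict:
    """Steps 2 and 3: get a token from the backend and add it to the form."""
    challenge = detect_challenge(html, page_url)
    if challenge is None:
        return dict(form_data)  # no CAPTCHA found, submit the form as-is
    token = backend(challenge)  # step 2: forward parameters, receive token
    return {**form_data, challenge.response_field: token}  # step 3
```

The returned dictionary is what the scraper would post back to the site. Keeping the backend behind a plain callable means the same code works with an AI model, a human-worker service, or a hybrid.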

Why CAPTCHA solvers matter for web scraping

CAPTCHAs are the most visible layer of bot defense, and any non-trivial scraping project will hit them. Without a solver, a single CAPTCHA-protected page can stall a job indefinitely. With one, the scraper recovers automatically and keeps moving. Solvers also matter because they let you scale: solving 50,000 challenges by hand is not a workflow, but solving them at $2 per thousand is a line item. The flip side is that solvers are not a silver bullet — they handle the challenge itself, but if your IP, headers, or TLS fingerprint look automated, the site will just serve another challenge a few requests later. A solver is one component of a working scraping stack, not the whole thing.

Common implementations

There are three common shapes. Pure-API services (2Captcha, Anti-Captcha, CapSolver) take a job over HTTP and return a token; you wire them in yourself. Browser-automation libraries (Playwright/Puppeteer plugins) inject the solver into a real browser session and click through challenges programmatically. Full scraping APIs like Scrappey roll the solver into the same request that fetches the page — you send a URL, the API handles proxies, JS rendering, fingerprinting, and CAPTCHAs as one call, and you get the final HTML or JSON back. Most production scrapers end up using either the third option or a combination of the first two.
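For the pure-API shape, the wiring usually amounts to submitting a job and then polling until a token is ready. The sketch below shows only that control flow. The `submit` and `fetch_result` callables are hypothetical placeholders for the provider-specific HTTP calls, which differ between services and are not shown here; the default interval and timeout are illustrative assumptions.

```python
import time
from typing import Callable, Optional


class SolveTimeout(Exception):
    """Raised when the backend does not return a token in time."""


def wait_for_token(
    submit: Callable[[], str],                      # hypothetical: returns a job ID
    fetch_result: Callable[[str], Optional[str]],   # hypothetical: token or None
    poll_interval: float = 5.0,
    timeout: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Submit one job, then poll until a token arrives or the deadline passes."""
    job_id = submit()
    deadline = clock() + timeout
    while True:
        token = fetch_result(job_id)
        if token is not None:
            return token
        # Give up if the next poll would land after the deadline.
        if clock() + poll_interval > deadline:
            raise SolveTimeout(f"job {job_id} not solved within {timeout}s")
        sleep(poll_interval)
```

Passing `sleep` and `clock` as parameters keeps the helper testable without real waiting. A hard timeout matters here because a stalled solve would otherwise block the scraper just as an unsolved CAPTCHA does.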

Limitations and alternatives

Solvers cost real money per challenge, so a poorly built scraper that triggers a CAPTCHA on every request quickly becomes expensive. They also add latency: solving a Turnstile challenge can take 8–20 seconds. The first line of defense is to avoid the CAPTCHA in the first place: rotate quality residential proxies, send realistic browser fingerprints, throttle request rate, and reuse cookies across requests in the same session. When you do hit one, fall back to the solver. For sites that gate every request behind a CAPTCHA, switching to an official API (if one exists) or a managed scraping endpoint is almost always cheaper than solving thousands of challenges per hour.
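The cost and latency trade-off is simple arithmetic, and it helps to run the numbers before a job starts. The helper below is a rough sketch: the function name and its fields are made up for illustration, and it assumes every CAPTCHA is solved on the first attempt and that solves run one after another.

```python
from dataclasses import dataclass


@dataclass
class SolverOverhead:
    solves: float          # expected number of challenges solved
    cost_usd: float        # solver spend for the whole job
    added_seconds: float   # total solve time if run serially


def estimate_solver_overhead(
    requests_sent: int,
    captcha_rate: float,        # fraction of requests that hit a CAPTCHA (0 to 1)
    price_per_1000_usd: float,  # solver price per 1,000 solves
    seconds_per_solve: float,   # average solve latency
) -> SolverOverhead:
    """Estimate solver spend and added latency for a scraping job."""
    if not 0.0 <= captcha_rate <= 1.0:
        raise ValueError("captcha_rate must be between 0 and 1")
    solves = requests_sent * captcha_rate
    return SolverOverhead(
        solves=solves,
        cost_usd=solves / 1000 * price_per_1000_usd,
        added_seconds=solves * seconds_per_solve,
    )


if __name__ == "__main__":
    # 50,000 requests that all hit a CAPTCHA, at $2 per 1,000 and 8 s per solve.
    overhead = estimate_solver_overhead(50_000, 1.0, 2.0, 8.0)
    print(f"${overhead.cost_usd:.2f}")  # prints $100.00
```

With a CAPTCHA on every one of 50,000 requests, the spend at $2 per thousand is $100, but at 8 seconds per solve the serial solve time is 400,000 seconds, roughly 111 hours. Lowering the CAPTCHA rate through better proxies and fingerprints reduces both numbers in proportion.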

Code example

```python
import requests

resp = requests.post(
    'https://publisher.scrappey.com/api/v1',
    json={
        'cmd': 'request.get',
        'url': 'https://example.com/protected',
        'autoparse': True
    },
    headers={'Authorization': 'YOUR_API_KEY'},
    timeout=120,  # solving can take a while, but never wait forever
)
resp.raise_for_status()  # fail loudly on HTTP errors instead of parsing an error body

# CAPTCHA + proxy + fingerprinting handled server-side
html = resp.json()['solution']['response']
```

Frequently asked questions

Are CAPTCHA solvers legal?

Using a solver on public data, your own accounts, or for QA testing is generally legal in most jurisdictions. Using one to bypass authentication on systems you are not authorized to access, to breach a site's terms of service where those terms are contractually enforceable, or to commit fraud is not. The tool is neutral; the use case matters.

How accurate are CAPTCHA solvers?

Image and reCAPTCHA v2 solve rates from quality providers sit in the 90–99% range. Turnstile and reCAPTCHA v3 are harder because they score behavior, not just the challenge — accuracy depends as much on the surrounding fingerprint as on the solver itself.

How much does CAPTCHA solving cost?

Machine solvers typically charge $1–$3 per 1,000 solves. Human solvers are in the same range but slower. Integrated scraping APIs roll the cost into the per-request price, which is usually cheaper than solving at scale yourself.

Can sites detect that a CAPTCHA solver was used?

Not directly: the token a solver returns looks identical to one a human would produce. But sites do detect the surrounding context. An IP with no browsing history, a TLS fingerprint that does not match the claimed browser, or response timing that is suspiciously uniform (for example, exactly 200 ms every time) are all stronger signals than the token itself.

Last updated: 2026-05-26