CUA vs web scraping — different categories
Classic web scraping: anonymous HTTP/browser requests against public pages. You see what a logged-out visitor sees. Per-request billing, high concurrency, low latency.
Computer Use Agent: the user grants the agent permission, the agent logs in as the user. Per-task billing, low concurrency (one VM per task), higher latency (30 seconds to several minutes per task), clean legal posture because the data belongs to the user.
Mental model: CUAs are "Plaid for any website" — bringing the open-banking pattern (user permission + structured data extraction) to portals that have no public API. Utility bills, bank statements, payroll exports, insurance claims, tax filings, e-commerce backend orders.
When each one wins
Use a CUA when: the data lives behind a login the user owns; the portal has no API; the workflow requires MFA, step-up authentication, or human-grade UI navigation; you need one-off retrievals at human scale (5 documents per user × 200 users).
Use traditional scraping when: the data is public-facing (e-commerce listings, SERPs, social media, news, real estate); you need low latency (sub-second); you need high concurrency (100+ parallel requests); the per-request cost matters (scraping is 10–100× cheaper for the same data when both work).
At 100k items, scraping might cost €20–€100 on Scrappey. Running 100k CUA tasks could cost $5,000–$100,000 depending on platform. The cost gap is what makes the categories distinct rather than competing.
The market in May 2026
Anthropic Computer Use — direct API, operates the actual host machine via mouse and keyboard primitives. Best for custom agent pipelines. Scores 56% on WebVoyager because it operates real desktops with all their messy complexity, not controlled browser-only VMs.
OpenAI Operator (CUA) — hosted product with built-in browser capabilities, scores 87% on WebVoyager in controlled environments.
Skyvern — open-source (YC-backed), Vision-LLM driven, 85.8% WebVoyager. Strong on invoice retrieval, job applications, government forms, insurance quotes. Both cloud-hosted and self-hostable.
Browser Use — the leading open-source browser-only agent at 89% WebVoyager, 78k+ GitHub stars. Connect any LLM, run locally or self-hosted. Supports OpenAI, Anthropic, Gemini, Ollama for local models.
Deck — managed VMs with credential vault, SOC 2, positioned as "Plaid for any website" with 100k+ utility provider integrations.
