What Is JavaScript Rendering in Web Scraping?

By the Scrappey Research Team

Paste into ChatGPT, Claude, or any LLM

What Is JavaScript Rendering in <a href=

On this page

JavaScript rendering is the process of executing a page's JavaScript in a real browser engine so that content built on the client side appears before you extract it. Many modern sites send a near-empty HTML shell and then build the visible page with JavaScript - fetching data, assembling the DOM, and injecting content after load. A plain HTTP request only sees the empty shell. Rendering runs that JavaScript (in a headless browser) so the finished page exists to scrape, the same way it would in a normal browser.

Needed when	Content is built client-side (SPAs, lazy-loaded data)
Runs in	A headless browser engine (Chromium, Firefox, WebKit)
Tools	Playwright, Puppeteer, Selenium, or a rendering API
Cost	Slower and heavier than a raw HTTP request
Faster alternative	Call the underlying JSON/XHR API directly when one exists

How JavaScript rendering works

A rendering scraper launches a real browser engine - usually headless Chromium via Playwright or Puppeteer - and points it at the URL. The browser does what any browser does: downloads the HTML, runs the scripts, makes the background XHR/fetch calls those scripts trigger, and builds the final DOM. The scraper then waits for the content to be ready (a specific element to appear, or the network to go idle) and reads the rendered HTML. This is heavier than a raw request because you are running an entire browser per page - more CPU, more memory, more time - but it is the only way to see content that does not exist until JavaScript creates it.

When you actually need rendering

Rendering is essential for client-side apps, but it is overused. The test is simple: fetch the raw HTML and look for your data. If it is already in the response - many sites server-render or embed data in a <script> JSON blob - you do not need a browser at all, and a plain HTTP request is an order of magnitude faster and cheaper. If the raw HTML is an empty shell and your data only appears after scripts run, you need rendering. A middle path is often best: open the network tab, find the JSON API the page's JavaScript is calling, and request that endpoint directly. Calling the underlying API skips the browser entirely and returns clean structured data.

Rendering and anti-bot detection

Running a headless browser solves the content problem but creates a detection one. Automation frameworks leak signals - a navigator.webdriver flag set to true, missing or inconsistent browser properties, a TLS or fingerprint that does not match the user agent the browser claims - and headless detection looks for exactly these. So rendering at scale is not just "run a browser"; it is run a browser whose fingerprint is coherent, behind a residential IP, at a human pace. That coherence is hard to maintain across thousands of sessions, which is why a managed scraping API that renders JavaScript and matches a believable fingerprint in the same call is often more reliable than a self-hosted headless fleet.

Code example

python

# Render in a headless browser to get client-side content
from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch(headless=True)
    page = browser.new_page()
    page.goto('https://example.com/app')
    page.wait_for_selector('.product')   # wait until JS injects the data
    html = page.content()                # fully rendered DOM
    browser.close()

# Or send the URL to a scraping API with render=true and skip the fleet.

Related terms

What Is a Headless Browser?

A headless browser is a real web browser — Chrome, Firefox, or WebKit — that runs without a visible window, driven entirely by code instead …

What Is Playwright?

Playwright is a cross-browser automation framework from Microsoft that drives Chromium, Firefox, and WebKit through a single API. An automat…

What Is Puppeteer?

Puppeteer is Google's Node.js library for driving a Chromium browser from code, over the Chrome DevTools Protocol (CDP) - the same channel C…

How to Reverse-Engineer API Requests for Scraping

Reverse-engineering API requests for scraping means watching the network traffic a website makes, spotting the JSON endpoints that feed its …

What Is Headless Browser Detection?

Headless browser detection is the set of probes anti-bot systems use to distinguish a headless or instrumented Chrome session from a real us…

What Is a Web Scraping API?

A web scraping API is a hosted HTTP service that visits a web page for you and hands back the result — rendered HTML, JSON, or already-parse…

What Is a User Agent?

A user agent is a short text string a client sends in the User-Agent HTTP header to tell a server what software is making the request. Every…

How to Scrape Prices: Build a Price Monitor That Survives Anti-Bot

To scrape prices reliably you fetch each product page through a residential proxy in the right country, parse the current price out of the p…

Concept map

How JavaScript Rendering connects

The terms most directly tied to this one. Hover a node to see its neighbours, click to preview, drag to rearrange.

0 terms · 0 connections

You are here · Web Scraping APIs

Tools & solutions for this topic

Frequently asked questions

When do I need JavaScript rendering to scrape a page?

When the data you want is not in the raw HTML response and only appears after the page's JavaScript runs - typical of single-page apps and lazy-loaded content. If the data is already present in the initial HTML, you do not need rendering and a plain HTTP request is faster and cheaper.

What is the difference between rendering and a plain HTTP request?

A plain HTTP request downloads the HTML the server sends and stops. JavaScript rendering also runs that HTML in a real browser engine, executing scripts and background API calls so client-side content is built before you read it. Rendering is far heavier but sees content a raw request never will.

Is there a faster alternative to rendering JavaScript?

Often yes: find the JSON API the page's JavaScript calls (visible in the browser network tab) and request that endpoint directly. It skips the browser entirely and returns clean structured data, which is faster and lighter than rendering - when such an endpoint exists and is accessible.

Does running a headless browser get me blocked?

It can, because automation leaves signals like navigator.webdriver=true and fingerprint inconsistencies that headless-detection systems look for. Rendering reliably at scale requires a coherent fingerprint, a residential IP, and human-like pacing - not just launching a browser in headless mode.

Last updated: 2026-06-08