LLM-ready output

Website to Markdown on any URL

One API call. Clean, LLM-ready Markdown — even from sites behind WAFs, JS challenges, and bot-detection. Drop it straight into RAG pipelines, Custom GPTs, or agent tools.

Pair with success-only billing — you don't pay for pages we can't deliver.

curl -X POST https://publisher.scrappey.com/api/v1 \
  -H "Content-Type: application/json" \
  -d '{
    "cmd": "request.get",
    "url": "https://example.com/article",
    "markdown": true
  }'

Built for AI workloads

Where Markdown output matters most

RAG pipelines

Pull clean Markdown from URLs you are authorized to access into your retriever — no HTML noise, no scripts, no styling junk. Stable structure means better chunking and embeddings.

Custom GPTs and Claude Projects

Upload Markdown files straight into Custom GPTs, Claude Projects, or any knowledge base. Scrappey handles the fetch and conversion; the model sees clean prose.

AI agents with web access

Give your agent a tool that returns Markdown from JavaScript-heavy URLs you are authorized to access. Works with LangChain, LlamaIndex, and MCP-compatible hosts.

Research and evaluation datasets

Build reproducible, text-clean corpora from public web pages. Markdown preserves headings and lists, which matters for downstream training and eval.

Why Scrappey Markdown

What sets this apart from HTML-only converters

Works on JavaScript-heavy sites

Most Markdown tools fail silently on sites behind a CDN or JS verification flow. Scrappey handles the browser rendering and verification workflow first, converts second.

Clean, consistent output

We strip scripts, styles, and layout chrome, then convert to Markdown using a tuned readability pipeline. Headings, lists, and tables survive.

One API, one rate

The Markdown mode is not a separate product. It is a parameter on the same API you use for HTML or JSON — same pay-as-you-go rate.

LLM-aware defaults

Output is opinionated for language-model consumption — no base64 images, no redundant whitespace, stable section order.

What the output looks like

Clean structure. No script tags, no tracking pixels, no style bloat.

Input URL
https://example.com/articles/
  web-scraping-2026
Output (Markdown)
# Web Scraping in 2026

The scraping landscape has shifted...

## Key trends

- Modern website complexity is now table stakes
- LLM-ready output is a new expectation
- Pay-as-you-go pricing is rising

## Further reading

1. [Introduction to RAG](/intro)
2. [Structured data extraction](/json)

Frequently asked

How is this different from /tools/markdown-converter?
The tool page is a free in-browser playground — paste a URL, get Markdown. This page is the API-driven product: programmatic access at pay-as-you-go rates for integration into your app, agent, or RAG pipeline.
Does it work on sites behind bot-protection walls?
Yes. Scrappey handles web application firewalls, bot detection, and JavaScript challenges before converting to Markdown. You get clean output where HTML-only converters fail.
Can I also get raw HTML or JSON?
Yes. Omit the markdown parameter to get the raw HTML response, or pair the request with a JSON parser. The underlying request is the same — flip the markdown flag based on what your pipeline needs.
Is there a LangChain / LlamaIndex / MCP integration?
Community integrations exist for all three. Documentation and examples are at docs.scrappey.com.

Turn any URL into LLM-ready Markdown

A free demo trial to start. No credit card. No subscription.

Create free account
footer-frame

Start building with Scrappey

Try It For Free. No Subscription Required. No Credit Card Required. Instant Set-Up. Your Free Trial Is Waiting For You!

Frequently asked questions

What is Scrappey.com?

Scrappey.com is a web scraping API that handles all the complex aspects of web scraping, such as handling dynamic content, rotating proxies, advanced request handling, headless browsers, and verification processing. It offers an all-in-one solution for extracting publicly available data from websites.

How does Scrappey.com work?

Scrappey.com provides a web scraping API that allows you to send requests to extract publicly available data from websites. It handles dynamic content and modern website complexity, including rotating proxies, advanced request handling, and verification processing. You can easily extract publicly available data from websites using their built-in features like headless browsers and AI-powered data extraction.

Can I customize the proxies used for scraping?

Yes, with Scrappey.com, you have the option to use Sticky Rotating Proxies for seamless scraping. Alternatively, you can also set your own proxies if desired.

Is there a free trial available?

Yes, Scrappey.com offers a free trial where you can try it out without a subscription or credit card. Instant setup is provided, so you can explore the full capabilities of the platform right away.

What happens if a request fails?

We only charge for successful requests. Failed requests are not counted towards your usage, so you only pay for what works.

I need to scroll or click on a button on the page I want to scrape

No problem, you can pass any JavaScript snippet that needs to be executed by using our JavaScript scenario parameter. This allows you to interact with dynamic content, scroll pages, click buttons, wait for elements, and perform any custom JavaScript actions before extracting the data.

What is the pricing structure for Scrappey.com?

Scrappey.com offers simple and transparent pricing: €0.20 per 1,000 direct HTTP requests and €1.00 per 1,000 full-browser requests. Residential proxies are included on both tiers — no separate proxy billing, no hidden fees, no complicated pricing tiers. You only pay for successful requests.

Are there any usage restrictions or limitations?

Scrappey.com provides scalable access for extracting publicly available data. Whether you need to extract data from a few pages or a large dataset of publicly accessible content, you can do so with flexible usage options. Please note that Scrappey.com only supports scraping publicly available data, and users must comply with applicable laws and website terms of service.

What support channels are available?

Scrappey.com provides various support channels for assistance. You can refer to their documentation, frequently asked questions section, blog, and uptime status page. Additionally, you can get in touch with them via email or join their Discord community for further support.

I'm not a developer, can you create custom scraping scripts for me?

We don't create custom scraping scripts, however we will gladly write some code snippets helping you to use our most powerful features: AI-powered data extraction and JavaScript scenario. Our documentation includes examples in multiple programming languages to get you started quickly.

What is a request and how are they counted?

Each API call to Scrappey counts as one request. Our pricing is based on successful requests. By default, JavaScript rendering is enabled, which allows you to extract data from modern websites with dynamic content. All features including proxies, challenge handling, and reliable web access handling are included in each request.

How fast is Scrappey's API and what if a site is hard to scrape?

Scrappey's API is optimized for fast response time, even when working with JavaScript-heavy websites and browser verification flows, where access is authorized. If other tools struggle with sites that use browser verification, Scrappey is designed to handle these workflows efficiently, ensuring reliable data retrieval. Our reliable web access handling, residential proxies, and intelligent retry logic work together to maximize success rates.