Extract

Web Scraping API for HTML Extraction at Scale

Extract raw or rendered HTML from public URLs with a web scraping API built for JavaScript rendering, proxy rotation, and reliable webpage retrieval at scale.

GET api.opengraph.io/api/1.1/scrape?site=https://example.com/pricing

Response: 200 OK

status: 200
finalUrl: "https://example.com/pricing"
title: "Pricing — Example"
rendered: false
html: "<!DOCTYPE html><html><head>..."
htmlLength: 48291
proxy: true
cached: false

1B+ URLs processed
1,000s of developers

Powering link previews, metadata extraction, Markdown conversion, scraping workflows, and URL intelligence at scale.

Use cases

Retrieve webpage HTML without managing scraping infrastructure

The Web Scraping API helps developers fetch raw or rendered HTML from public URLs without maintaining browsers, proxies, retries, and custom scraping pipelines.

  • HTML extraction
  • Rendered page retrieval
  • Content pipelines
  • Website monitoring
  • SEO analysis
  • Internal research tools
  • Data collection workflows
  • AI and content processing

API response

HTML and page response data from one URL

Send a URL and receive the webpage content and response details your application needs for downstream processing, extraction, monitoring, or analysis.

Raw HTML

Retrieve the original HTML response from a public webpage.

Rendered HTML

Fetch content after JavaScript rendering when pages require a browser-like environment.

Final URL

See whether the request was redirected and which destination URL it resolved to after all hops.

Status and response data

Understand whether the request succeeded, redirected, or failed.

Page title

Capture basic page-level context alongside the HTML response.

Response metadata

Get request and response details that support logging, debugging, and processing.

Proxy-backed retrieval

Use scraping infrastructure without managing your own proxy rotation.

Developer-ready output

Receive webpage content in a format your system can process, store, or pass to other tools.
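As a rough illustration of the fields listed above, the sketch below parses a response shaped like the sample at the top of this page into a typed record. The field names (`status`, `finalUrl`, `title`, `rendered`, `html`, `htmlLength`, `proxy`, `cached`) come from that sample. The `ScrapeResult` class, the `parse_scrape_response` helper, and the assumption that the body is flat JSON are illustrative only, so check the API reference for the actual schema.

```python
import json
from dataclasses import dataclass


@dataclass
class ScrapeResult:
    """Hypothetical container for the fields shown in the sample response."""
    status: int
    final_url: str
    title: str
    rendered: bool
    html: str
    html_length: int
    proxy: bool
    cached: bool

    @property
    def succeeded(self) -> bool:
        # Treat any 2xx status as a successful retrieval.
        return 200 <= self.status < 300


def parse_scrape_response(body: str) -> ScrapeResult:
    """Parse a JSON body that is ASSUMED to be flat, as in the sample."""
    data = json.loads(body)
    html = data.get("html", "")
    return ScrapeResult(
        status=int(data["status"]),
        final_url=data.get("finalUrl", ""),
        title=data.get("title", ""),
        rendered=bool(data.get("rendered", False)),
        html=html,
        # Fall back to the measured length if the field is absent.
        html_length=int(data.get("htmlLength", len(html))),
        proxy=bool(data.get("proxy", False)),
        cached=bool(data.get("cached", False)),
    )


# Body modelled on the sample response shown on this page.
sample = json.dumps({
    "status": 200,
    "finalUrl": "https://example.com/pricing",
    "title": "Pricing \u2014 Example",
    "rendered": False,
    "html": "<!DOCTYPE html><html><head>...",
    "htmlLength": 48291,
    "proxy": True,
    "cached": False,
})
result = parse_scrape_response(sample)
print(result.status, result.final_url, result.succeeded)
```

Keeping the parsing in one place means downstream code can log `final_url` and `status` alongside the stored HTML without touching the raw JSON.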

API comparison

Choose the right Extract API

Three extraction APIs, three different outputs. Here is how to know which one your workflow needs.

Web Scraping API (this page)

Fetch raw or rendered HTML from public URLs.

Use when:

You need page HTML and want control over how your system parses, stores, or processes the content.

HTML to Markdown API

Convert webpages or HTML into clean Markdown.

Use when:

You need readable Markdown for AI apps, RAG pipelines, summarization, research, or content processing.

Content Extraction API

Extract readable or structured webpage content.

Use when:

You want cleaner extracted content without processing full raw HTML yourself.

JavaScript rendering

Scrape JavaScript-rendered websites without running browsers yourself

Modern websites often load important content after the initial HTML response. OpenGraph.io can render pages in a browser-like environment so your scraping workflow can retrieve content that depends on JavaScript.

  • Retrieve rendered HTML from JavaScript-heavy pages
  • Avoid managing your own headless browser infrastructure
  • Support pages that need browser-style rendering to load content
  • Reduce reliance on custom Playwright or Puppeteer setups
  • Use one API for both raw and rendered webpage retrieval
full_render=false (raw HTML):
<!-- Content not yet loaded --><div id="app"></div>

full_render=true (rendered HTML):
<div id="app">
  <h1>Pricing</h1>
  <p>$29/month...</p>
</div>
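One way to use the raw versus rendered choice is to request raw HTML first and fall back to `full_render=true` only when the raw response looks like an empty application shell. The sketch below is a heuristic, not part of the API: the `looks_like_empty_shell` and `get_html` helpers, the 10-character threshold, and the `fetch(site, full_render)` callable are all hypothetical, and the callable stands in for whatever function actually calls the Web Scraping API.

```python
from html.parser import HTMLParser
from typing import Callable, Tuple


class _TextCounter(HTMLParser):
    """Counts visible text characters, skipping script and style content."""

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self.text_chars = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Comments never reach handle_data, so they are not counted.
        if self._skip_depth == 0:
            self.text_chars += len(data.strip())


def looks_like_empty_shell(html: str, min_text_chars: int = 10) -> bool:
    """Heuristic: very little visible text suggests content loads via JavaScript."""
    counter = _TextCounter()
    counter.feed(html)
    counter.close()
    return counter.text_chars < min_text_chars


def get_html(site: str, fetch: Callable[[str, bool], str]) -> Tuple[str, bool]:
    """Return (html, rendered), trying raw retrieval before rendered retrieval."""
    raw = fetch(site, False)
    if not looks_like_empty_shell(raw):
        return raw, False
    return fetch(site, True), True


def fake_fetch(site: str, full_render: bool) -> str:
    """Stand-in for an API call, returning the two snippets shown above."""
    if full_render:
        return '<div id="app">  <h1>Pricing</h1>  <p>$29/month...</p></div>'
    return '<!-- Content not yet loaded --><div id="app"></div>'


html, rendered = get_html("https://example.com/pricing", fake_fetch)
print(rendered)
```

Trying raw retrieval first keeps rendered requests for the pages that need them. A page with a long `<title>` or boilerplate footer can still pass the text check, so the threshold should be tuned against real URLs.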
Scraping infrastructure

Scraping infrastructure handled for you

Building a reliable scraping workflow means managing proxies, retries, rendering, failures, and blocked requests. OpenGraph.io handles the infrastructure so your team can focus on using the HTML instead of maintaining the retrieval layer.

Proxy-backed requests

Use scraping infrastructure without setting up or maintaining your own proxy pool.

Request retries

Improve reliability when pages fail, time out, or return inconsistent responses.

Rendered retrieval

Fetch content from pages that need JavaScript rendering without running a browser.

Scalable API access

Move from one-off URL checks to higher-volume webpage retrieval workflows.

Cleaner operations

Avoid maintaining browser workers, proxy vendors, and custom scraping scripts.

Developer-first workflow

Use a simple API instead of stitching together browsers, proxies, and parsing logic.

Build vs buy

A simpler alternative to maintaining scraping scripts

Build it yourself

You manage:

  • Browser automation (Playwright, Puppeteer)
  • Headless browser infrastructure
  • Proxy vendors and rotation logic
  • Retries and failure handling
  • Rendering edge cases and timeouts
  • Infrastructure scaling
  • Response cleanup and parsing

Use the Web Scraping API

OpenGraph.io handles:

  • URL retrieval with one API call
  • Optional JavaScript rendering
  • Proxy-backed requests included
  • Response handling and retries
  • Scalable access without infra
  • Integration with the full URL Intelligence Platform

How it works

From URL to HTML in seconds

01

Send a URL

Pass a public webpage URL to OpenGraph.io. Add rendering or proxy options as needed.

02

OpenGraph.io retrieves the page

The platform fetches the page, handles rendering options, and processes the response.

03

Receive HTML output

Use the returned HTML in your app, pipeline, extraction workflow, monitoring system, or internal tool.
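The three steps above might look like the following in practice. This is a minimal sketch using only the Python standard library. The endpoint path and the `site` and `full_render` parameters are taken from the examples on this page. The `https://` scheme, the `app_id` name for the API key parameter, and the JSON response body are assumptions to confirm against the API reference. The `build_scrape_url` and `fetch_page` helpers are hypothetical names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint as shown in the sample request; the https scheme is an ASSUMPTION.
API_BASE = "https://api.opengraph.io/api/1.1/scrape"


def build_scrape_url(site: str, api_key: str, full_render: bool = False) -> str:
    """Step 1: build the request URL for a public page."""
    params = {
        "site": site,
        "full_render": "true" if full_render else "false",
        # ASSUMPTION: the key parameter name is not stated on this page.
        "app_id": api_key,
    }
    # urlencode percent-encodes the target URL so it is safe as a query value.
    return API_BASE + "?" + urllib.parse.urlencode(params)


def fetch_page(site: str, api_key: str, full_render: bool = False,
               timeout: float = 30.0) -> dict:
    """Steps 2 and 3: call the API and return the decoded body.

    ASSUMPTION: the body is JSON with an "html" field, as in the sample.
    """
    url = build_scrape_url(site, api_key, full_render)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the URL needs no network access, so it is safe to run as-is.
url = build_scrape_url("https://example.com/pricing",
                       api_key="YOUR_API_KEY", full_render=True)
print(url)
```

With a real key, `fetch_page("https://example.com/pricing", "YOUR_API_KEY")["html"]` would then hand the page HTML to your parser, pipeline, or monitoring job.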

Developer experience

Built for developers who need reliable webpage retrieval

Simple API access

Use an API key to retrieve webpage HTML from public URLs. No SDK required.

Rendering when needed

Enable JavaScript rendering for pages that need a browser-like environment.

Docs when you need them

Full API reference with parameters, response schema, and code examples.

Free requests to start

The free tier includes 100 requests per month. Test real URLs before scaling.

Scales with your workflow

Support one-off retrieval, batch processing, monitoring, and internal pipelines.

Works across URL workflows

Pair scraping output with Markdown conversion, content extraction, screenshots, and previews.
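For the batch and monitoring workflows mentioned above, one simple approach is to fingerprint each page's HTML and compare it with the previous run. The sketch below assumes the HTML has already been retrieved. The `fingerprint` and `detect_changes` helpers and the in-memory `store` are hypothetical, and nothing here is an OpenGraph.io feature.

```python
import hashlib
from typing import Dict, List


def fingerprint(html: str) -> str:
    """Return a SHA-256 hex digest of the HTML, used as a cheap change signal."""
    return hashlib.sha256(html.encode("utf-8")).hexdigest()


def detect_changes(store: Dict[str, str], pages: Dict[str, str]) -> List[str]:
    """Return URLs whose HTML is new or changed, and update the stored digests.

    `store` maps URL -> last seen digest; `pages` maps URL -> freshly fetched HTML.
    """
    changed = []
    for url, html in pages.items():
        digest = fingerprint(html)
        if store.get(url) != digest:
            changed.append(url)
        store[url] = digest
    return changed


store: Dict[str, str] = {}

# First run: every URL is new, so every URL is reported.
first = detect_changes(store, {
    "https://example.com/pricing": "<h1>Pricing</h1><p>$29/month</p>",
    "https://example.com/about": "<h1>About</h1>",
})

# Second run: only the pricing page has different HTML.
second = detect_changes(store, {
    "https://example.com/pricing": "<h1>Pricing</h1><p>$39/month</p>",
    "https://example.com/about": "<h1>About</h1>",
})
print(first, second)
```

Hashing the full document is strict: pages that embed timestamps or per-request tokens will register as changed on every run, so a real monitor may want to hash only the extracted region it cares about.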

FAQ

What is a web scraping API?

A web scraping API lets developers send a URL and retrieve webpage content, such as raw HTML or rendered HTML, without building the scraping infrastructure themselves.

What is the difference between raw HTML and rendered HTML?

Raw HTML is the initial response returned by the web server. Rendered HTML includes content that appears after JavaScript runs in a browser-like environment, which is useful for SPAs and dynamic pages.

Is the Web Scraping API a headless browser API?

Not exactly. A headless browser API typically gives broader browser automation controls. The Web Scraping API is focused on retrieving webpage HTML and response data through a simple REST API.

Can the API scrape JavaScript-rendered pages?

Yes. The API supports JavaScript rendering for pages where important content loads after the initial HTML response, with no Playwright or Puppeteer setup required.

How is the Web Scraping API different from the HTML to Markdown API?

The Web Scraping API returns raw or rendered HTML for teams that want lower-level access to webpage content. The HTML to Markdown API converts webpages into cleaner Markdown for AI, RAG, research, and content workflows.

Do I need to manage my own proxies?

No. Proxy-backed requests are part of the scraping infrastructure. You do not need to set up or rotate your own proxies.

Where can I find the documentation?

Full documentation, including request parameters, response schema, rendering options, and code examples, is available in the API reference.

Start scraping webpages with one API

Use OpenGraph.io to retrieve raw or rendered HTML from public URLs without maintaining your own browser, proxy, and scraping infrastructure.

No credit card required. Free requests included.