Web Extract
Extract structured data from any web page, either as clean markdown or as JSON keyed by CSS/XPath selectors you provide.
POST
Authentication
Requires an API key in the Authorization header.
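As a rough sketch of what an authenticated call could look like, the helper below builds a POST request with the key in the Authorization header. This section does not state the header scheme, the body encoding, or the endpoint URL, so the "Bearer" prefix and the JSON body are assumptions, and the endpoint is left as a caller-supplied argument. build_request is a hypothetical helper name.

```python
import json
import urllib.request


def build_request(endpoint: str, api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) an authenticated POST request.

    Assumptions not confirmed by this page: the key is sent with a
    "Bearer" prefix and the request body is JSON.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint,  # caller supplies the real endpoint URL
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed scheme
            "Content-Type": "application/json",    # assumed encoding
        },
    )
```

Passing the returned object to urllib.request.urlopen would send it; keep the key in an environment variable or secret store rather than in source.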
Request
The URL to extract from. Must be a valid http:// or https:// URL.
Extraction mode. markdown (default) returns clean page content; selectors returns structured JSON keyed by the selectors map.
Map of output field name → CSS/XPath selector. Used when mode is selectors.
Render the page in a headless browser before extracting. Default: true.
Block ad networks while loading. Default: true.
Block tracking scripts while loading. Default: true.
CSS selector to wait for before extracting (useful for lazy-loaded content).
Page load timeout in milliseconds. Default: 30000. Maximum: 60000.
Optional custom request headers to send with the page load.
Example request
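The original example body is not present in this copy, so the following is only a sketch of how a request body might be assembled and checked on the client side against the constraints listed above (http/https URL, the two modes, the 60000 ms timeout ceiling). The names mode and selectors come from the parameter descriptions; the keys url and timeout, and the helper build_extract_payload, are assumptions for illustration. Requiring a non-empty selectors map in selectors mode is a client-side sanity check, not a documented rule.

```python
from urllib.parse import urlparse

DEFAULT_TIMEOUT_MS = 30000  # documented default
MAX_TIMEOUT_MS = 60000      # documented maximum


def build_extract_payload(url, mode="markdown", selectors=None,
                          timeout_ms=DEFAULT_TIMEOUT_MS):
    """Validate inputs and return a request body as a dict.

    The "url" and "timeout" key names are assumed; check them against
    the real parameter names before use.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("url must be a valid http:// or https:// URL")
    if mode not in ("markdown", "selectors"):
        raise ValueError("mode must be 'markdown' or 'selectors'")
    if mode == "selectors" and not selectors:
        # Client-side check only: the map is what keys the JSON output.
        raise ValueError("selectors mode needs a non-empty selectors map")
    if not 0 < timeout_ms <= MAX_TIMEOUT_MS:
        raise ValueError(f"timeout must be between 1 and {MAX_TIMEOUT_MS} ms")

    payload = {"url": url, "mode": mode, "timeout": timeout_ms}
    if mode == "selectors":
        # Map of output field name -> CSS/XPath selector.
        payload["selectors"] = dict(selectors)
    return payload
```

For instance, build_extract_payload("https://example.com", mode="selectors", selectors={"title": "h1"}) yields a body whose selectors map asks for one output field, title, taken from the first-level heading.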
Response
Whether the extraction succeeded.
The final URL that was extracted (after redirects).
HTTP status code returned by the target page.
The extracted data: markdown content, or a JSON object keyed by your selectors.
Page metadata such as title, description, and language.
Machine-readable error code when success is false.
Human-readable error message when success is false.
Credits consumed. 0 on a failed (refunded) extraction.
Total processing time in milliseconds.
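A caller will usually branch on success before touching the extracted data. The sketch below shows one way to do that on an already-decoded response. Only success is named in the field descriptions; the keys data, error_code, and error_message, along with ExtractError and unwrap_result, are hypothetical names chosen for illustration and should be replaced with the real field names.

```python
class ExtractError(Exception):
    """Raised when a response reports success == false."""

    def __init__(self, code, message):
        super().__init__(f"{code}: {message}")
        self.code = code        # machine-readable error code
        self.message = message  # human-readable error message


def unwrap_result(response: dict):
    """Return the extracted data, or raise ExtractError on failure.

    Key names other than "success" are assumptions.
    """
    if response.get("success"):
        # Markdown string, or a dict keyed by the caller's selectors.
        return response.get("data")
    raise ExtractError(
        response.get("error_code", "unknown"),
        response.get("error_message", ""),
    )
```

Raising on failure keeps the happy path free of repeated success checks, and the exception still carries both the machine-readable code and the human-readable message.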