GHOSTCRAWL
v1.0 · open beta

Crawl the web
at scale.

GhostCrawl runs full Chromium, Firefox, and WebKit engines so pages render, hydrate, and settle exactly as they would in a real browser — giving you clean, complete, well-formed data every time.

Try it

Real calls. Real output.

Pick a site and a /v1 lane, then hit Run — and see the actual response our engines returned. Real results, no signup.

response scrape

Real output from ghostcrawl's chrome, firefox & webkit engines — every lane, every response.

Capabilities

Every endpoint. One clean pipeline.

Every endpoint runs through the same real browser engines with consistent device profiles. Mix and match for any data flow.

/v1/scrape

Scrape

URL → clean Markdown / JSON / HTML. Shadow-DOM, lazy frames, infinite scroll — fully rendered and settled before you get the result.

Try it ▸
/v1/search

Search

Web search with rendered snippets, dedup & content-hash. Parallel SERPs, ranked output, consistent results every call.

Try it ▸
/v1/extract

Extract

Schema-typed extraction with optional LLM fallback. Define your shape, get strict types back. Sync or async.

Try it ▸
/v1/crawl

Crawl

Site-wide URL discovery with politeness budget & depth. Pair with /v1/map for sitemap-only enumeration.

Try it ▸
/v1/content

Render

Rendered HTML, full-page screenshots (png/jpeg/webp), or print-mode PDFs. Same identity, every artifact.

Try it ▸
/v1/sessions

Sessions

Long-lived browser sessions. Pause, resume, navigate, terminate, re-attach — same identity, days later.

Try it ▸
/v1/sessions/navigate

Navigate

Drive a live browser session manually from your UI — type URLs, click, navigate. Page state streams back over WS.

Try it ▸
/v1/ws

Stream

WebSocket attach to a live session. Page events, debug frames, intercept hooks — straight from the engine.

Try it ▸

3

Native browser engines

12+

Browser × OS identities

0

Probe-surface inconsistencies

Reusable sessions

Coverage

Consistent on every surface.

A consistent browser profile across every rendering surface — cookies, storage, and runtime — from first byte to last frame.

Identity your call

1

Reuse one consistent session — or start a fresh, clean session per request.

Engines authentic

3

Chromium · Firefox · WebKit. Each engine runs natively — Chromium surfaces come from Chromium, Firefox from Firefox.

Sessions stable

Return visits keep their cookies, storage, and browser state. No re-login, no fresh-session tax.

Rendering faithful

end-to-end

Pages run their full lifecycle — load, hydrate, settle — so the data you get back matches what users see.

Engines

Three engines. Every identity. One stack.

Each engine runs production-grade end-to-end. iOS requests use WebKit, matching Apple's platform requirement.

hardened

Chromium

For Chrome, Edge, Brave identities — desktop and mobile.

Windows macOS Linux Android
hardened

Firefox

For Firefox identities. Privacy posture preserved.

Windows Linux Android
hardened

WebKit

For all iOS requests. Chrome, Firefox & Safari on iOS share this engine.

iOS Safari iOS Chrome iOS Firefox macOS Safari
Pricing

Simple, flat monthly plans.

Run it yourself for free, or let us host it. One predictable price per month — no metering, no surprises.

Free

$0 / forever

Run it yourself. Free forever.

  • Self-hosted runtime (your machine, your IP)
  • All three browser engines (Chromium / Firefox / WebKit)
  • Community support
Get the self-host image

Pro

$19 / month

Managed cloud scraping for serious projects.

  • All three browser engines (Chromium / Firefox / WebKit)
  • Fully managed cloud — nothing to run
  • Up to 10 concurrent crawls
  • Sticky sessions
  • Geo targeting
  • Managed browsing behavior
  • Premium fingerprint
  • Email support
Start free trial

Growth

$39 / month

Full managed browsing behavior at growing volume.

  • All three browser engines (Chromium / Firefox / WebKit)
  • Fully managed cloud — nothing to run
  • Up to 25 concurrent crawls
  • Sticky sessions
  • Geo targeting
  • Full managed browsing behavior
  • Premium fingerprint
  • Email support
Start free trial

Scale

$79 / month

High-volume crawling with full managed browsing behavior.

  • All three browser engines (Chromium / Firefox / WebKit)
  • Fully managed cloud — nothing to run
  • Up to 50 concurrent crawls
  • Sticky sessions
  • Geo targeting
  • Full managed browsing behavior
  • Bring-your-own behavior scripts
  • Premium fingerprint
  • Priority support
Start free trial

Enterprise

Custom

Custom volume, dedicated support, and SLA.

Contact sales
FAQ

Common questions.

What makes GhostCrawl different from a headless browser?
Standard headless browsers produce inconsistent, low-fidelity page state — modern web apps fail to hydrate, JavaScript doesn't always run as expected, and the data you get back drifts from what users see. GhostCrawl runs full browser engines end-to-end so pages render, hydrate, and settle the way they're meant to.
Do I need residential proxies?
Residential proxies aren't strictly required, but they're recommended for the cleanest, most reliable results — it's the standard choice for serious, high-volume crawling. GhostCrawl plugs into your proxy provider of choice via SOCKS5.
Is this legal / ethical to use?
Public data, your data, contracted data — yes. We don't support credential stuffing, accessing accounts or content you aren't authorized to, or violating a site's published terms of service. Acceptable-use terms are part of every account.
How is this priced?
Flat monthly plans — Free to self-host, $19/mo Pro, $39/mo Growth, $79/mo Scale, and custom Enterprise. Pick a plan, get everything in it; no metering and no per-request billing. Managed anti-ban routing is included on every paid plan — there is nothing extra to buy.
Can I self-host?
Yes — the Free plan is the self-hosted runtime. Same engines, same identity stack, running on your own hardware and IP, free forever. Prefer not to run it? The managed cloud plans start at $19/mo.