What is WebGlean?

WebGlean turns any URL into structured, AI-ready data. It handles JavaScript rendering, content cleaning, and structured extraction so you don't have to.

Use it when you need to:

  • Feed web content into an LLM or RAG pipeline
  • Monitor pages for changes
  • Extract structured data from websites
  • Build agents that browse the web

Features

FeatureDescription
JS renderingFull Playwright browser — handles SPAs, lazy loading, dynamic content
Content cleaningMozilla Readability strips nav, ads, footers, cookie banners
7 APIsScrape, Crawl, Extract, Map, Monitor, Search, Batch
AI extractionClaude-powered structured JSON extraction from any page
Credit-basedPay per use — credits never expire

Quick example

curl -X POST https://www.webglean.com/v1/scrape \
  -H "Authorization: Bearer wg_your_key" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com"}'
{
  "success": true,
  "data": {
    "markdown": "# Example Domain\n\nThis domain is for use in illustrative examples...",
    "metadata": {
      "title": "Example Domain",
      "statusCode": 200
    }
  }
}

Next steps

  • Quickstart — get your first scrape running in 2 minutes
  • Authentication — create and manage API keys
  • API reference — full endpoint documentation
  • SDKs — official Node.js and Python client libraries
  • CLI — run scrapes from your terminal
  • MCP Server — give AI agents live web access