What is WebGlean?
WebGlean turns any URL into structured, AI-ready data. It handles JavaScript rendering, content cleaning, and structured extraction so you don't have to.
Use it when you need to:
- Feed web content into an LLM or RAG pipeline
- Monitor pages for changes
- Extract structured data from websites
- Build agents that browse the web
Features
| Feature | Description |
|---|---|
| JS rendering | Full Playwright browser — handles SPAs, lazy loading, dynamic content |
| Content cleaning | Mozilla Readability strips nav, ads, footers, cookie banners |
| 7 APIs | Scrape, Crawl, Extract, Map, Monitor, Search, Batch |
| AI extraction | Claude-powered structured JSON extraction from any page |
| Credit-based | Pay per use — credits never expire |
Quick example
curl -X POST https://www.webglean.com/v1/scrape \
-H "Authorization: Bearer wg_your_key" \
-H "Content-Type: application/json" \
-d '{"url": "https://example.com"}'
{
"success": true,
"data": {
"markdown": "# Example Domain\n\nThis domain is for use in illustrative examples...",
"metadata": {
"title": "Example Domain",
"statusCode": 200
}
}
}
Next steps
- Quickstart — get your first scrape running in 2 minutes
- Authentication — create and manage API keys
- API reference — full endpoint documentation
- SDKs — official Node.js and Python client libraries
- CLI — run scrapes from your terminal
- MCP Server — give AI agents live web access
WebGlean