Secure Scraping API
The Scraping API allows AI agents to safely fetch web content while automatically scanning for threats like indirect prompt injections and malicious payloads.Why Use Secure Scraping?
When AI agents browse the web, they can encounter:- Indirect Prompt Injections: Hidden instructions in web pages designed to hijack the agent
- Malicious Content: Scripts, hidden text, or encoded payloads
- Exfiltration Attempts: Content designed to extract sensitive data
Endpoints
Scrape URL
Securely scrape a single URL.Batch Scrape
Scrape multiple URLs efficiently.SDK Usage
- Python
- Node.js
Threat Detection
The scraping API detects:| Threat | Description |
|---|---|
indirect_prompt_injection | Hidden instructions in HTML comments, CSS, or JavaScript |
malicious_script | Dangerous JavaScript or encoded payloads |
hidden_content | Invisible text or elements designed to manipulate AI |
exfiltration_attempt | Content designed to extract data via the AI |
Options
| Parameter | Type | Default | Description |
|---|---|---|---|
render_js | boolean | false | Render JavaScript (slower but more complete) |
extract_text | boolean | true | Extract clean text only |
timeout | integer | 30 | Request timeout in seconds |