Crawl any website or URL list to detect broken links and the exact failure mode. Returns HTTP status code, error category (4xx, 5xx, DNS, timeout, SSL, redirect-loop), response time, the source page, the link anchor text, and the full redirect chain. Configurable crawl depth, link-type filters (internal / external / asset), and concurrency. Built for bulk: feed it 100,000 URLs or seed it with a starting domain and let it discover everything. Ideal for SEO maintenance, migration QA, content audits, and link-hygiene monitoring.
# Start a run via the Apify API curl -X POST "https://api.apify.com/v2/acts/santamaria-automations~broken-link-checker/runs?token=YOUR_TOKEN" \ -H "Content-Type: application/json" \ -d '{ "startUrls": [ "https://example.com", "https://docs.example.com" ], "maxDepth": 3, "followExternal": true, "includeAssets": false, "maxLinksPerPage": 200 }' # Or use with AI agents via MCP: # https://mcp.apify.com?tools=santamaria-automations/broken-link-checker
| Field | Type | Example |
|---|---|---|
| url | string | https://example.com/old-page |
| source_page | string | https://example.com/blog/2024-recap |
| anchor_text | string | our 2024 roadmap |
| status_code | integer | 404 |
| error_category | string | client_error |
| is_broken | boolean | true |
| response_time_ms | integer | 142 |
| link_type | string | internal |
| redirect_count | integer | 0 |
| checked_at | string | 2026-06-13T10:15:42Z |