CLI
indxel crawl
Crawl a live website. Audit every page. Get cross-page analysis.
Usage
bash
npx indxel-cli crawl <url> [options]Flags
| Parameter | Type | Description |
|---|---|---|
| --push | boolean | Send results to the Indxel dashboard (requires --api-key or INDXEL_API_KEY) |
| --api-key <key> | string | Project API key for dashboard push (or set INDXEL_API_KEY env var) |
| --max-pages <n> | number | Maximum pages to crawl (default: 50) |
| --max-depth <n> | number | Maximum link depth (default: 5) |
| --delay <ms> | number | Delay between requests in ms (default: 200) |
| --ignore <patterns> | string | Comma-separated glob patterns to skip (e.g. '/admin/*,/api/*') |
| --strict | boolean | Treat warnings as errors |
| --json | boolean | Output as JSON |
| --skip-assets | boolean | Skip og:image and favicon verification |
| --skip-sitemap | boolean | Skip sitemap.xml check |
| --skip-robots | boolean | Skip robots.txt check |
Example
Basic crawlbash
$ npx indxel-cli crawl https://mysite.com
indxel crawl — https://mysite.com
Crawling... 47/50 pages (2.3s)
Score: 88/100 (B)
✓ 42 pages pass
⚠ 3 pages with warnings
✗ 2 pages with errors
Cross-page issues:
✗ 2 duplicate titles found
✗ 1 broken internal link (/old-page → 404)
⚠ 3 pages with thin content (< 100 words)
⚠ 1 orphan page (not linked from anywhere)
Sitemap: 45/47 pages in sitemap (2 missing)
Robots: OK — no important pages blocked
Assets: 1 broken og:image (/blog/old-post)Push to dashboard
Send results to your dashboardbash
# Using flag
npx indxel-cli crawl https://mysite.com --push --api-key ixu_your_key
# Using environment variable
export INDXEL_API_KEY=ixu_your_key
npx indxel-cli crawl https://mysite.com --pushAPI key
Get your account API key from the dashboard settings. It starts with
ixu_.JSON output
bash
# Get all pages with score below 80
npx indxel-cli crawl https://mysite.com --json | jq '.pages[] | select(.score < 80)'
# Get cross-page issues only
npx indxel-cli crawl https://mysite.com --json | jq '.analysis'Cross-Page Analysis
The crawl automatically detects issues that span multiple pages:
- Duplicate titles — Pages sharing the same title tag
- Duplicate descriptions — Pages sharing the same meta description
- H1 issues — Missing or multiple H1 tags
- Broken internal links — Links pointing to 404s
- Redirect chains — Pages with redirect chains
- Thin content — Pages with fewer than 100 words
- Orphan pages — Pages not linked from any other page
- Slow pages — Pages with high response times
- Structured data summary — JSON-LD types found across the site