CLI

indxel crawl

Crawl a live website. Audit every page. Get cross-page analysis.

Usage

bash
npx indxel-cli crawl <url> [options]

Flags

ParameterTypeDescription
--pushbooleanSend results to the Indxel dashboard (requires --api-key or INDXEL_API_KEY)
--api-key <key>stringProject API key for dashboard push (or set INDXEL_API_KEY env var)
--max-pages <n>numberMaximum pages to crawl (default: 50)
--max-depth <n>numberMaximum link depth (default: 5)
--delay <ms>numberDelay between requests in ms (default: 200)
--ignore <patterns>stringComma-separated glob patterns to skip (e.g. '/admin/*,/api/*')
--strictbooleanTreat warnings as errors
--jsonbooleanOutput as JSON
--skip-assetsbooleanSkip og:image and favicon verification
--skip-sitemapbooleanSkip sitemap.xml check
--skip-robotsbooleanSkip robots.txt check

Example

Basic crawlbash
$ npx indxel-cli crawl https://mysite.com

  indxel crawl — https://mysite.com

  Crawling... 47/50 pages (2.3s)

  Score: 88/100 (B)

  ✓ 42 pages pass
  ⚠ 3 pages with warnings
  ✗ 2 pages with errors

  Cross-page issues:
    ✗ 2 duplicate titles found
    ✗ 1 broken internal link (/old-page → 404)
    ⚠ 3 pages with thin content (< 100 words)
    ⚠ 1 orphan page (not linked from anywhere)

  Sitemap: 45/47 pages in sitemap (2 missing)
  Robots: OK — no important pages blocked
  Assets: 1 broken og:image (/blog/old-post)

Push to dashboard

Send results to your dashboardbash
# Using flag
npx indxel-cli crawl https://mysite.com --push --api-key ixu_your_key

# Using environment variable
export INDXEL_API_KEY=ixu_your_key
npx indxel-cli crawl https://mysite.com --push

API key

Get your account API key from the dashboard settings. It starts with ixu_.

JSON output

bash
# Get all pages with score below 80
npx indxel-cli crawl https://mysite.com --json | jq '.pages[] | select(.score < 80)'

# Get cross-page issues only
npx indxel-cli crawl https://mysite.com --json | jq '.analysis'

Cross-Page Analysis

The crawl automatically detects issues that span multiple pages:

  • Duplicate titles — Pages sharing the same title tag
  • Duplicate descriptions — Pages sharing the same meta description
  • H1 issues — Missing or multiple H1 tags
  • Broken internal links — Links pointing to 404s
  • Redirect chains — Pages with redirect chains
  • Thin content — Pages with fewer than 100 words
  • Orphan pages — Pages not linked from any other page
  • Slow pages — Pages with high response times
  • Structured data summary — JSON-LD types found across the site