Sitemap vs crawl — cross-referenced.

Sitemap Audit

XML sitemaps tell search engines which pages you want indexed. But sitemaps often drift from reality: they keep listing deleted pages, redirects, or non-canonical URLs while omitting important new pages. EchoBat cross-references your sitemap entries against actual crawl results to identify discrepancies. The audit surfaces sitemap URLs that return errors, indexable pages missing from the sitemap, and mismatches between sitemap URLs and the canonical tags found on those pages.

How It Works

EchoBat fetches and parses XML sitemaps during the Discovery phase. During the crawl, it records the actual HTTP status code and canonical tag for every URL. The Sitemap Health lens then cross-references in both directions: sitemap entries are checked against crawl results, and crawled indexable pages are checked against sitemap entries. Discrepancies are grouped by type and ranked by severity.
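
The two-way check described above can be illustrated with a short Python sketch. This is not EchoBat's implementation; it is a minimal example under stated assumptions. The names CrawlResult, parse_sitemap, and audit are hypothetical, the finding labels are invented for illustration, and crawl results are assumed to be available as a dictionary keyed by URL. Severity ranking is left out because the source does not specify how it is computed.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Dict, List, Optional, Set

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


@dataclass
class CrawlResult:
    """Hypothetical record of what the crawler saw for one URL."""
    status: int                # HTTP status code returned by the page
    canonical: Optional[str]   # value of the canonical tag, None if absent
    indexable: bool            # whether the page is eligible for indexing


def parse_sitemap(xml_text: str) -> Set[str]:
    """Collect every <loc> value from a <urlset> sitemap document."""
    root = ET.fromstring(xml_text)
    return {
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    }


def audit(sitemap_urls: Set[str],
          crawl: Dict[str, CrawlResult]) -> Dict[str, List[str]]:
    """Cross-reference sitemap entries and crawl results in both directions."""
    findings: Dict[str, List[str]] = {
        "error_in_sitemap": [],
        "redirect_in_sitemap": [],
        "canonical_mismatch": [],
        "not_crawled": [],
        "missing_from_sitemap": [],
    }

    # Direction 1: each sitemap entry is checked against what the crawl saw.
    for url in sorted(sitemap_urls):
        result = crawl.get(url)
        if result is None:
            findings["not_crawled"].append(url)
        elif result.status >= 400:
            findings["error_in_sitemap"].append(url)
        elif 300 <= result.status < 400:
            findings["redirect_in_sitemap"].append(url)
        elif result.canonical and result.canonical != url:
            findings["canonical_mismatch"].append(url)

    # Direction 2: each crawled, indexable page is checked against the sitemap.
    for url in sorted(crawl):
        if crawl[url].indexable and url not in sitemap_urls:
            findings["missing_from_sitemap"].append(url)

    return findings
```

Grouping findings by type in a dictionary mirrors the "grouped by type" step; a real tool would likely also normalize URLs (trailing slashes, case, tracking parameters) before comparing them, which this sketch skips.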

Why It Matters

  • Find drift between your sitemap and actual site state
  • Remove dead URLs from the sitemap that waste crawl budget
  • Discover indexable pages missing from the sitemap
  • Catch canonical mismatches that confuse search engines