Sites
Your owned-domain index. Sites is where you watch what Lumear has crawled, kick off re-crawls, and triage pages that AI keeps citing back to you.
What “Sites” actually means
Every domain you add to a brand becomes a row on the Sites page. Each domain has a list of pages we’ve crawled, with their last-seen HTTP status, page-type classification (homepage / blog / product / pricing / docs / etc.), word count, and crawl timestamp.
The page count and the per-page page-type breakdown are what feed the recommendation engine. A thin or stale index will produce thin recommendations.
Filtering pages
- Search — substring match against URL and title.
- Domain — narrow to one brand domain (only relevant when a brand has multiple).
- Page type — filter to blog posts, product pages, comparisons, etc. Useful for “show me every comparison page on the site”.
- Section — filter by the first URL path segment (e.g.
/blog,/products,/investors). Best for “everything under the Resources tree”.
Crawl scopes
Each domain has three crawl modes; pick based on how recently the site changed:
- Topical (default) — Firecrawl
/map+ a filtered batch scrape. Lumear picks the pages most likely to match the brand’s prompt sets. Fast and cheap; recommended for re-crawls. - Cited only — re-fetch only the URLs AI assistants have actually cited in recent runs. Tiny scope, near-real-time freshness. Good for “a single page changed and I want it re-indexed.”
- Full audit — crawl every page Firecrawl can find up to the cap. Expensive; reserve for first crawls and post-redesign refreshes.
Per-page status
Each row carries an HTTP status badge. 2xx means the page was indexed successfully; 3xx means it redirected (we follow); 4xxmeans broken (Lumear ignores the URL); 5xx usually means the origin was flaky and we’ll retry on the next crawl.
Using Sites to debug recommendations
If your Recommendations feel off-target, the Sites page is usually the first place to look:
- Filter by section to see how thin a key area is. A “build a /comparison/ page” recommendation makes more sense if your section count there is zero.
- Look at the page-type distribution. If you have 200 blog posts and three product pages, your AEO content gap may be product-side, not editorial.
- Spot stale crawls — pages last crawled 30+ days ago may have drifted from what AI actually sees today.