Machine-Readable Infrastructure

A single documented allow/block matrix exists, with separate entries for training crawlers, search crawlers, and user-triggered fetchers, recorded per bot and per property. (Enable AI Search Access)

The deployed robots.txt on every domain and subdomain matches the central matrix — no divergence between properties.
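One way to check for divergence is to parse each deployed robots.txt and compare its verdict for every bot against the matrix. The sketch below assumes the matrix is held as a plain dictionary; the host name, the matrix contents, and the function name robots_divergences are all hypothetical, and it probes a single path rather than the whole site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical central matrix: property -> user agent -> allowed?
MATRIX = {
    "www.example.com": {
        "GPTBot": False,          # training crawler
        "OAI-SearchBot": True,    # search crawler
        "PerplexityBot": True,    # search crawler
    },
}


def robots_divergences(host, robots_txt, matrix, probe_path="/"):
    """Return (agent, expected, actual) for every bot whose robots.txt
    verdict on probe_path differs from the matrix entry for this host."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    url = f"https://{host}{probe_path}"
    divergences = []
    for agent, expected in matrix[host].items():
        actual = parser.can_fetch(agent, url)
        if actual != expected:
            divergences.append((agent, expected, actual))
    return divergences
```

In practice the robots_txt argument would be the body fetched from each domain and subdomain, and an empty list for every property means the deployment matches the matrix for the probed path. Probing a few representative paths per property, not only "/", would catch path-specific rules.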

No search-visibility crawler the matrix allows (OAI-SearchBot, Bingbot, PerplexityBot) is blocked anywhere in practice.

evidence: server logs show 200s for each allowed user agent
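As a rough illustration of gathering that evidence, the sketch below tallies response codes per allowed user agent from access-log lines. It assumes the logs are in the common "combined" format with the user agent as the last quoted field; the regular expression, the agent list, and the function names are assumptions for this example, and substring matching on the user agent does not prove the request came from the real crawler.

```python
import re
from collections import Counter, defaultdict

# Agents the matrix allows; taken from the checklist item above.
ALLOWED_AGENTS = ("OAI-SearchBot", "bingbot", "PerplexityBot")

# Assumed combined log format:
# ... "METHOD path protocol" status size "referer" "user-agent"
LINE_RE = re.compile(
    r'"\S+ \S+ [^"]*" (?P<status>\d{3}) \S+ "[^"]*" "(?P<ua>[^"]*)"'
)


def status_by_agent(lines, agents=ALLOWED_AGENTS):
    """Count HTTP status codes per agent (case-insensitive substring match)."""
    counts = defaultdict(Counter)
    for line in lines:
        match = LINE_RE.search(line)
        if not match:
            continue  # skip lines that do not fit the assumed format
        ua = match.group("ua").lower()
        for agent in agents:
            if agent.lower() in ua:
                counts[agent][match.group("status")] += 1
    return counts


def agents_without_200(counts, agents=ALLOWED_AGENTS):
    """List allowed agents for which no 200 response was logged."""
    return [a for a in agents if counts.get(a, {}).get("200", 0) == 0]
```

An agent appearing in the agents_without_200 result is either being blocked somewhere or has not visited in the sampled window; the two cases need to be told apart by looking at its non-200 counts.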

WAF, CDN, and bot-management layers do not override permissive robots.txt rules. (Edge Worker Bot Management)

evidence: test-fetch key pages as each AI user agent through the production edge
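A minimal sketch of such a test-fetch follows, using only the Python standard library. The user-agent strings are shortened placeholders (real crawlers send longer strings, which should be taken from each vendor's documentation), and the names fetch_status and edge_matrix are hypothetical. Note the limitation: bot-management layers that verify crawler IP ranges may treat a spoofed user agent from your own machine differently from the genuine crawler, so this only exercises user-agent-based rules.

```python
import urllib.error
import urllib.request

# Placeholder user-agent strings for illustration only.
AGENTS = {
    "OAI-SearchBot": "OAI-SearchBot/1.0",
    "bingbot": "Mozilla/5.0 (compatible; bingbot/2.0)",
    "PerplexityBot": "PerplexityBot/1.0",
}


def fetch_status(url, user_agent, timeout=10):
    """Return the HTTP status for url fetched with the given User-Agent,
    or None if the request failed before any response arrived."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses still carry a status code
    except urllib.error.URLError:
        return None  # DNS failure, refused connection, timeout, etc.


def edge_matrix(urls, agents=AGENTS, fetch=fetch_status):
    """Map (url, agent name) -> status for every page/agent combination."""
    return {
        (url, name): fetch(url, ua)
        for url in urls
        for name, ua in agents.items()
    }
```

Calling edge_matrix with a list of key production URLs gives one status per page per agent; any 403, 429, or challenge-page status for an agent the matrix allows suggests the edge is overriding robots.txt.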

Actual bot traffic in server logs matches declared policy. (Parse Logs for AI Bot Behavior)

Meta robots and X-Robots-Tag states match intent on every template — no inherited noindex leaks.
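To make the template check concrete, the sketch below collects directives from both the X-Robots-Tag response header and the robots meta tag of a page, then flags a noindex. The function names are hypothetical, and it is deliberately simple: it looks only at meta tags named "robots" (not bot-specific names) and treats any header directive containing "noindex", including user-agent-scoped ones, as a hit.

```python
from html.parser import HTMLParser


class _MetaRobots(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = {key: (value or "") for key, value in attrs}
        if attributes.get("name", "").lower() == "robots":
            content = attributes.get("content", "")
            self.directives += [
                d.strip().lower() for d in content.split(",") if d.strip()
            ]


def robots_directives(headers, html):
    """Return the set of directives from X-Robots-Tag and meta robots."""
    found = set()
    for name, value in headers.items():
        if name.lower() == "x-robots-tag":
            found.update(d.strip().lower() for d in value.split(",") if d.strip())
    parser = _MetaRobots()
    parser.feed(html)
    found.update(parser.directives)
    return found


def is_noindexed(headers, html):
    """True if any directive asks for noindex ("none" implies noindex)."""
    return any(
        "noindex" in d or d == "none" for d in robots_directives(headers, html)
    )
```

Running is_noindexed over one sample URL per template, and comparing the result with whether that template is meant to be indexed, surfaces inherited noindex states.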

Any llms.txt investment is justified by observed fetches in your own server logs, not advocacy. (Schema to Avoid)
