Enable AI Search Access
Last reviewed:
Use this when access — not content quality — is the suspected cause of citation or crawl absence in AI search environments. Comparable pages from comparable sites are being cited; yours are not.
Confirm you’re solving an access problem, not a content problem
If comparable pages from comparable sites are being cited and yours are not, access is a plausible cause. If nothing on the topic from any site is being cited, content quality is the first lever — run Build an AI-Visible Content Page instead.
Decision point: work in order — infrastructure (CDN/WAF) first → robots.txt → X-Robots-Tag/meta robots → rendered output. Each layer can override the layer below it.
Map the crawler to the target surface
Each AI product uses a distinct crawler token — fixing the wrong one changes nothing.
- ChatGPT Search:
OAI-SearchBot— notGPTBot, which is training-only - Bing Copilot and Bing AI summaries:
Bingbot— one token controls all Bing Search and Bing AI surfaces - Google AI Overviews and AI Mode:
Googlebot— no separate AI inclusion mechanism exists - Google Gemini training and Vertex AI grounding:
Google-Extended— separate from Search; blocking it does not affect Google Search AI features - Perplexity:
PerplexityBot
Confirm core SEO eligibility first
If fundamentals are broken, AI-specific controls are irrelevant.
- The URL returns a consistent 200 — soft 404s suppress indexing without a visible error signal
- Canonical is intentional and doesn’t point into a redirect chain or a noindex target
- The page is not noindex via meta robots or X-Robots-Tag
- Internal links support discovery from already-crawled pages
Audit robots.txt for each relevant crawler
Wildcard rules and named rules interact in non-obvious ways.
- Check Disallow rules for
User-agent: *— the wildcard applies to all bots unless a named rule overrides it OAI-SearchBotandGPTBotare independent tokens — a rule for one does not apply to the other- Blocking
Googlebotremoves the page from every Google Search AI feature — useGoogle-Extendedto opt out of training only - Wildcards in agent names (
User-agent: GPT*) may not work as intended — use exact token names
Validate: allowing
OAI-SearchBotby name while aUser-agent: *Disallow rule blocks it anyway is the most common failure here — check the wildcard rule resolves the way you expect for every named bot.
Review meta robots and X-Robots-Tag
JS-injected directives and CDN headers are the two most commonly missed sources.
- Check meta robots in both page source and rendered output — JS-injected noindex is common and invisible in source
- X-Robots-Tag from the CDN layer overrides page-level meta robots
robots.txtblocks crawl; noindex blocks indexing — a page can be crawlable and noindex at the same time
Review infrastructure controls
CDN and WAF rules run before robots.txt is read — they override everything below them.
- WAF rulesets and bot-management tools can block crawlers before page-level rules apply
- Rate limiting produces incomplete crawls even when robots.txt is permissive
- Geo-filtering can route AI crawler IPs away from intended content — verify crawler IP ranges are not filtered against each platform’s published ranges
Validate rendered output
Content must be present in rendered HTML, not visible only after JS execution or user interaction.
- Compare source HTML to rendered HTML using Search Console URL Inspection or a headless tool
- Blocked JS, CSS, or API calls silently remove content from what crawlers see
- Lazy-load, tab, accordion, and interaction-gated content is not reliably extracted
Review preview and extraction controls
Snippet controls that block preview also block AI answer extraction.
nosnippet,max-snippet, anddata-nosnippetprevent content from appearing in AI answers- Check whether these are set intentionally on these pages or inherited via a template default
- Bing supports
data-nosnippetfor both Bing Search snippets and Bing AI answers
Accelerate freshness after changes
After removing blocks, push freshness signals rather than waiting for organic recrawl.
- Submit via IndexNow for Bing and Yandex — it processes within minutes of submission
- Use Search Console URL Inspection to request indexing for Google priority pages
- Resubmit the sitemap if affected URLs were missing from it
Monitor after changes
Access fixes remove a barrier — content quality still determines citation frequency.
- Bing AI Performance in Bing Webmaster Tools shows citation counts, cited URLs, and grounding query phrases
- Google Search Console does not currently expose AI-specific citation data
- A page that becomes accessible but fails on content quality, trust, or extractability will not gain citations — that’s expected, not a sign the access fix failed
Decision point: blocking
Googlebotto prevent training access blocks all Google Search AI features. UseGoogle-Extendedto opt out of training only — this is the single most consequential crawler-token mistake in this workflow.