Rank & RecommendGet the playbook
All tools
Free tool

AI Robots Config Helper

Decide which AI crawlers can read your site, with the tradeoffs spelled out. Pick a posture, copy the robots.txt block, done.

robots.txt · paste at your-site.com/robots.txt
# AI crawler rules generated by RANK & RECOMMEND
# Recommended: blocks training-only crawlers, allows live and search bots that cite you.

# ---- Blocked: training-only crawlers ----

# OpenAI: Crawls the public web to train ChatGPT models
User-agent: GPTBot
Disallow: /

# Anthropic: Crawls the public web to train Claude models
User-agent: ClaudeBot
Disallow: /

# Anthropic: Older Anthropic crawler identifier (training)
User-agent: anthropic-ai
Disallow: /

# Google: Controls whether your content trains Gemini and Vertex AI
User-agent: Google-Extended
Disallow: /

# Apple: Trains Apple Intelligence models
User-agent: Applebot-Extended
Disallow: /

# Common Crawl: General crawl used to train many open and closed LLMs
User-agent: CCBot
Disallow: /

# ByteDance: Aggressive crawler used for TikTok / Doubao training
User-agent: Bytespider
Disallow: /

# Meta: Trains Llama and other Meta AI
User-agent: Meta-ExternalAgent
Disallow: /

# Cohere: Crawls for Cohere model training
User-agent: cohere-ai
Disallow: /

# ---- Allowed: live and search bots that can cite you ----

# OpenAI: Live fetch when a ChatGPT user clicks a link in an answer
User-agent: ChatGPT-User
Allow: /

# OpenAI: Indexes pages for ChatGPT Search and inline citations
User-agent: OAI-SearchBot
Allow: /

# Anthropic: Live fetch when a Claude user opens a link in an answer
User-agent: Claude-User
Allow: /

# Anthropic: Indexes pages for Claude's search and citations
User-agent: Claude-SearchBot
Allow: /

# Perplexity: Indexes pages for Perplexity answers and source citations
User-agent: PerplexityBot
Allow: /

# Perplexity: Live fetch when a Perplexity user runs a query
User-agent: Perplexity-User
Allow: /

# Google: Various Google products, including some AI research
User-agent: GoogleOther
Allow: /

# Meta: Indexes pages for Facebook link previews
User-agent: FacebookBot
Allow: /

# DuckDuckGo: Powers DuckDuckGo's AI assistant answers
User-agent: DuckAssistBot
Allow: /

# Diffbot: Knowledge graph crawler used by AI products
User-agent: Diffbot
Allow: /
ALL TRACKED AI CRAWLERS
USER-AGENTOWNERPURPOSEIN THIS CONFIG
GPTBotOpenAICrawls the public web to train ChatGPT modelsBLOCKED
ChatGPT-UserOpenAILive fetch when a ChatGPT user clicks a link in an answerALLOWED
OAI-SearchBotOpenAIIndexes pages for ChatGPT Search and inline citationsALLOWED
ClaudeBotAnthropicCrawls the public web to train Claude modelsBLOCKED
Claude-UserAnthropicLive fetch when a Claude user opens a link in an answerALLOWED
Claude-SearchBotAnthropicIndexes pages for Claude's search and citationsALLOWED
anthropic-aiAnthropicOlder Anthropic crawler identifier (training)BLOCKED
PerplexityBotPerplexityIndexes pages for Perplexity answers and source citationsALLOWED
Perplexity-UserPerplexityLive fetch when a Perplexity user runs a queryALLOWED
Google-ExtendedGoogleControls whether your content trains Gemini and Vertex AIBLOCKED
GoogleOtherGoogleVarious Google products, including some AI researchALLOWED
Applebot-ExtendedAppleTrains Apple Intelligence modelsBLOCKED
CCBotCommon CrawlGeneral crawl used to train many open and closed LLMsBLOCKED
BytespiderByteDanceAggressive crawler used for TikTok / Doubao trainingBLOCKED
Meta-ExternalAgentMetaTrains Llama and other Meta AIBLOCKED
FacebookBotMetaIndexes pages for Facebook link previewsALLOWED
DuckAssistBotDuckDuckGoPowers DuckDuckGo's AI assistant answersALLOWED
cohere-aiCohereCrawls for Cohere model trainingBLOCKED
DiffbotDiffbotKnowledge graph crawler used by AI productsALLOWED
MONITOR WHO ACTUALLY OBEYS YOUR ROBOTS.TXT
Members get an AI Crawler Visit Log that shows which AI crawlers hit which paths, plus alerts when one ignores your robots.txt. Join the waitlist for the crawler log and the free GEO Playbook.
LIKED THIS TOOL?
Get the full GEO Playbook free. 14 pages.
Send me the PDF →Try membership $1