Is Cloudflare blocking your AI crawlers?

By The AI Visibility Checker team7 min read

Last updated: June 8, 2026

TL;DR

You can allow GPTBot in robots.txtand still be invisible to ChatGPT — because Cloudflare blocked the request at the edge before robots.txt ever mattered. Bot Fight Mode, Super Bot Fight Mode's “AI Scrapers and Crawlers” toggle, and Managed robots.txt all do this by default. Here's how to tell if it's happening to you, and how to fix it without turning off your bot protection.

The silent block: how it happens

robots.txt is a polite request a crawler reads after it connects. A CDN or WAF block happens before that — the request gets a 403 (or a managed challenge) and never sees your content or your rules. So the two layers can disagree: your robots.txt says Allow, but Cloudflare says no. The result is a site that looks perfectly open to you and is completely invisible to ChatGPT, Claude and Perplexity. It is the single most common reason a technically healthy site gets no AI citations.

Three Cloudflare features cause this, often without anyone choosing them deliberately:

  • Super Bot Fight Mode → “AI Scrapers and Crawlers”. A single toggle that blocks GPTBot, ClaudeBot, PerplexityBot, Google-Extended and others. On many plans it defaults to on.
  • Bot Fight Mode.Challenges “automated” traffic broadly; legitimate AI crawlers that don't solve the challenge get dropped.
  • Managed robots.txt. Cloudflare can prepend its own Disallow rules at the top of your robots.txt — so even if you added User-agent: GPTBot / Allow: / below, the earlier block wins.

How to check if you're blocked

Don't guess — verify. Three ways, fastest first:

  1. Audit your URL as the AI bots. Run a free audit— we fetch your site with each AI crawler's user-agent and report exactly which ones get a 403, a challenge, or an empty page. That's the fastest way to see the gap the way the models do.
  2. Check your Cloudflare dashboard.Security → Bots. Look at whether Bot Fight Mode is on, and (if you have Super Bot Fight Mode) whether “AI Scrapers and Crawlers” is set to Block.
  3. Read your logs. Filter access logs for user-agents containing GPTBot, ClaudeBot, PerplexityBot and check the status codes. A wall of 403s is your answer. No hits at all can mean the block happens before logging.

Should you block or allow AI bots?

This is a real decision, not a default to leave on autopilot. Blocking the training and retrieval crawlers keeps your content out of AI answers entirely — which protects content you don't want reused, but also removes any chance of being the cited source when someone asks ChatGPT about your topic. For most marketing and product sites, visibility is the goal, so you want these bots allowed. For publishers and paywalled content, the calculus is different. The important thing is that the choice is intentional — most silent blocks are accidental, and accidental invisibility helps no one.

It's also not all-or-nothing. If you want AI bots to access some pages but pay for others — or be blocked from premium content while your marketing pages stay visible — you can enforce that per URL at the edge with Bot Paywall instead of a blunt site-wide block.

How to let the AI crawlers through

If you've decided to be visible, fix both layers — the edge and robots.txt:

  1. In Cloudflare: Security → Bots → Configure Super Bot Fight Mode → set AI Scrapers and Crawlers to Allow (or create a WAF exception for the verified AI bots you want).
  2. If you use Managed robots.txt, check what Cloudflare prepended and make sure it isn't disallowing the bots you want.
  3. In your own /robots.txt, explicitly allow them — see our GPTBot reference and robots.txt for AI crawlers for the exact, copy-paste directives and the full crawler list.
  4. Re-audit to confirm the 403s are gone and the bots now see your real content.

FAQ

I allowed GPTBot in robots.txt — why is it still blocked?

Because the block is happening at the CDN/WAF layer, before robots.txt is read. robots.txt only works once the crawler is allowed to connect. Fix the Cloudflare bot settings first, then robots.txt does its job.

Does allowing AI bots hurt my Google ranking?

No. GPTBot, ClaudeBot and PerplexityBot are separate from Googlebot. Allowing or blocking them has no effect on your Google rankings — it only changes whether you can appear in AI assistants' answers.

Won't allowing AI bots increase my server load?

The well-behaved crawlers respect robots.txt crawl directives and back off on errors. If load is your concern, the right tool is rate-limiting or charging per request with Bot Paywall — not an accidental block that also kills your visibility.

Not sure whether your AI crawlers are getting through? Run a free audit — it checks every major AI bot in about 30 seconds — or see the plans.

  • 7 min read

    Get cited by ChatGPT, Claude & Perplexity

    A practical, no-fluff checklist for making your site quotable by AI assistants in 2026: crawlability, JavaScript rendering, schema, llms.txt, and how to measure it.

  • 6 min read

    AEO vs GEO vs SEO: what’s the difference?

    AEO, GEO and SEO explained without the hype: what each optimizes for, where they overlap, when each matters, and whether you actually need all three in 2026.

  • 7 min read

    How to track AI traffic in GA4 (ChatGPT & Perplexity)

    AI referral traffic hides inside GA4’s Referral channel. Build a custom channel group and segment to see visits from ChatGPT, Perplexity, Gemini and Copilot — and understand what GA4 can’t show you.