Standards

robots.txt

Updated 2026-05-17

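robots.txt is a plain-text file served from the site root (for example, https://example.com/robots.txt) that tells web crawlers which paths they may or may not fetch. It is the canonical place to allow or disallow specific AI crawlers such as GPTBot, ClaudeBot, and PerplexityBot. Compliance is voluntary: well-behaved crawlers apply the rules on a per-User-agent basis, so a group that names a crawler takes precedence over the generic * group for that crawler. Within a group, specificity matters: under RFC 9309 the longest matching path wins, although some older parsers instead apply the first matching rule in file order.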
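To make the matching concrete, here is a minimal Python sketch of how a crawler might pick its group and resolve conflicting rules by specificity, following the longest-match behavior described in RFC 9309. The function names select_rules and is_allowed and the (agent, rules) data layout are illustrative assumptions, not part of any library, and the sketch ignores the * and $ path wildcards.

```python
def select_rules(groups, product_token):
    """Return the rules of the group naming this crawler, else the '*' group."""
    token = product_token.lower()
    matched = [rules for agent, rules in groups if agent.lower() == token]
    if not matched:
        matched = [rules for agent, rules in groups if agent == "*"]
    # Several groups for the same agent are combined into one rule list.
    return [rule for rules in matched for rule in rules]


def is_allowed(rules, path):
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best = None  # (prefix length, allowed)
    for directive, prefix in rules:
        if prefix and path.startswith(prefix):
            candidate = (len(prefix), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# Hypothetical parsed file: a generic group plus a GPTBot-specific group.
groups = [
    ("*", [("allow", "/")]),
    ("GPTBot", [("disallow", "/"), ("allow", "/public/")]),
]

gptbot_rules = select_rules(groups, "GPTBot")
print(is_allowed(gptbot_rules, "/public/faq"))  # True: "/public/" is the longer match
print(is_allowed(gptbot_rules, "/private"))     # False: only "/" matches
print(is_allowed(select_rules(groups, "ClaudeBot"), "/private"))  # True: falls back to *
```

Comparing (length, allowed) tuples keeps the tie-break compact: when two prefixes have the same length, the Allow rule sorts higher and wins.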

Minimal allow-all example

User-agent: *
Allow: /
Sitemap: https://example.com/sitemap.xml
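
One way to sanity-check a file like this is Python's standard-library urllib.robotparser. The sketch below parses the example text from a string instead of fetching it; the crawler name and page URL are placeholders, and site_maps() assumes Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# The allow-all example, parsed from a string rather than fetched.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Sitemap: https://example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Any crawler may fetch any path under the * group.
any_crawler_allowed = parser.can_fetch("GPTBot", "https://example.com/any/page")
sitemaps = parser.site_maps()

print(any_crawler_allowed)  # True
print(sitemaps)             # ['https://example.com/sitemap.xml']
```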

Block GPTBot specifically

User-agent: GPTBot
Disallow: /
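
Python's standard-library urllib.robotparser can confirm that this group blocks only the named crawler. This is a sketch: the page URL is a placeholder, and because the file has no * group, crawlers it does not name are treated as allowed.

```python
from urllib.robotparser import RobotFileParser

# The GPTBot-only example, parsed from a string rather than fetched.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

page = "https://example.com/docs/intro"  # placeholder URL
gptbot_allowed = parser.can_fetch("GPTBot", page)
claudebot_allowed = parser.can_fetch("ClaudeBot", page)

print(gptbot_allowed)     # False
print(claudebot_allowed)  # True
```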
