We respect your privacy.

We use strictly necessary cookies to keep you signed in and to protect against CSRF. With your permission we also use a small amount of first-party analytics to improve the product. We do not sell your data and we do not use third-party advertising trackers. See our cookie policy and privacy policy .

AI crawlers

CCBot

Updated 2026-05-17

CCBot is the crawler for Common Crawl, an open-source web archive that feeds the training corpora of many smaller AI engines and academic models. Blocking CCBot reduces (but doesn't eliminate) inclusion in derivative AI training datasets.

Where this comes up

See how your site scores on CCBot + every other AI-discoverability signal.

Free audit