AI Crawler Impact Report

Analyse your server logs to discover which AI crawlers are accessing your site, how much bandwidth they consume, and how to control them with robots.txt.

What this tool does

AI crawlers like GPTBot (OpenAI), ClaudeBot (Anthropic), Bytespider (ByteDance), and others are visiting your website regularly to train large language models and power AI search products. Most site owners have no visibility into this activity.

The AI Crawler Impact Report analyses your server log files and gives you a complete picture of AI crawler activity on your site:

How it works

1

Upload logs

Upload a server log file or paste log entries directly. Supports Apache, Nginx, and CloudFront formats.

2

Instant analysis

Your logs are parsed entirely in the browser. Nothing is sent to a server. Results appear in seconds.

3

Get your report

See a detailed breakdown by crawler with bandwidth, request counts, and a generated robots.txt.

AI crawlers we detect

Crawler Company Purpose
GPTBot OpenAI Training data for GPT models and ChatGPT search
ClaudeBot Anthropic Training data for Claude models
Bytespider ByteDance Training data for TikTok AI features
CCBot Common Crawl Open web dataset used by many AI companies
Google-Extended Google Training data for Gemini and Bard
Applebot-Extended Apple Apple Intelligence training data
Meta-ExternalAgent Meta Training data for Llama models
PerplexityBot Perplexity AI search engine indexing

Example output

AI Crawler Impact Report
GPTBot 12,847 requests 3.2 GB ||||||||||||||||| 42% ClaudeBot 8,234 requests 2.1 GB ||||||||||| 27% Bytespider 5,691 requests 1.4 GB |||||||| 19% CCBot 2,103 requests 0.5 GB ||| 7% PerplexityBot 1,547 requests 0.4 GB || 5% Total: 30,422 requests consuming 7.6 GB of bandwidth

Your data stays private. All log analysis happens in your browser using JavaScript. Your log files are never uploaded to any server. Once you close the page, your data is gone.

Need real-time AI crawler monitoring?

This free tool analyses a single log file. For continuous, real-time tracking of every AI crawler visiting your site, with automatic identity verification and historical trends, use LogLens.