Title: PufferSights – AI Crawler Insights
Author: senols
Published: June 9, 2026
Last modified: June 9, 2026

---



# PufferSights – AI Crawler Insights

 By [senols](https://profiles.wordpress.org/senols/)

[Download](https://downloads.wordpress.org/plugin/puffersights-ai-crawler-insights.0.1.0.zip)

 [Support](https://wordpress.org/support/plugin/puffersights-ai-crawler-insights/)

## Description

PufferSights monitors 100+ known AI crawler and AI agent user agents and hashes
IP addresses instead of storing them raw. It groups traffic by bot, provider, crawl
purpose, content type, and response status, tracks human referrals from AI surfaces,
and can publish a dynamic llms.txt content map for public site content.

The dashboard summarizes:

 * HTTP traffic by bot.
 * Crawl purpose.
 * Content type.
 * Response status.
 * AI referrals and crawl-to-refer ratio.
 * Top crawled content.
 * Tracked agent count.
 * Dynamic llms.txt content map.
 * robots.txt audit and policy snippets.
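
As a rough illustration of the crawl-to-refer ratio listed above, the sketch below divides AI crawler requests by AI referrals for the same period. The function name and the exact definition are assumptions for illustration; the plugin's own calculation may differ.

```python
from typing import Optional


def crawl_to_refer_ratio(crawler_requests: int, ai_referrals: int) -> Optional[float]:
    """Return crawler requests per AI referral (assumed definition).

    Returns None when there are no referrals, because the ratio is
    undefined in that case.
    """
    if ai_referrals == 0:
        return None
    return crawler_requests / ai_referrals


# Hypothetical counts for one reporting period.
ratio = crawl_to_refer_ratio(1200, 8)
```

A high ratio under this definition would mean a provider crawls the site often but sends few human visitors back.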

The crawler registry is based on current public operator documentation and industry
references for OpenAI, Anthropic, Perplexity, Google, Apple, Common Crawl, Meta,
ByteDance, Microsoft, Amazon, and related AI crawler operators.

The plugin does not contact any external service. All analytics data is stored in
your own WordPress database.

robots.txt publishing is off by default. The plugin can generate and optionally 
publish policies for:

 * Monitor only.
 * Block training crawlers.
 * Allow AI search/user-action bots while blocking training crawlers.
 * Block all known AI bots.
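
To make the "Block training crawlers" policy concrete, here is a minimal sketch of how such a robots.txt snippet could be generated. The crawler list and the function name are illustrative assumptions, not the plugin's actual registry or output.

```python
# Illustrative subset of training-related robots.txt tokens (assumed
# classification; the plugin's registry is larger and may differ).
TRAINING_CRAWLERS = ["GPTBot", "ClaudeBot", "CCBot", "Google-Extended"]


def block_rules(user_agents):
    """Build one robots.txt group per user agent that disallows the whole site."""
    groups = [f"User-agent: {ua}\nDisallow: /" for ua in user_agents]
    # Groups are separated by a blank line, as is conventional in robots.txt.
    return "\n\n".join(groups) + "\n"


snippet = block_rules(TRAINING_CRAWLERS)
```

Each group names a single token, so one rule can be removed later without touching the others.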

llms.txt publishing is on by default and can be disabled in the PufferSights settings.
The generated `/llms.txt` file lists selected published public pages and posts in
Markdown so AI assistants can find the site’s main public content more easily.
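
The sketch below shows one way a Markdown content map of this kind could be rendered, following the general shape of the llms.txt proposal (a title, a short summary, and sections of links). The function name, section labels, and example URLs are assumptions for illustration and do not reflect the plugin's actual output.

```python
def render_llms_txt(site_title, summary, sections):
    """Render a Markdown content map from {section heading: [(title, url), ...]}."""
    lines = [f"# {site_title}", "", f"> {summary}", ""]
    for heading, entries in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for title, url in entries:
            lines.append(f"- [{title}]({url})")
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


# Hypothetical site content; only published public items would be passed in.
content_map = render_llms_txt(
    "Example Site",
    "Public pages and posts.",
    {
        "Pages": [("About", "https://example.com/about/")],
        "Posts": [("Hello World", "https://example.com/hello-world/")],
    },
)
```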

### Important Notes

User-agent detection is not bot verification. User agents can be spoofed. Raw IP
addresses are not stored; the plugin stores a salted hash for rough uniqueness.
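
A salted one-way hash can be sketched as follows. HMAC-SHA256 and the function name are assumptions chosen for illustration; this page does not state which algorithm the plugin uses.

```python
import hashlib
import hmac


def hash_ip(ip_address: str, salt: bytes) -> str:
    """Return a keyed SHA-256 digest of the IP address (assumed scheme).

    The same IP and salt always produce the same digest, which allows
    rough uniqueness counts without keeping the raw address.
    """
    return hmac.new(salt, ip_address.encode("utf-8"), hashlib.sha256).hexdigest()


# Hypothetical per-site salt; 203.0.113.7 is a documentation-range address.
digest = hash_ip("203.0.113.7", b"per-site-secret-salt")
```

Because the salt is site-specific in this sketch, digests from two different sites cannot be compared with each other.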

robots.txt is voluntary. Use a WAF, CDN, or server-level controls when technical
enforcement is required.

Google-Extended and Applebot-Extended are robots.txt control tokens rather than 
normal request user agents, so they appear in robots.txt audits and policy snippets
but usually do not appear in request logs.

llms.txt is a content map, not an access-control policy. It does not replace
robots.txt and does not force AI systems to use or cite your content.

### Privacy

PufferSights stores local analytics for public, logged-out requests only. It does
not track wp-admin pages, logged-in users, AJAX requests, or WP-Cron requests.

The plugin stores:

 * Request time and date.
 * Event type, such as AI crawler request or AI referral.
 * HTTP method.
 * Request path without query string.
 * HTTP response status.
 * MIME/content group.
 * Matched crawler or AI referral provider.
 * User-agent string and user-agent hash.
 * Salted one-way hash of the request IP address.
 * Referrer origin only, such as `https://chatgpt.com`, without referrer path or
   query string.

The plugin does not store raw IP addresses, cookies, browser local storage, or complete
referrer URLs. It does not send analytics, telemetry, crawler records, or site data
to third-party services.
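
The two URL reductions described above (request path without query string, referrer origin only) can be sketched with Python's standard library. The helper names are hypothetical and the plugin's own normalization may handle edge cases differently.

```python
from urllib.parse import urlsplit


def referrer_origin(referrer: str) -> str:
    """Keep only scheme and host of a referrer; drop path and query string."""
    parts = urlsplit(referrer)
    if not parts.scheme or not parts.netloc:
        return ""  # Not an absolute URL, so there is no origin to keep.
    return f"{parts.scheme}://{parts.netloc}"


def path_without_query(request_uri: str) -> str:
    """Keep only the path component of a request URI."""
    return urlsplit(request_uri).path or "/"


origin = referrer_origin("https://chatgpt.com/c/abc123?model=x")
path = path_without_query("/blog/post/?utm_source=chatgpt.com")
```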

If llms.txt publishing is enabled, the plugin serves a Markdown overview of selected
published public posts and pages at `/llms.txt`. Drafts, private posts, and
password-protected posts are not included.

Administrators can disable tracking, clear captured events, and configure retention
from the PufferSights admin page. The default retention period is 90 days. On uninstall,
the plugin removes its custom analytics table, saved options, and scheduled cleanup
hook.
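
A retention cleanup of the kind described above can be sketched as a cutoff filter. The in-memory event list, the `time` key, and the function name are assumptions for illustration; the plugin stores events in a database table and runs cleanup from a scheduled hook.

```python
from datetime import datetime, timedelta, timezone


def purge_expired(events, now, retention_days=90):
    """Return only the events that are not older than the retention window."""
    cutoff = now - timedelta(days=retention_days)
    return [event for event in events if event["time"] >= cutoff]


# Hypothetical events: one inside and one outside the 90-day window.
now = datetime(2026, 6, 9, tzinfo=timezone.utc)
events = [
    {"path": "/recent/", "time": now - timedelta(days=10)},
    {"path": "/old/", "time": now - timedelta(days=120)},
]
kept = purge_expired(events, now)
```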

The plugin also adds suggested disclosure text to WordPress’ Privacy Policy Guide.

## Installation

 1. Upload the `puffersights-ai-crawler-insights` folder to `wp-content/plugins`.
 2. Activate PufferSights – AI Crawler Insights.
 3. Open the PufferSights menu in wp-admin.

## FAQ

### Are detected AI crawlers verified?

No. The plugin matches user-agent strings. A request can claim to be GPTBot, ClaudeBot,
Googlebot, or another crawler without being verified. Treat the dashboard as
user-agent analytics unless you add server, CDN, or WAF verification.
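
The sketch below illustrates why user-agent matching is not verification: a simple substring match accepts any request that includes a crawler's token. The registry contents and function name are illustrative assumptions, not the plugin's actual matching logic.

```python
# Illustrative registry: lowercase token -> (bot name, provider).
REGISTRY = {
    "gptbot": ("GPTBot", "OpenAI"),
    "claudebot": ("ClaudeBot", "Anthropic"),
    "perplexitybot": ("PerplexityBot", "Perplexity"),
    "ccbot": ("CCBot", "Common Crawl"),
}


def match_crawler(user_agent: str):
    """Return (bot, provider) for the first registry token found, else None."""
    ua = user_agent.lower()
    for token, info in REGISTRY.items():
        if token in ua:
            return info
    return None


# A request that merely claims to be GPTBot matches just the same.
claimed = match_crawler("curl/8.5.0 GPTBot/1.2")
browser = match_crawler("Mozilla/5.0 (X11; Linux x86_64) Firefox/126.0")
```

Both a genuine crawler and the spoofed request above would be counted under the same bot, which is why the dashboard numbers are best read as user-agent analytics.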

### Does the plugin block AI crawlers?

Not by default. robots.txt publishing is off by default. If enabled, the plugin 
can append crawler-specific robots.txt rules, but robots.txt is voluntary and does
not technically enforce access.

### What is llms.txt?

llms.txt is a proposed Markdown convention for giving AI assistants a concise map
of important public site content. PufferSights can serve a dynamic `/llms.txt` file
with selected published pages and posts. It does not expose drafts, private posts,
or password-protected posts.

### Does the plugin send data to third parties?

No. The plugin does not use external analytics APIs and does not send analytics,
telemetry, crawler records, or site data to any third party.

### What happens when I delete the plugin?

The `uninstall.php` cleanup removes the plugin options, scheduled cleanup hook, 
and custom analytics table.

## Reviews

There are no reviews for this plugin.

## Contributors & Developers

“PufferSights – AI Crawler Insights” is open source software. The following people
have contributed to this plugin.

Contributors

 * [senols](https://profiles.wordpress.org/senols/)

[Translate “PufferSights – AI Crawler Insights” into your language.](https://translate.wordpress.org/projects/wp-plugins/puffersights-ai-crawler-insights)

### Interested in development?

[Browse the code](https://plugins.trac.wordpress.org/browser/puffersights-ai-crawler-insights/),
check out the [SVN repository](https://plugins.svn.wordpress.org/puffersights-ai-crawler-insights/),
or subscribe to the [development log](https://plugins.trac.wordpress.org/log/puffersights-ai-crawler-insights/)
by [RSS](https://plugins.trac.wordpress.org/log/puffersights-ai-crawler-insights/?limit=100&mode=stop_on_copy&format=rss).

## Changelog

#### 0.1.0

Initial release.

## Meta

 * Version **0.1.0**
 * Last updated **18 hours ago**
 * Active installations **Fewer than 10**
 * WordPress version **6.5 or higher**
 * Tested up to **7.0**
 * PHP version **7.4 or higher**
 * Tags: [AI](https://wordpress.org/plugins/tags/ai/), [analytics](https://wordpress.org/plugins/tags/analytics/),
   [bots](https://wordpress.org/plugins/tags/bots/), [crawlers](https://wordpress.org/plugins/tags/crawlers/),
   [llms.txt](https://wordpress.org/plugins/tags/llms-txt/)
 * [Advanced View](https://wordpress.org/plugins/puffersights-ai-crawler-insights/advanced/)
