GEO Visibility
GEO Visibility
AI visibility intelligence
FeaturesBlogAbout
Log in
All posts
Playbooks4 min read

llms.txt: the honest guide — what it is, who reads it, and whether you should ship one

844,000 sites have adopted llms.txt in 18 months. Zero major AI companies have officially confirmed they consume it. Here's what's true, what's marketing, and exactly what to put in yours if you ship it anyway.

GEO Visibility Research
Published May 9, 2026

Jeremy Howard proposed llms.txt on 3 September 2024. By Q1 2026 over 844,000 sites had implemented it (Mintlify, Q1 2026). The vendor blogs say it's a 2-3× citation lift. The AI labs say nothing.

This guide separates what's verified from what's vendor narrative, and gives you a working template if you decide to ship one anyway.

The spec — non-negotiable structure

The canonical document at /llms.txt must be CommonMark and follow this strict order:

  1. H1 — site / project name (# Site Name). Required.
  2. Blockquote summary — one paragraph (> ...) immediately after the H1. Required.
  3. Optional context paragraphs — plain prose, no markdown blocks.
  4. H2 sections — bullet list of - [Title](URL): Description links.
  5. "Optional" H2 — has reserved semantics. Consumers running tight context budgets may skip these.

That's the entire spec. There's a sibling file, /llms-full.txt, that concatenates the full markdown content of every linked page. Use it when you want long-context retrieval systems to ingest your docs in one fetch.

The community LLMs.txt Validator flags llms-full.txt over 500 KB — a useful red-line for your generator's full variant.

Who actually reads it — the honest answer

ConsumerOfficially confirmed?Behavior observed
OpenAI / ChatGPTNoNo public statement; no observed traffic from OAI-SearchBot requesting /llms.txt.
Anthropic / ClaudeBotNoSame.
Perplexity / PerplexityBotNoSame.
Google (Gemini, AIO, AI Mode)NoSame.
Mintlify-hosted docs trafficYes (Mintlify Q1 2026 telemetry)Bots fetch llms-full.txt more frequently than llms.txt on docs platforms.
You.comUnclearSome community reports.

The honest summary: no major engine has confirmed consumption. What we have is observational telemetry on docs platforms (Mintlify) and a lot of pattern-matching from SEO vendors. Skepticism is warranted — but the file costs ~5 minutes to ship and creates zero downside.

What it actually does, in our opinion

llms.txt is most useful as a stake in the ground for entity disambiguation:

  • Tells a retrieval system what you call yourself and what your top-priority URLs are.
  • Concentrates your highest-authority content in one fetch — useful if a long-context model decides to ingest it.
  • Acts as a structured signal that you've thought about AI consumption (correlates with sites that also nail schema + author entities).

What it is not:

  • A direct rank lever.
  • A replacement for robots.txt (different layer — robots.txt is permission, llms.txt is hint).
  • A guarantee of citation lift. Anyone claiming a specific number (1.4×, 2.2×) is extrapolating from a vendor cohort without a control group.

A template that actually validates

Save as /public/llms.txt (or wherever your static files live):

# Your Brand

> One-paragraph summary of what your brand does, what category you compete in,
> and one phrase a journalist would use to introduce you. Keep it under 60 words.

This is your brand's "stake in the ground" for AI search. The lines above are
the only required fields; everything below is optional context that
long-context retrieval systems may ingest.

## Core pages

- [Homepage](https://yourbrand.com/): Headline + main value prop.
- [Pricing](https://yourbrand.com/pricing): Tier breakdown and what's in/out.
- [About](https://yourbrand.com/about): Founders, team, what makes you different.

## Documentation

- [Quickstart](https://yourbrand.com/docs/quickstart): 5-minute hands-on.
- [API reference](https://yourbrand.com/docs/api): Full endpoint catalog.
- [Integrations](https://yourbrand.com/docs/integrations): Stack compatibility.

## Trust & authority

- [Security](https://yourbrand.com/security): SOC 2, encryption, data handling.
- [Customer stories](https://yourbrand.com/customers): Named accounts + outcomes.
- [Changelog](https://yourbrand.com/changelog): Recent updates (proves freshness).

## Optional

- [Blog](https://yourbrand.com/blog): Long-form research.
- [Newsroom](https://yourbrand.com/press): Press coverage.

Run it through the validator before shipping. The blockquote line is the #1 thing people get wrong — it must come immediately after the H1 with no blank line and no other markdown between them.

Cloudflare's nested-llms.txt pattern — worth copying

Their root /llms.txt points each product to its own product-scoped llms.txt, and there's a single archive at developers.cloudflare.com/llms-full.txt for offline / bulk ingestion. This nested approach scales better than a single mega-file once you have more than ~20 top-level sections.

Fetch their llms.txt if you want to see the pattern. Particularly useful for SaaS with multiple products.

What to do if you don't ship one

Run our free audit — it checks for llms.txt presence and content quality, and also checks the 14 AI crawler user-agent rules in your robots.txt. The robots.txt side has more confirmed consumption than llms.txt does today, so it's the higher-leverage half of the access-protocol stack.

Sources

  • llms.txt spec — https://llmstxt.org/
  • Jeremy Howard introduction (Sep 3 2024) — https://www.answer.ai/posts/2024-09-03-llmstxt.html
  • LLMs.txt Validator — https://llmstxtvalidator.dev/
  • Mintlify: "The value of llms.txt: hype or real?" — https://www.mintlify.com/blog/the-value-of-llms-txt-hype-or-real
  • Cloudflare docs llms.txt — https://developers.cloudflare.com/llms.txt

Want to know if your llms.txt actually validates? Run a free audit — we check spec compliance, file size, and whether your robots.txt is letting in the AI crawlers that matter.

llms.txtGEOAI CrawlersSchema
Share

Want this analysis for your site?

Our free audit runs the same checks across ChatGPT, Claude, Gemini, Perplexity, and Claude — 9 analyzers, ~3 minutes, no sign-up required.

Run free audit

Like this post? Get the next one.

One email per new piece of research. Engine-by-engine field notes, primary-sourced.

We use your email only for new-post notifications. Unsubscribe in one click.

Keep reading

Related posts

All posts →
Playbooks6 min

What is GEO? A plain-English guide to Generative Engine Optimization

May 20, 2026
Playbooks5 min

Schema.org for AI search in 2026 — which JSON-LD types still matter after Google killed FAQ rich results

May 8, 2026
Engine Research5 min

What every AI engine actually cites in 2026 — engine-by-engine field guide

May 10, 2026
On this page
  • The spec — non-negotiable structure
  • Who actually reads it — the honest answer
  • What it actually does, in our opinion
  • A template that actually validates
  • Cloudflare's nested-llms.txt pattern — worth copying
  • What to do if you don't ship one
  • Sources
GEO Visibility
GEO Visibility
AI visibility intelligence

Track how ChatGPT, Claude, Gemini, and Perplexity talk about your brand, then turn those gaps into action.

A Product of AI Guru®

Product

  • Features
  • Blog
  • About

Intelligence

  • AI Visibility Tracking
  • Multi-region tracking
  • Cross-page schema coverage
  • Citation Intelligence
  • Localization Signal Score
  • Social Listening
  • Brand memory

Company

  • Privacy Policy
  • Terms of Service
  • [email protected]

© 2026 GEO Visibility. All rights reserved.

Built for operators monitoring AI search visibility, citations, and answer-engine share of voice.