Skip to main content
All posts

$ ~/ym8 --read llms-txt-guide

How to Create llms.txt: The robots.txt for AI

Technical2026-02-268 min read

what_is_llms_txt

llms-txt-overview.md

llms.txt is a plain-text file placed at the root of your website that provides AI language models with a structured summary of who you are, what you do, and how your brand should be described. Think of it as the robots.txt for the AI era — but instead of telling crawlers which pages to index, it tells AI models what your brand is and what matters most.

The concept emerged from the growing recognition that AI engines do not process websites the way search engines do. A search engine indexes pages and ranks them. An AI engine needs to understand entities, relationships, and context in order to generate accurate answers. llms.txt bridges this gap by providing that context in a format language models can parse efficiently.

While still an emerging standard, llms.txt adoption is accelerating. Thousands of sites have already implemented it, and AI engine providers are increasingly referencing these files during retrieval-augmented generation (RAG). For brands serious about AEO, implementing llms.txt is one of the highest-impact, lowest-effort actions available.

file_format

File Format and Structure

llms.txt uses a simple Markdown-based format. The file is human-readable and machine-parseable. Here is the standard structure that most implementations follow.

llms.txt

H1 heading: The file starts with a single H1 heading — your brand or site name. This sets the primary entity context for the entire file.

Blockquote summary: A one-to-two sentence blockquote immediately after the heading that defines what your brand is and what it does. This is the most important line in the file — it is the sentence you want AI models to use when describing you.

H2 sections: Subsequent sections use H2 headings to cover topics like key pages, products, services, team, and any other context that helps AI models describe your brand accurately.

Linked references: Each section can include Markdown links to relevant pages. These links give AI models a citation path and a way to retrieve more detail when generating answers.

placement

Where to Place llms.txt

The standard placement is at your domain root: https://yourdomain.com/llms.txt. This mirrors the convention established by robots.txt and is where AI systems look first. The file should be served as plain text with a text/plain or text/markdown content type.

In addition to the root-level llms.txt, you can also create a more detailed version at /llms-full.txt. The standard llms.txt should be concise — a focused summary that AI models can consume quickly. The full version can include extended descriptions, complete product listings, team information, and deeper context.

For further structured machine-readable context, consider adding a llm-profile.json at /.well-known/llm-profile.json. This JSON file provides the same information in a structured format that AI systems can parse programmatically. Together, llms.txt and llm-profile.json create a comprehensive AI-readable identity layer for your brand.

content_guide

What to Include in Your llms.txt

[01] Brand identity statement. A clear, one-sentence definition of what your brand is. Avoid marketing fluff. Write the sentence you want AI to repeat when asked about you. Example: "Acme Corp is a B2B SaaS platform that automates accounts payable for mid-market companies."

[02] Core products and services. List your primary offerings with brief, factual descriptions. Include differentiators that you want AI models to associate with your brand. Link to the relevant product pages.

[03] Key pages. Link to the 5-15 most important pages on your site. These should include your homepage, product pages, about page, and any high-value content that defines your expertise. Each link should have a brief description.

[04] Factual claims and data points. Include verifiable facts: founding year, number of customers, geographic coverage, certifications, notable clients (with permission), and any metrics that establish credibility. AI models weigh factual claims they can cross-reference.

[05] What NOT to include. Avoid subjective superlatives ("the world's best"), unverifiable claims, internal jargon, and promotional language. AI models are trained to discount marketing copy. Factual, verifiable information is weighted more heavily.

adoption

Current Adoption Status

adoption-status.log

As of early 2026, llms.txt is an emerging convention rather than a formal standard. There is no RFC or W3C specification — the format evolved organically from the AI optimisation community. Despite this, adoption is substantial and growing rapidly.

Major technology companies, SaaS platforms, media publishers, and e-commerce sites have implemented llms.txt. The convention is referenced in AI retrieval system documentation and is increasingly supported by RAG pipeline tooling. Several AEO monitoring platforms now check for llms.txt presence as part of their technical AEO audits.

The trajectory is clear: llms.txt is following the same adoption curve as robots.txt did in the early web era. Early adopters who implement it now benefit from first-mover advantage. As AI engines increasingly integrate these files into their retrieval processes, having a well-crafted llms.txt will become table stakes for any brand that cares about how AI represents them.

Implementing llms.txt takes less than an hour for most sites. The return on that investment — more accurate AI representations, better citation rates, and improved Share of Model — makes it one of the best time-to-value ratios in the entire AEO toolkit.

implementation

Implementation Checklist

checklist.sh

Create /llms.txt at your domain root with your brand summary, key pages, and core offerings.

Create /llms-full.txt with extended descriptions, product details, and team information.

Add /.well-known/llm-profile.json with structured JSON metadata for programmatic access.

Verify all files are accessible (HTTP 200) and served with correct content types.

Reference llms.txt in your /.well-known/ai.txt file for additional discoverability.

Set a quarterly review cadence to keep the content accurate as your brand evolves.

related_posts