generate_llms_txt
Generate llms.txt from HTML pages, grouping by URL prefix. Creates spec-compliant Markdown for LLM crawlers to find canonical content.
Instructions
Generate a draft llms.txt from a folder of HTML pages.
llms.txt (https://llmstxt.org) is a proposed standard for telling
LLM crawlers where your canonical content lives. This tool reads each
page's <title> and meta description, groups by URL prefix, and
produces spec-compliant Markdown ready to drop at /llms.txt.
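The metadata-reading step described above can be sketched with Python's standard library. This is only an illustration, not the tool's actual implementation: the names `PageMetaParser` and `extract_meta` are hypothetical, and the sketch assumes each page carries at most one `<title>` and one `<meta name="description">`.

```python
from html.parser import HTMLParser


class PageMetaParser(HTMLParser):
    """Collects the <title> text and the meta description of one page."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.title = ""
        self.description = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            # Match name="description" case-insensitively.
            if (attr.get("name") or "").lower() == "description":
                self.description = (attr.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Title text may arrive in several chunks, so accumulate it.
        if self._in_title:
            self.title += data


def extract_meta(html_text):
    """Return (title, description) for one HTML document (hypothetical helper)."""
    parser = PageMetaParser()
    parser.feed(html_text)
    parser.close()
    return parser.title.strip(), parser.description
```

A page with no meta description yields an empty string here. How the real tool handles that case is not stated in this document.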
section_strategy controls grouping:
- first_path_segment (default): groups by first URL path segment, e.g. all /blog/... pages under "Blog".
- flat: a single "Pages" section.
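The two strategies can be sketched roughly as follows. This is an assumption-laden illustration rather than the tool's code: `section_for` and `render_llms_txt` are hypothetical names, the "Pages" fallback for root-level pages is a guess, and the output layout follows the llms.txt convention of an H1 title, an optional blockquote summary, and H2 sections containing link lists.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def section_for(url, strategy="first_path_segment"):
    """Pick a section name for a URL (hypothetical helper)."""
    if strategy == "flat":
        return "Pages"
    parts = urlsplit(url).path.lstrip("/").split("/")
    # Assumption: pages directly under the root fall back to "Pages".
    if len(parts) < 2 or not parts[0]:
        return "Pages"
    return parts[0].replace("-", " ").title()


def render_llms_txt(site_title, site_summary, pages, strategy="first_path_segment"):
    """Render (url, title, description) tuples as llms.txt-style Markdown."""
    sections = defaultdict(list)
    for url, title, description in pages:
        sections[section_for(url, strategy)].append((url, title, description))

    lines = [f"# {site_title}", ""]
    if site_summary:
        lines += [f"> {site_summary}", ""]
    for name in sorted(sections):
        lines += [f"## {name}", ""]
        for url, title, description in sections[name]:
            suffix = f": {description}" if description else ""
            lines.append(f"- [{title}]({url}){suffix}")
        lines.append("")
    return "\n".join(lines)
```

With `strategy="flat"`, every page lands in a single "Pages" section. Sorting section names alphabetically is a choice made for this sketch only.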
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| folder | Yes | | |
| site_url | Yes | | |
| site_title | No | | |
| site_summary | No | | |
| pattern | No | | *.html |
| section_strategy | No | | first_path_segment |
| limit | No | | |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |