check_robots
Fetch and parse a site's robots.txt from any page URL. Retrieve user-agent groups, allow/disallow rules, and declared sitemaps to understand crawling permissions.
Instructions
Fetch and parse the site's /robots.txt and return its user-agent groups, allow/disallow rules, and declared sitemaps.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Any URL on the site | |
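
Because `url` may be any page on the site, the robots.txt location has to be derived from it before parsing. The following is a minimal sketch of that idea, not the tool's actual implementation: the function names `robots_url` and `parse_robots` and the shape of the returned dictionary are assumptions made for illustration, and the parser handles only the `User-agent`, `Allow`, `Disallow`, and `Sitemap` fields.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Map any page URL to /robots.txt at the root of the same scheme and host.

    Hypothetical helper; the real tool may resolve the location differently.
    """
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def parse_robots(text: str) -> dict:
    """Split robots.txt text into user-agent groups and declared sitemaps.

    The output shape is an assumption chosen for this sketch.
    """
    groups = []    # each group: {"user_agents": [...], "rules": [(kind, path), ...]}
    sitemaps = []
    current = None
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if ":" not in line:
            continue  # blank or malformed line
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            # Consecutive User-agent lines share one group; a User-agent line
            # that follows rules starts a new group.
            if current is None or current["rules"]:
                current = {"user_agents": [], "rules": []}
                groups.append(current)
            current["user_agents"].append(value)
        elif field in ("allow", "disallow") and current is not None:
            current["rules"].append((field, value))
        elif field == "sitemap":
            # Sitemap lines are independent of any user-agent group.
            sitemaps.append(value)
    return {"groups": groups, "sitemaps": sitemaps}
```

Fetching is left out so the sketch stays offline; in practice the text passed to `parse_robots` would come from an HTTP GET of `robots_url(url)`.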