Skip to main content
Glama
AutomateLab-tech

automatelab-ai-seo

Official

extract_entities

Extract named entities, linked concepts, and sameAs graph nodes from page content or raw text using rule-based heuristics. Build entity maps for schema generation or audit entity coverage.

Instructions

Extract named entities, linked concepts, and sameAs graph nodes from a page's content and structured data. Combines body-text NER heuristics with JSON-LD @type / sameAs walking.

Read-only when given url (one HTTP GET). Zero network when given text.

Deterministic, rule-based; no LLM. Output is a list of entities with type, confidence, and any sameAs URIs found in structured data.

When to use: building an entity map for schema generation, or auditing whether a page's entities match its target topic. To validate the JSON-LD itself, use audit_schema.

Either url or text must be provided.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
urlNoPublic URL to fetch and analyze. Either this OR `text` is required.
textNoRaw text/HTML to analyze directly. Either this OR `url` is required.
respect_robotsNoIf true (default), respect robots.txt when fetching `url`. Ignored when `text` is used.
Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully carries the burden. It discloses read-only behavior for URL input, zero network for text input, determinism, rule-based nature, and output format (list with type, confidence, sameAs URIs). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise (~100 words), front-loaded with the main purpose, then covers behavior, usage, and alternatives. Every sentence adds value with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description explains purpose, behavior, when to use, and output format. It lacks explicit handling of mutual exclusivity (e.g., what if both `url` and `text` are provided) and error cases, but overall it is sufficiently clear for a retrieval tool with three parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema covers 100% of parameters with descriptions. The description adds no new parameter-level meaning beyond the schema; it merely restates that 'url' or 'text' must be provided. Baseline 3 is appropriate given full schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description states a specific verb and resource: 'Extract named entities, linked concepts, and sameAs graph nodes from a page's content and structured data.' It clearly distinguishes itself from sibling tool `audit_schema` by noting that to validate JSON-LD, that tool should be used instead.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use ('building an entity map for schema generation, or auditing whether a page's entities match its target topic') and when not to use ('To validate the JSON-LD itself, use `audit_schema`'). Provides clear context and an alternative.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/AutomateLab-tech/ai-seo-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server