Schema | extracto-mcp

extracto-mcp

Environment variables used to configure the server. Only `EXTRACTO_API_KEY` is required; the others are optional overrides.

Name	Required	Description
`EXTRACTO_API_KEY`	Yes	Your key from app.getextracto.dev/keys.
`EXTRACTO_BASE_URL`	No	Override the API host (defaults to https://app.getextracto.dev).
`EXTRACTO_TIMEOUT_MS`	No	Per-request timeout in ms (default 90000).
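
As a rough illustration of how the table above resolves, here is a minimal sketch that reads the three variables and applies the documented defaults. The `load_config` helper and the shape of the returned dict are hypothetical and not part of extracto-mcp; only the variable names and default values come from the table.

```python
import os

# Defaults taken from the table above.
DEFAULT_BASE_URL = "https://app.getextracto.dev"
DEFAULT_TIMEOUT_MS = 90000


def load_config(env=None):
    """Resolve the documented variables into a plain dict (hypothetical helper)."""
    env = os.environ if env is None else env

    # EXTRACTO_API_KEY is the only required variable.
    api_key = (env.get("EXTRACTO_API_KEY") or "").strip()
    if not api_key:
        raise ValueError("EXTRACTO_API_KEY is required")

    # Optional overrides fall back to the documented defaults.
    base_url = env.get("EXTRACTO_BASE_URL") or DEFAULT_BASE_URL
    timeout_ms = int(env.get("EXTRACTO_TIMEOUT_MS") or DEFAULT_TIMEOUT_MS)

    return {
        "api_key": api_key,
        "base_url": base_url.rstrip("/"),
        "timeout_ms": timeout_ms,
    }
```

Passing a dict instead of relying on `os.environ` keeps the helper easy to exercise, for example `load_config({"EXTRACTO_API_KEY": "my-key"})`.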

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": true }
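
In the Model Context Protocol, a server that declares `listChanged: true` under `tools` may send a `notifications/tools/list_changed` notification when its tool list changes. The sketch below shows one way a client might react by dropping a cached tool list. The `ToolListCache` class and the `fetch_tools` callable are hypothetical names used for illustration only.

```python
class ToolListCache:
    """Caches the tool list and invalidates it on a list_changed notification."""

    def __init__(self, fetch_tools):
        # fetch_tools is a hypothetical callable that performs a tools/list request.
        self._fetch_tools = fetch_tools
        self._tools = None

    def tools(self):
        # Fetch lazily and reuse the result until it is invalidated.
        if self._tools is None:
            self._tools = self._fetch_tools()
        return self._tools

    def handle_notification(self, message):
        # message is assumed to be a decoded JSON-RPC notification dict.
        if message.get("method") == "notifications/tools/list_changed":
            self._tools = None
```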

Functions exposed to the LLM to take actions

Name	Description
extract	Extract structured data from a public web page and return it as validated, typed JSON. Extracto renders the page (JavaScript included), runs a schema-constrained extraction, and returns ONLY fields that match the schema. Missing data comes back as null rather than a hallucinated guess. Best for a single known URL. This call is synchronous (up to ~90s); for heavy or anti-bot pages prefer `extract_async`.
extract_async	Submit an asynchronous extraction job for a heavy, slow, or anti-bot-protected page. Returns a job id immediately; poll it with `get_job` until status is "success" or "failed". Use this instead of `extract` when a page is large or likely to need stealth rendering.
get_job	Fetch the current status and (once complete) the result of an async extraction job created with `extract_async`. Status is one of pending, processing, success, failed.
list_jobs	List your recent async extraction jobs, newest first.
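
The asynchronous flow described above (submit with `extract_async`, then poll `get_job` until the status is "success" or "failed") can be sketched as a simple polling loop. This is a minimal sketch under stated assumptions: `call_tool` stands in for whatever your MCP client uses to invoke a tool, and the `job_id` argument name and the `status` key in the returned dict are guesses, since the listing does not show the tools' input or output schemas.

```python
import time

# Terminal statuses named in the tool descriptions above.
TERMINAL_STATUSES = {"success", "failed"}


def wait_for_job(call_tool, job_id, interval_s=2.0, max_polls=60, sleep=time.sleep):
    """Poll get_job until the job reaches a terminal status.

    call_tool(name, arguments) is a hypothetical callable supplied by the
    caller's MCP client; it is assumed to return a dict with a "status" key.
    """
    for _ in range(max_polls):
        # "job_id" is an assumed argument name, not confirmed by the listing.
        job = call_tool("get_job", {"job_id": job_id})
        if job.get("status") in TERMINAL_STATUSES:
            return job
        # Still pending or processing: wait before asking again.
        sleep(interval_s)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Bounding the number of polls keeps a stuck job from blocking the caller indefinitely; the interval and limit shown here are arbitrary placeholders.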

Interactive templates invoked by user choice

Name	Description
No prompts

Contextual data attached and managed by the client

Name	Description
No resources

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/massanaRoger/extracto-mcp'
