extracto-mcp

by massanaRoger

Server Configuration

Describes the environment variables used to run the server. Only EXTRACTO_API_KEY is required.

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| EXTRACTO_API_KEY | Yes | Your key from app.getextracto.dev/keys. | |
| EXTRACTO_BASE_URL | No | Override the API host. | https://app.getextracto.dev |
| EXTRACTO_TIMEOUT_MS | No | Per-request timeout in ms. | 90000 |
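
As an illustration of how these variables and their defaults fit together, here is a minimal Python sketch of resolving them from the environment. The `ExtractoConfig` class and `load_config` function are hypothetical names made up for this example, not part of extracto-mcp; only the variable names and default values come from the table above.

```python
import os
from dataclasses import dataclass
from typing import Mapping

# Defaults as documented in the configuration table.
DEFAULT_BASE_URL = "https://app.getextracto.dev"
DEFAULT_TIMEOUT_MS = 90000


@dataclass(frozen=True)
class ExtractoConfig:
    """Hypothetical container for the three documented settings."""
    api_key: str
    base_url: str
    timeout_ms: int


def load_config(env: Mapping[str, str] = os.environ) -> ExtractoConfig:
    """Resolve settings from the environment, applying the documented defaults."""
    api_key = env.get("EXTRACTO_API_KEY", "").strip()
    if not api_key:
        # The only required variable: fail early with a clear message.
        raise ValueError("EXTRACTO_API_KEY is required")
    # Unset or empty optional variables fall back to their defaults.
    base_url = env.get("EXTRACTO_BASE_URL") or DEFAULT_BASE_URL
    timeout_ms = int(env.get("EXTRACTO_TIMEOUT_MS") or DEFAULT_TIMEOUT_MS)
    return ExtractoConfig(api_key, base_url.rstrip("/"), timeout_ms)
```

Passing the environment as a mapping keeps the function easy to exercise with a plain dictionary instead of real environment variables.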

Capabilities

Features and capabilities supported by this server

| Capability | Details |
| --- | --- |
| tools | {"listChanged": true} |
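
In the Model Context Protocol, a `listChanged` value of true under `tools` means the server may send a `notifications/tools/list_changed` notification when its tool list changes, after which a client would typically request the list again. The sketch below shows one way a client might react. The `handle_notification` function and the `refresh_tools` callback are hypothetical names for this example, and the transport that delivers the raw message is assumed to exist elsewhere.

```python
import json
from typing import Callable

# Notification method name defined by the Model Context Protocol.
TOOLS_LIST_CHANGED = "notifications/tools/list_changed"


def handle_notification(raw: str, refresh_tools: Callable[[], None]) -> bool:
    """Call refresh_tools when the message is a tools list-changed notification.

    Returns True if the message was handled, False otherwise.
    """
    msg = json.loads(raw)
    # JSON-RPC notifications carry a method but no id.
    if msg.get("method") == TOOLS_LIST_CHANGED and "id" not in msg:
        refresh_tools()
        return True
    return False
```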

Tools

Functions exposed to the LLM to take actions

| Name | Description |
| --- | --- |
| extract | Extract structured data from a public web page and return it as validated, typed JSON. Extracto renders the page (JavaScript included), runs a schema-constrained extraction, and returns ONLY fields that match the schema. Missing data comes back as null rather than a hallucinated guess. Best for a single known URL. This call is synchronous (up to ~90s); for heavy or anti-bot pages prefer extract_async. |
| extract_async | Submit an asynchronous extraction job for a heavy, slow, or anti-bot-protected page. Returns a job id immediately; poll it with get_job until status is "success" or "failed". Use this instead of extract when a page is large or likely to need stealth rendering. |
| get_job | Fetch the current status and (once complete) the result of an async extraction job created with extract_async. Status is one of pending, processing, success, failed. |
| list_jobs | List your recent async extraction jobs, newest first. |
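
The extract_async and get_job descriptions above imply a submit-then-poll workflow. A minimal Python sketch of the polling half follows. It assumes a caller-supplied `get_job` callable that invokes the get_job tool and returns a dictionary with a `status` key; that callable, the `wait_for_job` name, and the polling interval and timeout values are assumptions for illustration. Only the four status values and the two terminal ones come from the tool descriptions.

```python
import time
from typing import Any, Callable, Dict

# Terminal statuses named in the get_job description.
TERMINAL_STATUSES = {"success", "failed"}


def wait_for_job(
    get_job: Callable[[str], Dict[str, Any]],
    job_id: str,
    poll_interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Poll get_job until the job reaches "success" or "failed".

    get_job is a hypothetical wrapper around the get_job tool call.
    Raises TimeoutError if no terminal status is seen within timeout_s.
    """
    deadline = clock() + timeout_s
    while True:
        job = get_job(job_id)
        if job.get("status") in TERMINAL_STATUSES:
            return job
        if clock() >= deadline:
            raise TimeoutError(f"job {job_id} did not finish within {timeout_s}s")
        # Status is still pending or processing: wait before asking again.
        sleep(poll_interval_s)
```

The `sleep` and `clock` parameters exist so the loop can be tested without real delays. In normal use they can be left at their defaults.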

Prompts

Interactive templates invoked by user choice

No prompts

Resources

Contextual data attached and managed by the client

No resources

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/massanaRoger/extracto-mcp'
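
The same request can be made from Python using only the standard library. This is a rough sketch: the `server_url` and `fetch_server` helper names are made up for the example, and it assumes the endpoint returns a JSON body, which the page does not state explicitly.

```python
import json
import urllib.request
from typing import Any
from urllib.parse import quote

# Base path taken from the curl example above.
API_ROOT = "https://glama.ai/api/mcp/v1/servers"


def server_url(owner: str, name: str) -> str:
    """Build the directory API URL for one server, escaping each path segment."""
    return f"{API_ROOT}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_server(owner: str, name: str, timeout_s: float = 10.0) -> Any:
    """GET the server record and decode it, assuming a JSON response."""
    with urllib.request.urlopen(server_url(owner, name), timeout=timeout_s) as resp:
        return json.load(resp)
```

Calling `fetch_server("massanaRoger", "extracto-mcp")` would issue the same GET request as the curl command.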

If you have feedback or need assistance with the MCP directory API, please join our Discord server.