Skip to main content
Glama
AceDataCloud

mcp-webextrator

by AceDataCloud

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
ACEDATACLOUD_API_TOKENYesYour AceDataCloud API token

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
webextrator_extractA

Extract structured content from a web page using the WebExtrator API.

Navigates to the specified URL, renders the page, and extracts structured data
such as product details, article content, or general page information.

Use this when:
- You need to extract structured data from a web page
- You want product details, article content, or general page data
- You need LLM-enhanced semantic normalization of extracted content

Returns:
    JSON response containing the extracted structured content.
webextrator_renderA

Render a web page and return the fully rendered HTML.

Uses a headless browser to navigate to the specified URL, waits for JavaScript
to execute, and returns the final rendered HTML source.

Use this when:
- You need the fully rendered HTML of a JavaScript-heavy page
- You want to inspect the DOM after dynamic content has loaded
- You need to capture single-page application (SPA) content

Returns:
    JSON response containing the rendered HTML content.
webextrator_get_usage_guideA

Get a comprehensive guide for using the WebExtrator tools.

Provides detailed information on how to use the WebExtrator tools effectively,
including parameters, examples, and best practices for web extraction and rendering.

Returns:
    Complete usage guide for WebExtrator tools.
webextrator_get_taskA

Retrieve the result of a single previously created extract or render task.

Use this when:
- You submitted an async extract or render request with a callback_url
- You want to poll for the result of a specific task by its ID

Returns:
    JSON response containing the task status and result data.
webextrator_get_tasks_batchA

Retrieve the results of multiple previously created extract or render tasks.

Use this when:
- You submitted multiple async requests and want to check their results together
- You want to paginate through a list of tasks

Returns:
    JSON response containing the list of task statuses and result data.

Prompts

Interactive templates invoked by user choice

NameDescription
webextrator_guideGuide for choosing the right WebExtrator tool for web extraction and rendering tasks.
webextrator_workflow_examplesCommon workflow examples for WebExtrator tasks.

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/AceDataCloud/WebExtratorMCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server