Skip to main content
Glama

Firecrawl MCP Server

by ampcome-mcps
MIT License
40,978
  • Apple

firecrawl_extract

Extract structured data from web pages using AI, such as product details, prices, and names, based on custom prompts and JSON schemas.

Instructions

Extract structured information from web pages using LLM capabilities. Supports both cloud AI and self-hosted LLM extraction.

Best for: Extracting specific structured data like prices, names, details from web pages. Not recommended for: When you need the full content of a page (use scrape); when you're not looking for specific structured data. Arguments:

  • urls: Array of URLs to extract information from

  • prompt: Custom prompt for the LLM extraction

  • systemPrompt: System prompt to guide the LLM

  • schema: JSON schema for structured data extraction

  • allowExternalLinks: Allow extraction from external links

  • enableWebSearch: Enable web search for additional context

  • includeSubdomains: Include subdomains in extraction Prompt Example: "Extract the product name, price, and description from these product pages." Usage Example:

{ "name": "firecrawl_extract", "arguments": { "urls": ["https://example.com/page1", "https://example.com/page2"], "prompt": "Extract product information including name, price, and description", "systemPrompt": "You are a helpful assistant that extracts product information", "schema": { "type": "object", "properties": { "name": { "type": "string" }, "price": { "type": "number" }, "description": { "type": "string" } }, "required": ["name", "price"] }, "allowExternalLinks": false, "enableWebSearch": false, "includeSubdomains": false } }

Returns: Extracted structured data as defined by your schema.

Input Schema

NameRequiredDescriptionDefault
allowExternalLinksNoAllow extraction from external links
enableWebSearchNoEnable web search for additional context
includeSubdomainsNoInclude subdomains in extraction
promptNoPrompt for the LLM extraction
schemaNoJSON schema for structured data extraction
systemPromptNoSystem prompt for LLM extraction
urlsYesList of URLs to extract information from

Input Schema (JSON Schema)

{ "properties": { "allowExternalLinks": { "description": "Allow extraction from external links", "type": "boolean" }, "enableWebSearch": { "description": "Enable web search for additional context", "type": "boolean" }, "includeSubdomains": { "description": "Include subdomains in extraction", "type": "boolean" }, "prompt": { "description": "Prompt for the LLM extraction", "type": "string" }, "schema": { "description": "JSON schema for structured data extraction", "type": "object" }, "systemPrompt": { "description": "System prompt for LLM extraction", "type": "string" }, "urls": { "description": "List of URLs to extract information from", "items": { "type": "string" }, "type": "array" } }, "required": [ "urls" ], "type": "object" }

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ampcome-mcps/firecrawl-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server