Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
MCP_HTTP_PORTNoPort for HTTP mode3001
MCP_TRANSPORTNoTransport mode: stdio or httpstdio
CRAWLER_MAX_CHARSNoDefault cap on returned page content in characters20000
CRAWLER_TIMEOUT_MSNoPer-request timeout in milliseconds15000
CRAWLER_USER_AGENTNoUser-Agent for all requestscrawler-mcp/1.0

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}

Tools

Functions exposed to the LLM to take actions

NameDescription
fetch_pageA

Fetch a single web page and return its readable content as Markdown, plain text, or raw HTML. Automatically renders JavaScript-heavy pages with a headless browser when needed.

extract_linksB

Extract all hyperlinks from a web page, resolved to absolute URLs. Optionally restrict to links on the same domain.

crawl_siteA

Recursively crawl a website starting from a URL, following links up to a maximum depth and page count. Returns a short content summary for each page visited. Stays on the same domain by default.

extract_by_selectorB

Extract specific data from a page using a CSS selector. Returns each matching element's text, or an attribute value when attribute is given (e.g. selector='a.product', attribute='href').

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/shadab15github/crawler-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server