Glama

SourceSync.ai MCP Server

by scmdr

ingestSitemap

Extract and ingest website content via sitemap.xml, enabling path filtering, link limits, and customizable chunking for efficient data integration into AI models.

Instructions

Ingests content from a website using its sitemap.xml. Supports path filtering and link limits.

Input Schema

Name          Required  Description  Default
ingestConfig  Yes
namespaceId   No
tenantId      No

Input Schema (JSON Schema)

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "additionalProperties": false,
  "properties": {
    "ingestConfig": {
      "additionalProperties": false,
      "properties": {
        "chunkConfig": {
          "additionalProperties": false,
          "description": "Optional Chunk config. When not passed, default chunk config will be used.",
          "properties": {
            "chunkOverlap": { "type": "number" },
            "chunkSize": { "type": "number" }
          },
          "required": [ "chunkSize", "chunkOverlap" ],
          "type": "object"
        },
        "config": {
          "additionalProperties": false,
          "properties": {
            "excludePaths": { "items": { "type": "string" }, "type": "array" },
            "includePaths": { "items": { "type": "string" }, "type": "array" },
            "maxLinks": { "type": "number" },
            "metadata": {
              "additionalProperties": {
                "anyOf": [
                  { "type": "string" },
                  { "items": { "type": "string" }, "type": "array" }
                ]
              },
              "type": "object"
            },
            "url": { "type": "string" }
          },
          "required": [ "url" ],
          "type": "object"
        },
        "source": { "const": "SITEMAP", "type": "string" }
      },
      "required": [ "source", "config" ],
      "type": "object"
    },
    "namespaceId": { "type": "string" },
    "tenantId": { "type": "string" }
  },
  "required": [ "ingestConfig" ],
  "type": "object"
}
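As an illustration of the schema above, the following Python sketch assembles an arguments object for ingestSitemap. The helper name build_ingest_sitemap_args, its parameter names, and the example URL and path values are hypothetical; only the JSON keys and the "SITEMAP" constant come from the schema. The schema does not say how includePaths and excludePaths are matched, so the path strings here are placeholders.

```python
import json
from typing import Any, Optional


def build_ingest_sitemap_args(
    url: str,
    include_paths: Optional[list[str]] = None,
    exclude_paths: Optional[list[str]] = None,
    max_links: Optional[int] = None,
    metadata: Optional[dict[str, Any]] = None,
    chunk_size: Optional[int] = None,
    chunk_overlap: Optional[int] = None,
    namespace_id: Optional[str] = None,
    tenant_id: Optional[str] = None,
) -> dict[str, Any]:
    """Hypothetical helper: build an ingestSitemap arguments dict per the schema."""
    # "url" is the only required key inside "config".
    config: dict[str, Any] = {"url": url}
    if include_paths is not None:
        config["includePaths"] = include_paths
    if exclude_paths is not None:
        config["excludePaths"] = exclude_paths
    if max_links is not None:
        config["maxLinks"] = max_links
    if metadata is not None:
        # Per the schema, each metadata value is a string or a list of strings.
        config["metadata"] = metadata

    # "source" must be the constant "SITEMAP"; "source" and "config" are required.
    ingest_config: dict[str, Any] = {"source": "SITEMAP", "config": config}

    # chunkConfig is optional, but when present it requires both fields.
    if (chunk_size is None) != (chunk_overlap is None):
        raise ValueError("chunkSize and chunkOverlap must be given together")
    if chunk_size is not None:
        ingest_config["chunkConfig"] = {
            "chunkSize": chunk_size,
            "chunkOverlap": chunk_overlap,
        }

    # Only "ingestConfig" is required at the top level.
    args: dict[str, Any] = {"ingestConfig": ingest_config}
    if namespace_id is not None:
        args["namespaceId"] = namespace_id
    if tenant_id is not None:
        args["tenantId"] = tenant_id
    return args


# Example with placeholder values (not taken from the source page).
example = build_ingest_sitemap_args(
    "https://example.com/sitemap.xml",
    include_paths=["/docs/"],
    max_links=50,
    chunk_size=1000,
    chunk_overlap=100,
)
print(json.dumps(example, indent=2))
```

Because every object in the schema sets "additionalProperties": false, the sketch omits unset optional keys instead of sending them as null.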

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/scmdr/sourcesyncai-mcp'
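The same request can be sketched in Python with only the standard library. The function names below are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page; adjust the decoding if the response differs.

```python
import json
import urllib.request

# Base path taken from the curl command above.
API_BASE = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner: str, name: str) -> urllib.request.Request:
    """Hypothetical helper: build the GET request for one server entry."""
    url = f"{API_BASE}/{owner}/{name}"
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )


def fetch_server_info(owner: str, name: str, timeout: float = 10.0):
    """Send the request and decode the body, assuming it is JSON."""
    request = build_server_request(owner, name)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_server_info("scmdr", "sourcesyncai-mcp") would issue the same GET as the curl command; building the request is kept separate so the URL can be inspected without network access.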

If you have feedback or need assistance with the MCP directory API, please join our Discord server.