ingestSitemap
Extract and ingest website content via sitemap.xml, enabling path filtering, link limits, and customizable chunking for efficient data integration into AI models.
Instructions
Ingests content from a website using its sitemap.xml. Supports path filtering and link limits.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
ingestConfig | Yes | ||
namespaceId | No | ||
tenantId | No |
Input Schema (JSON Schema)
You must be authenticated.
Other Tools from SourceSync.ai MCP Server
- createConnection
- createNamespace
- deleteDocuments
- deleteNamespace
- fetchDocuments
- fetchUrlContent
- getConnection
- getIngestJobRunStatus
- getNamespace
- hybridSearch
- ingestConnector
- ingestFile
- ingestSitemap
- ingestText
- ingestUrls
- ingestWebsite
- listConnections
- listNamespaces
- resyncDocuments
- revokeConnection
- semanticSearch
- updateConnection
- updateDocuments
- updateNamespace
- validateApiKey
Related Tools
- @scmdr/sourcesyncai-mcp
- @graphlit/graphlit-mcp-server
- @ScrapeGraphAI/scrapegraph-mcp
- @fengin/search-server
- @mcp2everything/mcp2tavily