Best Apache Spark MCP Servers

Apache Spark is an open-source unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs.

View all Apache Spark MCP Servers

Why this server?
Provides tools for searching and retrieving Apache Spark documentation, enabling full-text keyword searches with section filtering and access to the full content of documentation pages.
MCP Spark Documentation Server
Documentation Access Search Developer Tools
martoc
A
license
A
quality
A
maintenance
Provides full-text search and retrieval tools for Apache Spark documentation using SQLite FTS5 with BM25 ranking. It enables AI assistants to efficiently search, filter by section, and read specific Spark documentation pages.
Last updated 2026-07-20
2
MIT
Why this server?
Provides read-only access to Apache Spark data through SQL models, allowing for querying live data via natural language questions without requiring SQL knowledge. Tools include listing available tables, retrieving column information, and executing SQL SELECT queries against Spark.
Apache Spark MCP Server byofficial
Databases RAG Systems
CDataSoftware
A
license
-
quality
D
maintenance
Apache Spark MCP Server by CData
Last updated 2025-10-18
MIT
Why this server?
Offers information about Duyet's expertise with Apache Spark through CV resources and tools, enabling discussions about data engineering projects.
Duyet MCP Server
Search Documentation Access Developer Tools
duyet
F
license
B
quality
B
maintenance
An experimental Model Context Protocol server that enables AI assistants to access information about Duyet, including his CV, blog posts, and GitHub activity through natural language queries.
Last updated 2026-07-19
8
2
Why this server?
Query Spark SQL clusters via Thrift/HiveServer2 protocol, enabling read-only SQL queries, schema discovery, and multiple authentication methods.
Spark SQL MCP Server
Databases Cloud Platforms
aidancorrell
A
license
-
quality
C
maintenance
An MCP server that enables AI assistants to query Spark SQL clusters via the Thrift/HiveServer2 protocol.
Last updated 2026-03-26
MIT
Why this server?
Provides searchable documentation for Apache Spark as part of the data engineering knowledge base.
Unified Docs Hub
Documentation Access Search Developer Tools
boodrow
A
license
-
quality
D
maintenance
Provides AI assistants with searchable access to documentation from 170+ curated repositories and 1000+ popular GitHub projects across 20+ categories including trading, AI/ML, DevOps, and web development.
Last updated 2025-06-19
3
MIT
Why this server?
Connects to Apache Spark History Server to query and analyze Spark applications, jobs, stages, executors, SQL queries, and more, enabling AI agents to investigate performance, failures, and bottlenecks.
Apache Spark History Server
Monitoring Observability
kubeflow
A
license
-
quality
B
maintenance
Exposes Spark History Server data as tools for AI agents, enabling natural language querying of Spark applications, jobs, stages, and performance metrics.
Last updated 2026-07-16
183
Apache 2.0
Why this server?
Utilizes Apache Spark for writing Parquet/ORC file formats to MinIO storage as part of data processing pipelines.
Datris MCP Server
Data Platforms Databases RAG Systems
datris
A
license
-
quality
D
maintenance
MCP server with 32 tools for ETL ingestion, AI-generated data quality rules, AI transformations, vector search, and natural-language SQL. Works across Postgres, MongoDB, Kafka, S3/MinIO, HashiCorp Vault, and five vector stores (Qdrant, Weaviate, Milvus, Chroma, pgvector).
Last updated 2026-07-15
11
AGPL 3.0
Why this server?
Supports Apache Spark SQL dialect for SQL generation, schema introspection, validation, and transpilation.
SQLMind
Databases AI & Machine Learning Developer Tools
Veloce-AI
A
license
-
quality
B
maintenance
An MCP server that provides SQL generation, validation, transpilation, and schema introspection across 10 SQL dialects, using a property graph schema and phase-locked reasoning to convert natural language to accurate SQL.
Last updated 2026-06-27
2
MIT
Why this server?
Provides query optimization and data discovery capabilities for Apache Spark by exposing logical and physical query plans, catalog and table information to AI systems.
PySpark MCP Server
Databases Data Platforms Developer Tools
SemyonSinchenko
A
license
-
quality
D
maintenance
A server implementation of MCP for Apache Spark that provides query plans and catalog information to AI systems for query optimization and data discovery.
Last updated 2026-02-26
18
Apache 2.0

Apache Spark MCP Server byofficial