Best Apache Spark MCP Servers
Apache Spark is an open-source unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs.
Why this server?
Offers information about Duyet's expertise with Apache Spark through CV resources and tools, enabling discussions about data engineering projects.
License: F, Quality: B, Maintenance: F
An experimental Model Context Protocol server that enables AI assistants to access information about Duyet, including his CV, blog posts, and GitHub activity through natural language queries.
Why this server?
Provides tools for searching and retrieving Apache Spark documentation, enabling full-text keyword searches with section filtering and access to the full content of documentation pages.
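As a rough illustration of the keyword search with section filtering described above, the sketch below builds a small SQLite FTS5 index and ranks matches with BM25. The table name `docs`, the column layout, and the helper names `build_index` and `search` are assumptions made for this example, not the server's actual schema or API.

```python
import sqlite3


def build_index(pages):
    """Create an in-memory FTS5 index from (section, title, body) tuples.

    Assumes the local SQLite build includes the FTS5 extension.
    """
    conn = sqlite3.connect(":memory:")
    # 'section' is UNINDEXED: stored for filtering, not tokenized for search.
    conn.execute(
        "CREATE VIRTUAL TABLE docs USING fts5(section UNINDEXED, title, body)"
    )
    conn.executemany(
        "INSERT INTO docs (section, title, body) VALUES (?, ?, ?)", pages
    )
    return conn


def search(conn, query, section=None, limit=5):
    """Return (section, title, score) rows for a keyword query, best match first."""
    sql = "SELECT section, title, bm25(docs) AS score FROM docs WHERE docs MATCH ?"
    params = [query]
    if section is not None:
        # Optional filter that restricts results to one documentation section.
        sql += " AND section = ?"
        params.append(section)
    sql += " ORDER BY score LIMIT ?"
    params.append(limit)
    return conn.execute(sql, params).fetchall()


# Hypothetical sample pages, used only to exercise the sketch.
SAMPLE_PAGES = [
    ("sql", "Performance Tuning", "Tune shuffle partitions and caching for Spark SQL queries."),
    ("streaming", "Structured Streaming Guide", "Streaming queries process data incrementally with checkpoints."),
    ("sql", "Data Sources", "Read and write Parquet, ORC, and JSON files."),
]
```

FTS5's `bm25()` returns lower values for better matches, which is why the query sorts in ascending order.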
License: A (MIT), Quality: A, Maintenance: B
Provides full-text search and retrieval tools for Apache Spark documentation using SQLite FTS5 with BM25 ranking. It enables AI assistants to efficiently search, filter by section, and read specific Spark documentation pages.
Why this server?
Provides read-only access to Apache Spark data through SQL models, allowing for querying live data via natural language questions without requiring SQL knowledge. Tools include listing available tables, retrieving column information, and executing SQL SELECT queries against Spark.
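One way a tool layer could enforce the read-only behavior described above is to reject anything other than a single SELECT statement before it reaches Spark. The sketch below is a deliberately conservative heuristic written for illustration; the function name `is_read_only_select` and the keyword list are assumptions, not the CData server's actual implementation.

```python
import re

# Keywords that indicate a write or DDL statement (illustrative, not exhaustive).
_FORBIDDEN = re.compile(
    r"\b(insert|update|delete|merge|drop|alter|create|truncate|grant|revoke)\b",
    re.IGNORECASE,
)


def is_read_only_select(sql: str) -> bool:
    """Return True only for a single statement that starts with SELECT or WITH.

    Conservative by design: a query whose string literal contains a forbidden
    keyword or a semicolon is also rejected.
    """
    stmt = sql.strip().rstrip(";").strip()
    if ";" in stmt:
        # More than one statement was supplied.
        return False
    if not re.match(r"(?is)^(select|with)\b", stmt):
        return False
    return _FORBIDDEN.search(stmt) is None
```

A check like this complements, rather than replaces, read-only credentials on the Spark side.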
License: A (MIT), Quality: -, Maintenance: C
Apache Spark MCP Server by CData
Why this server?
Uses Apache Spark to write Parquet and ORC files to MinIO storage as part of data processing pipelines.
License: A, Quality: -, Maintenance: F
MCP server with 32 tools for ETL ingestion, AI-generated data quality rules, AI transformations, vector search, and natural-language SQL. Works across Postgres, MongoDB, Kafka, S3/MinIO, HashiCorp Vault, and five vector stores (Qdrant, Weaviate, Milvus, Chroma, pgvector).
Why this server?
Provides searchable documentation for Apache Spark as part of the data engineering knowledge base.
License: A (MIT), Quality: -, Maintenance: C
Provides AI assistants with searchable access to documentation from 170+ curated repositories and 1000+ popular GitHub projects across 20+ categories including trading, AI/ML, DevOps, and web development.