Enables querying of ADP data through SQL, making employee and payroll information accessible.
Allows accessing and querying Airtable data through a SQL interface.
Provides SQL-based access to Amazon DynamoDB NoSQL database data.
Enables querying data stored in Amazon S3 buckets through a SQL interface.
Allows retrieval of Asana project management data through SQL queries.
Provides SQL-based access to Basecamp project management data.
Enables querying BigCommerce e-commerce platform data through SQL.
Allows accessing Bitbucket repository and project data through SQL queries.
Provides SQL-based access to Box cloud storage content and metadata.
Enables querying Confluence wiki content and metadata through SQL.
Allows accessing Couchbase NoSQL database data through a SQL interface.
Provides SQL-based access to data stored in Databricks.
Enables querying Dropbox files and metadata through a SQL interface.
Allows accessing eBay marketplace data through SQL queries.
Provides SQL-based access to Elasticsearch index data.
Enables querying EnterpriseDB database through SQL.
Allows accessing Facebook social media data through SQL queries.
Provides SQL-based access to GitHub repository and project data.
Enables querying Gmail email data through a SQL interface.
Allows accessing Google Ads campaign and performance data through SQL queries.
Provides SQL-based access to Google Analytics website performance data.
Enables querying Google Calendar events and schedules through SQL.
Allows accessing Google Campaign Manager 360 advertising data through SQL queries.
Provides SQL-based access to data stored in Google Cloud Storage.
Enables querying Google Drive files and metadata through SQL.
Allows accessing Google Sheets spreadsheet data through SQL queries.
Provides SQL-based access to GraphQL API data.
Enables querying Greenhouse recruitment data through SQL.
Allows accessing Apache Hive data warehousing system through SQL queries.
Provides SQL-based access to HubSpot CRM and marketing data.
Enables querying Instagram social media data through SQL.
Allows accessing Jira issue tracking and project management data through SQL queries.
Enables querying MailChimp email marketing data through SQL.
Allows accessing MariaDB database data through SQL queries.
Provides SQL-based access to MongoDB NoSQL database data.
Enables querying MYOB accounting data through SQL.
Allows accessing MySQL database data through SQL queries.
Provides SQL-based access to Neo4j graph database data.
Enables querying Odoo ERP and business application data through SQL.
Allows accessing Okta identity management data through SQL queries.
Provides SQL-based access to PayPal payment transaction data.
Enables querying Pinterest social media data through SQL.
Allows accessing PostgreSQL database data through SQL queries.
Provides SQL-based access to Presto distributed SQL query engine data.
Enables querying QuickBooks accounting data through SQL.
Allows accessing Redis in-memory data structure store through SQL queries.
Provides SQL-based access to RSS feed data.
Enables querying Sage accounting software data through SQL.
Allows accessing Salesforce CRM data through SQL queries.
Provides SQL-based access to SAP ERP system data.
Enables querying SendGrid email delivery data through SQL.
Allows accessing Shopify e-commerce platform data through SQL queries.
Provides SQL-based access to SingleStore database data.
Enables querying Slack messaging platform data through SQL.
Allows accessing Snapchat Ads data through SQL queries.
Provides SQL-based access to Snowflake cloud data warehouse.
Enables querying Splunk log and machine data through SQL.
Allows accessing Square payment processing data through SQL queries.
Provides SQL-based access to Stripe payment processing data.
Enables querying SurveyMonkey survey data through SQL.
Allows accessing Teradata database data through SQL queries.
Provides SQL-based access to Trello project management data.
Enables querying Trino distributed SQL query engine data through SQL.
Allows accessing Twilio communication API data through SQL queries.
Provides SQL-based access to WooCommerce e-commerce platform data.
Enables querying WordPress CMS content and metadata through SQL.
Allows accessing Xero accounting software data through SQL queries.
Provides SQL-based access to XML data files.
Enables querying YouTube Analytics data through SQL.
Allows accessing Zendesk customer service platform data through SQL queries.
Provides SQL-based access to Zoho business application suite data.
hdfs-mcp-server-by-cdata
CData's Model Context Protocol (MCP) Server for HDFS
Note: This project builds a read-only MCP server. For full read, write, update, delete, and action capabilities and a simplified setup, check out our free CData MCP Server for HDFS (beta).
Purpose
We created this read-only MCP Server to allow LLMs (like Claude Desktop) to query live HDFS data, as supported by the CData JDBC Driver for HDFS.
The CData JDBC Driver connects to HDFS by exposing its data as relational SQL models.
This server wraps that driver and makes HDFS data available through a simple MCP interface, so LLMs can retrieve live information by asking natural language questions — no SQL required.
Setup Guide
- Clone the repository:
- Build the server. This creates the JAR file: CDataMCP-jar-with-dependencies.jar
- Download and install the CData JDBC Driver for HDFS: https://www.cdata.com/drivers/hdfs/download/jdbc
- License the CData JDBC Driver:
  - Navigate to the lib folder in the installation directory, typically:
    - (Windows) C:\Program Files\CData\CData JDBC Driver for HDFS\
    - (Mac/Linux) /Applications/CData JDBC Driver for HDFS/
  - Run the command java -jar cdata.jdbc.hdfs.jar --license
  - Enter your name, email, and "TRIAL" (or your license key).
- Configure your connection to the data source:
  - Run the command java -jar cdata.jdbc.hdfs.jar to open the Connection String utility.
  - Configure the connection string and click "Test Connection".
    Note: If the data source uses OAuth, you will need to authenticate in your browser.
  - Once successful, copy the connection string for use later.
- Create a .prp file for your JDBC connection (e.g. hdfs.prp) using the following properties and format:
  - Prefix - a prefix to be used for the tools exposed
  - ServerName - a name for your server
  - ServerVersion - a version for your server
  - DriverPath - the full path to the JAR file for your JDBC driver
  - DriverClass - the name of the JDBC Driver Class (e.g. cdata.jdbc.hdfs.HDFSDriver)
  - JdbcUrl - the JDBC connection string to use with the CData JDBC Driver to connect to your data (copied from above)
  - Tables - leave blank to access all data, otherwise you can explicitly declare the tables you wish to create access for
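As a hedged illustration of the properties above, the following sketch writes an example hdfs.prp from the shell. It assumes a key=value layout with one property per line, which the .prp extension suggests but the list above does not spell out. The Prefix, ServerName, and ServerVersion values are placeholders, the DriverPath assumes the default Mac/Linux install location, and the JdbcUrl value must be replaced with the connection string copied from the Connection String utility.

```shell
# Write an example hdfs.prp in the current directory.
# Assumption: one key=value pair per line; every value here is a placeholder.
cat > hdfs.prp <<'EOF'
Prefix=hdfs
ServerName=CDataHDFS
ServerVersion=1.0
DriverPath=/Applications/CData JDBC Driver for HDFS/lib/cdata.jdbc.hdfs.jar
DriverClass=cdata.jdbc.hdfs.HDFSDriver
JdbcUrl=PASTE_CONNECTION_STRING_HERE
Tables=
EOF

# Show the result so the values can be checked before use.
cat hdfs.prp
```

Leaving Tables empty matches the "access all data" behavior described above.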
Using the Server with Claude Desktop
- Create the config file for Claude Desktop (claude_desktop_config.json) to add the new MCP server. If the file already exists, add the entry to the mcpServers section of the config file.
- If needed, copy the config file to the appropriate directory (Claude Desktop as the example).
- Run or refresh your client (Claude Desktop).
Note: You may need to fully exit or quit your Claude Desktop client and re-open it for the MCP Servers to appear.
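A minimal sketch of a Claude Desktop entry, assuming the server is launched with java -jar and receives the path to the .prp file as its only argument (an assumption; check the repository for the exact invocation). The server name hdfs and every /PATH/TO/... value are placeholders. The block writes the file to the current directory so that an existing configuration is not overwritten.

```shell
# Write a standalone example config. If you already have a
# claude_desktop_config.json, merge the "hdfs" entry into it by hand instead.
cat > claude_desktop_config.json <<'EOF'
{
  "mcpServers": {
    "hdfs": {
      "command": "/PATH/TO/java",
      "args": [
        "-jar",
        "/PATH/TO/CDataMCP-jar-with-dependencies.jar",
        "/PATH/TO/hdfs.prp"
      ]
    }
  }
}
EOF

# Claude Desktop reads its config from:
#   macOS:   ~/Library/Application Support/Claude/claude_desktop_config.json
#   Windows: %APPDATA%\Claude\claude_desktop_config.json
```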
Running the Server
- Run the following command to run the MCP Server on its own:
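A minimal sketch of a standalone launch, assuming the JAR built during setup and the hdfs.prp file are both in the current directory, and that the server takes the .prp path as its only argument (an assumption, not confirmed by the text above). The guard only prints a hint when either file is missing.

```shell
JAR="CDataMCP-jar-with-dependencies.jar"  # produced by the build step
PRP="hdfs.prp"                            # created during setup

if [ -f "$JAR" ] && [ -f "$PRP" ]; then
  # Assumed invocation: the JAR first, then the path to the .prp file.
  java -jar "$JAR" "$PRP"
else
  echo "Missing $JAR or $PRP; build the server and create the .prp file first." >&2
fi
```

Servers that Claude Desktop starts from a command entry communicate over standard input and output, so a server launched this way will likely wait for MCP messages on stdin rather than show a prompt.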
Usage Details
Once the MCP Server is configured, the AI client will be able to use the built-in tools to read the underlying data (this server is read-only). In general, you do not need to call the tools explicitly. Simply ask the client to answer questions about the underlying data system. For example:
- "What is the correlation between my closed won opportunities and the account industry?"
- "How many open tickets do I have in the SUPPORT project?"
- "Can you tell me what calendar events I have today?"
The list of tools available and their descriptions follow:
Tools & Descriptions
In the definitions below, {servername} refers to the name of the MCP Server in the config file (e.g. {classname_dash} above).
{servername}_get_tables
- Retrieves a list of tables available in the data source. Use the {servername}_get_columns tool to list available columns on a table. The output of the tool will be returned in CSV format, with the first line containing column headers.
{servername}_get_columns
- Retrieves a list of columns for a table. Use the {servername}_get_tables tool to get a list of available tables. The output of the tool will be returned in CSV format, with the first line containing column headers.
{servername}_run_query
- Execute a SQL SELECT query
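For clients that do call the tools directly, MCP tool invocations are JSON-RPC 2.0 requests with the method tools/call. The sketch below writes one such request for the run_query tool. It assumes a server named hdfs, a hypothetical argument key sql, and a hypothetical table named Files; use the get_tables and get_columns tools to find real table and column names, and check the tool's input schema for the real argument name.

```shell
# Write an example tools/call request to a file for inspection.
# Assumptions: tool name "hdfs_run_query", argument key "sql", table "Files".
cat > run_query_request.json <<'EOF'
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "hdfs_run_query",
    "arguments": {
      "sql": "SELECT * FROM Files LIMIT 10"
    }
  }
}
EOF

cat run_query_request.json
```

A client sends a request like this only after the MCP initialization handshake, so the file is meant for reading rather than for piping straight into the server.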
Troubleshooting
- If you cannot see your CData MCP Server in Claude Desktop, be sure that you have fully quit Claude Desktop (Windows: use the Task Manager, Mac: use the Activity Monitor)
- If Claude Desktop is unable to retrieve data, be sure that you have configured your connection properly. Use the Connection String builder to create the connection string (see above) and copy the connection string into the property (.prp) file.
- If you are having trouble connecting to your data source, contact the CData Support Team.
- If you are having trouble using the MCP server, or have any other feedback, join the CData Community.
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
All Supported Sources
Related MCP Servers
- SharePoint MCP Server by CData (MIT License)
- Apache Hadoop Hive MCP Server by CData (MIT License)
- Teradata MCP Server by CData (MIT License)
- Apache HBase MCP Server by CData (MIT License)