Hopsworks MCP Server
Supports Apache Spark environments for job execution and data processing.
Allows working with Git repositories within Hopsworks.
Allows working with OpenSearch indexes.
Allows managing PyTorch models in the Model Registry.
Allows managing scikit-learn models in the Model Registry.
Allows managing TensorFlow models in the Model Registry.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Hopsworks MCP Serverlist feature groups in my project"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Hopsworks MCP Server
MCP server for Hopsworks integration, providing a straightforward interface for LLMs to interact with Hopsworks.
Capabilities
Platform & Authentication
Authentication - Connect to Hopsworks instances
Projects - Create and manage Hopsworks projects
Datasets - Handle file operations on Hopsworks
Python Environments - Manage Python environments and dependencies
Secrets - Securely store and retrieve sensitive information
Feature Store
Feature Store - Interact with feature stores and run SQL queries
Feature Groups - Manage feature groups and their data
External Feature Groups - Connect to external data sources as feature groups
Features - Work with individual features and their metadata
Feature Views - Create and use feature views for model training and serving
Expectations - Create and manage data validation rules
Embeddings - Manage vector embeddings and similarity search
Queries - Join, filter, and analyze feature data
Spine Groups - Create and use spine groups for training data generation
Training Datasets - Create and manage datasets for model training
Transformation Functions - Create and manage feature transformation functions (one-to-one, one-to-many, many-to-one, many-to-many) with support for statistics-based transformations
Model Lifecycle
Model Registry - Create, save, retrieve and manage ML models (TensorFlow, PyTorch, scikit-learn, Python, LLM)
Model Serving - Deploy, manage and monitor ML models in production with advanced features like transformers, inference logging and batching
Jobs & Processing
Jobs - Create and schedule jobs
Executions - Run and monitor job executions
Flink Clusters - Manage Flink clusters and jobs
Integrations
Git Integration - Work with Git repositories within Hopsworks
Kafka - Create and manage Kafka topics and schemas
OpenSearch - Work with OpenSearch indexes
Installation
pip install -e .Development
# Install development dependencies
pip install -e ".[dev]"
# Run the server
fastmcp run main.py
# Use the interactive development environment
fastmcp dev main.pyUsage with Claude or other LLMs
Running the Server
You can run the Hopsworks MCP server in several ways:
# Run the server directly
python main.py
# Run using FastMCP
fastmcp run main.py
# Use the interactive development environment
fastmcp dev main.py
# Install in Claude Desktop for persistent access
fastmcp install main.py --name "Hopsworks Tools"Configuring with Claude
To use the Hopsworks MCP server with Claude, you need to add it to Claude's configuration. The configuration file is typically located at:
macOS:
~/Library/Application Support/Claude Desktop/config.jsonWindows:
%APPDATA%\Claude Desktop\config.jsonLinux:
~/.config/Claude Desktop/config.json
Add the following configuration to your Claude settings:
{
"mcpServers": {
"hopsworks": {
"command": "/path/to/your/python",
"args": [
"/path/to/mcp-hopsworks/main.py"
],
"env": {
"PYTHONPATH": "/path/to/mcp-hopsworks",
"HOPSWORKS_API_KEY": "your_api_key_here",
"HOPSWORKS_HOST": "your_hopsworks_host_url"
}
}
}
}Replace the placeholders with your specific paths and credentials:
/path/to/your/python: The full path to your Python executable (e.g.,/usr/bin/python3or/Users/username/miniconda3/bin/python)/path/to/mcp-hopsworks: The full path to your mcp-hopsworks directoryyour_api_key_here: Your Hopsworks API keyyour_hopsworks_host_url: Your Hopsworks instance URL (e.g., "https://your-instance.hopsworks.ai")
Troubleshooting Connection Issues
If Claude has trouble connecting to the Hopsworks MCP server:
Python Path: Ensure you're using the absolute path to the Python executable that has the required packages installed:
# Find your Python path which python3 # Or python3 -c "import sys; print(sys.executable)"Environment Variables: Make sure all required environment variables are set:
HOPSWORKS_API_KEY: Required for authentication with HopsworksHOPSWORKS_HOST: The URL of your Hopsworks instancePYTHONPATH: Should include the path to the mcp-hopsworks directory
Required Packages: Verify that all required packages are installed:
pip install -e .Python Version: Ensure you're using Python 3.10 or higher:
python --version
After updating your configuration, restart Claude completely for the changes to take effect.
Requirements
Python 3.10+
Hopsworks API access (API key with recommended scopes: featurestore, project, job, kafka)
Best Practices
Installation
The Hopsworks Python client is installed with the Python profile (
hopsworks[python]) to ensure all necessary dependencies are available for pure Python environments.For Spark environments, additional configuration may be required.
API Key
When generating an API key, include the following scopes:
featurestore,project,job, andkafkafor full functionality.Store API keys securely and never commit them to version control.
Engine Selection
Use the appropriate engine based on your environment:
python: For pure Python environments (default)spark: For Apache Spark environmentshive: For Hive query execution
Version Compatibility
The major and minor version of the Hopsworks Python library should match those of your Hopsworks deployment.
Check your Hopsworks version in the Project's settings tab.
Transformation Functions
Creating transformation functions:
One-to-one: Transform a single feature into a single output feature
One-to-many: Transform a single feature into multiple output features
Many-to-one: Combine multiple features into a single output feature
Many-to-many: Transform multiple input features into multiple output features
Execution modes:
default: Uses Pandas UDF for batch operations, Python UDF for online inferencepython: Always uses Python UDF regardless of operation typepandas: Always uses Pandas UDF regardless of operation type
Use statistics-based transformations for feature normalization and scaling
Use context variables to share common parameters across multiple transformations
Use the
drop_featuresparameter to exclude input features from the output
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/MagicLex/hopsworks-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server