Dataiku DSS MCP Server
Provides tools to manage recipes, datasets, and scenarios in Dataiku DSS, including creating, updating, deleting, running, and inspecting various objects.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Dataiku DSS MCP Serverlist datasets in project 'Sales'"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Dataiku DSS MCP Server
A comprehensive Model Context Protocol (MCP) server for Dataiku DSS integration. This project provides Claude Code with direct access to Dataiku DSS for managing recipes, datasets, and scenarios.
🚀 Quick Start
Prerequisites
Node.js 18.0.0+
Dataiku DSS instance with API access
Valid DSS API key
Installation
# Install globally
npm install -g @zhangzichao2008/mcp-dataiku
# Or use with npx
npx @zhangzichao2008/mcp-dataikuConfiguration
Copy environment template:
cp .env.sample .envConfigure your DSS connection in
.env:
DSS_HOST=https://your-dss-instance.com:10000
DSS_API_KEY=your-api-key-here
DSS_INSECURE_TLS=true # Only if using self-signed certificatesClaude Code Integration
Register the MCP server with Claude Code:
claude mcp add dataiku-dss \
-e DSS_HOST=https://your-dss-instance.com:10000 \
-e DSS_API_KEY=your-api-key-here \
-e DSS_INSECURE_TLS=true \
-- npx @zhangzichao2008/mcp-dataiku📚 MCP Tool Catalog
Core Recipe Management Tools
Tool | Description | Key Parameters |
| Create new recipe |
|
| Update existing recipe |
|
| Delete recipe |
|
| Execute recipe |
|
Core Dataset Management Tools
Tool | Description | Key Parameters |
| Create new dataset |
|
| Update dataset settings |
|
| Delete dataset |
|
| Build dataset |
|
| Get dataset schema |
|
| Get dataset metrics |
|
Core Scenario Management Tools
Tool | Description | Key Parameters |
| Create new scenario |
|
| Update scenario settings |
|
| Delete scenario |
|
| Execute scenario |
|
🔧 Advanced Tools
Tool | Description | Key Parameters |
| Get detailed run logs and error messages |
|
| Extract actual Python/SQL code from recipes |
|
| Get complete data flow/pipeline structure |
|
| Get sample data from datasets |
|
| Get recent run history across scenarios/recipes |
|
| List all available Dataiku projects | - |
Additional Dataset Tools
Tool | Description | Key Parameters |
| List all datasets in a project |
|
| Get detailed information about a dataset |
|
| Clear data from a dataset |
|
Additional Recipe Tools
Tool | Description | Key Parameters |
| List all recipes in a project |
|
| Get detailed information about a recipe |
|
| Validate Python/SQL syntax of a recipe |
|
| Test recipe logic without actual execution |
|
Additional Scenario Tools
Tool | Description | Key Parameters |
| List all scenarios in a project |
|
| Get detailed information about a scenario |
|
| Add a trigger to a scenario |
|
| Remove a trigger from a scenario |
|
| Get run history for a scenario |
|
| Get step configuration including Python code |
|
| Clone an existing scenario with modifications |
|
Advanced Tools
Tool | Description | Key Parameters |
| Search for datasets, recipes, scenarios by name/pattern |
|
| List available Python/R environments |
|
| Get project-level variables and configuration |
|
| List available data connections |
|
| Get detailed job execution information |
|
| Cancel running jobs/scenarios |
|
| Update multiple objects with similar changes |
|
| Get complete data flow/pipeline structure |
|
| Export project configuration as JSON/YAML |
|
| Copy project structure to new project |
|
Total: 46 Tools
🔧 Usage Examples
Core Operations
Creating a Python Recipe
{
"project_key": "ANALYTICS_PROJECT",
"recipe_type": "python",
"recipe_name": "data_cleaner",
"inputs": ["raw_data"],
"outputs": [{"name": "clean_data", "new": true, "connection": "filesystem_managed"}],
"code": "import pandas as pd\ndf = dataiku.Dataset(\"raw_data\").get_dataframe()\ndf_clean = df.dropna()\ndataiku.Dataset(\"clean_data\").write_with_schema(df_clean)"
}Building a Dataset
{
"project_key": "BI",
"dataset_name": "user_analytics",
"mode": "RECURSIVE_BUILD"
}Getting Dataset Sample
{
"project_key": "FINANCE_PROJECT",
"dataset_name": "transactions",
"rows": 500,
"columns": ["customer_id", "amount"]
}Getting Scenario Logs
{
"project_key": "ANALYTICS_PROJECT",
"scenario_id": "data_processing"
}Exploring Project Structure
{
"project_key": "SALES_ANALYTICS"
}🏗️ Architecture
mcp-dataiku/
├── src/
│ ├── dataiku-client.ts # Dataiku API client
│ └── mcp-server.ts # MCP server implementation
├── package.json
├── tsconfig.json
├── .env.sample
└── README.md🔒 Security
API Key Protection: Store API keys in environment variables, never in code
SSL Configuration: Support for self-signed certificates with
DSS_INSECURE_TLS=truePermission Validation: All operations respect DSS user permissions
Error Handling: Sensitive information is not exposed in error messages
📈 Monitoring
The MCP server provides logging for monitoring:
# Check logs for debugging
tail -f dataiku_mcp.log🤝 Contributing
Fork the repository
Create a feature branch:
git checkout -b feature/amazing-featureCommit changes:
git commit -m 'Add amazing feature'Push to branch:
git push origin feature/amazing-featureOpen a Pull Request
Development Setup
# Install dependencies
npm install
# Run in development mode
npm run dev
# Build for production
npm run build
# Run basic validation tests (no actual API calls)
npm test
# Run comprehensive tests (requires Dataiku DSS)
node test-comprehensive.js
# Clean build artifacts
npm run clean
# Publish new version (patch version)
npm run publish:patch📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🔗 Links
📞 Support
If you encounter any issues or have questions, please open an issue on GitHub.
Ready to enhance your Dataiku workflows with AI assistance! 🚀
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/steven0lisa/mcp-dataiku'
If you have feedback or need assistance with the MCP directory API, please join our Discord server