Generates PyAirbyte pipeline code to move data between 600+ Airbyte source and destination connectors, with automated configuration management and best practices.
Supports generating pipelines from GitHub as a source connector to move data to various destinations or dataframes.
Supports generating pipelines with MySQL as a source connector to extract and move data to destinations.
Uses OpenAI's API with file search to provide context-aware code generation and instructions for building data pipelines.
Supports outputting pipeline data to Pandas DataFrames for Python-based data analysis and manipulation.
Supports generating pipelines from Salesforce as a source connector to extract and move data to various destinations.
Provides support channel for help with the MCP server configuration and usage.
Supports generating pipelines to Snowflake as a destination connector for loading data from various sources.
Supports creating pipelines that output to dataframes for visualization using Streamlit applications.
Supports generating pipelines from Stripe as a source connector to extract and move data to destinations.
PyAirbyte MCP Server
What is the PyAirbyte MCP Service?
The PyAirbyte Managed Code Provider (MCP) service is an AI-powered backend that generates PyAirbyte pipeline code and instructions. It leverages OpenAI and connector documentation to help users quickly scaffold and configure data pipelines between sources and destinations supported by Airbyte. The MCP service automates code generation, provides context-aware guidance, and streamlines the process of building and deploying data pipelines. If you want to learn more on how the service works check out this video.
Generates PyAirbyte pipeline code based on user instructions and connector documentation.
Uses OpenAI and file search to provide context-aware code and instructions.
Available as a remote MCP server for Cursor.
Quick Start
For Cursor
The easiest way to get started is using our hosted MCP server. Add this to your Cursor MCP configuration file (.cursor/mcp.json):
Requirements:
Your own OpenAI API key
No local installation required
Works immediately after configuration
Configuration Steps:
Get your OpenAI API key from OpenAI Platform
Create or edit
.cursor/mcp.jsonin your project directory (for project-specific) or~/.cursor/mcp.json(for global access)Add the configuration above with your actual OpenAI API key
turn off / on the MCP server
Start generating PyAirbyte pipelines!
Security Note
API keys are provided via MCP environment variables in the configuration
This ensures secure API key handling through the MCP protocol
Cursor is currently the only client that appears to support passing in ENV for remote servers. We will add Cline support as soon as available.
Usage
Once configured, you can use the MCP server in your AI assistant by asking it to generate PyAirbyte pipelines.
🚀 How to Use in Cline
1. Verify Connection
Look for the MCP server status in Cline's interface
You should see "pyairbyte-mcp" listed with 1 tool available
If it shows 0 tools or is red, check your mcp.json. If you need more help, please ask in this slack channel.
2. Generate Pipelines with Natural Language
Simply ask Cline to generate a PyAirbyte pipeline! Here are example prompts:
Basic Examples:
4. Available Source/Destination Options
Sources: Any Airbyte source connector (e.g.,
source-postgres,source-github,source-stripe,source-mysql,source-salesforce)Destinations: Any Airbyte destination connector (e.g.,
destination-snowflake,destination-bigquery,destination-postgres) ORdataframefor Pandas analysis
5. Pro Tips
Use "dataframe" as destination if you want to analyze data in Python/Pandas
Be specific about your source and destination names (use official Airbyte connector names and use source- or destination- to specify)
Ask follow-up questions if you need help with specific configuration or setup
The tool will automatically use your OpenAI API key (configured in the MCP settings) to generate enhanced, well-documented pipeline code with best practices and detailed setup instructions!
Just start by asking Cline to generate a pipeline for your specific use case! 🎯
Features
Automated Code Generation: Creates complete PyAirbyte pipeline scripts
Configuration Management: Handles environment variables and credentials securely
Documentation Integration: Uses OpenAI to provide context-aware instructions
Multiple Output Formats: Supports both destination connectors and DataFrame output
Best Practices: Includes error handling, logging, and proper project structure
Generate pipeline for over 600 connectors: If it is in the Airbyte Connector Registry, the MCP server can create it.
Available Tools
generate_pyairbyte_pipeline
Generates a complete PyAirbyte pipeline with setup instructions.
Parameters:
source_name: The official Airbyte source connector name (e.g., 'source-postgres', 'source-github')destination_name: The official Airbyte destination connector name (e.g., 'destination-postgres', 'destination-snowflake') OR 'dataframe' to output to Pandas DataFrames
Returns:
Complete Python pipeline code
Setup and installation instructions
Environment variable templates
Best practices and usage guidelines