Skip to main content
Glama

pixelle-mcp-Image-generation

by AIDC-AI

https://github.com/user-attachments/assets/65422cef-96f9-44fe-a82b-6a124674c417

๐Ÿ“‹ Recent Updates

  • โœ… 2025-09-29: Added RunningHub cloud ComfyUI support, enabling workflow execution without local GPU and ComfyUI environment

  • โœ… 2025-09-03: Architecture refactoring from three services to unified application; added CLI tool support; published to PyPI

  • โœ… 2025-08-12: Integrated the LiteLLM framework, adding multi-model support for Gemini, DeepSeek, Claude, Qwen, and more

๐Ÿš€ Features

  • โœ… ๐Ÿ”„ Full-modal Support: Supports TISV (Text, Image, Sound/Speech, Video) full-modal conversion and generation

  • โœ… ๐Ÿš€ Dual Execution Modes: Local ComfyUI self-hosted environment + RunningHub cloud ComfyUI service, users can flexibly choose based on their needs

  • โœ… ๐Ÿงฉ ComfyUI Ecosystem: Built on ComfyUI, inheriting all capabilities from the open ComfyUI ecosystem

  • โœ… ๐Ÿ”ง Zero-code Development: Defines and implements the Workflow-as-MCP Tool solution, enabling zero-code development and dynamic addition of new MCP Tools

  • โœ… ๐Ÿ—„๏ธ MCP Server: Based on the MCP protocol, supporting integration with any MCP client (including but not limited to Cursor, Claude Desktop, etc.)

  • โœ… ๐ŸŒ Web Interface: Developed based on the Chainlit framework, inheriting Chainlit's UI controls and supporting integration with more MCP Servers

  • โœ… ๐Ÿ“ฆ One-click Deployment: Supports PyPI installation, CLI commands, Docker and other deployment methods, ready to use out of the box

  • โœ… โš™๏ธ Simplified Configuration: Uses environment variable configuration scheme, simple and intuitive configuration

  • โœ… ๐Ÿค– Multi-LLM Support: Supports multiple mainstream LLMs, including OpenAI, Ollama, Gemini, DeepSeek, Claude, Qwen, and more

๐Ÿ“ Project Architecture

Pixelle MCP adopts a unified architecture design, integrating MCP server, web interface, and file services into one application, providing:

  • ๐ŸŒ Web Interface: Chainlit-based chat interface supporting multimodal interaction

  • ๐Ÿ”Œ MCP Endpoint: For external MCP clients (such as Cursor, Claude Desktop) to connect

  • ๐Ÿ“ File Service: Handles file upload, download, and storage

  • ๐Ÿ› ๏ธ Workflow Engine: Supports both local ComfyUI and cloud ComfyUI (RunningHub) workflows, automatically converts workflows into MCP tools

๐Ÿƒโ€โ™‚๏ธ Quick Start

Choose the deployment method that best suits your needs, from simple to complex:

๐ŸŽฏ Method 1: One-click Experience

๐Ÿ’ก Zero configuration startup, perfect for quick experience and testing

๐Ÿš€ Temporary Run

# First you need to install the uv environment # Start with one command, no system installation required uvx pixelle@latest

๐Ÿ“š View uvx CLI Reference โ†’

๐Ÿ“ฆ Persistent Installation

# Here you need to install it in the python3.11 environment # Install to system pip install -U pixelle # Start service pixelle

๐Ÿ“š View pip CLI Reference โ†’

After startup, it will automatically enter the configuration wizard to guide you through execution engine selection (ComfyUI/RunningHub) and LLM configuration.

๐Ÿ› ๏ธ Method 2: Local Development Deployment

๐Ÿ’ก Supports custom workflows and secondary development

๐Ÿ“ฅ 1. Get Source Code

git clone https://github.com/AIDC-AI/Pixelle-MCP.git cd Pixelle-MCP

๐Ÿš€ 2. Start Service

# Interactive mode (recommended) uv run pixelle

๐Ÿ“š View Complete CLI Reference โ†’

๐Ÿ”ง 3. Add Custom Workflows (Optional)

# Copy example workflows to data directory (run this in your desired project directory) cp -r workflows/* ./data/custom_workflows/

โš ๏ธ Important: Make sure to test workflows in ComfyUI first to ensure they run properly, otherwise execution will fail.

๐Ÿณ Method 3: Docker Deployment

๐Ÿ’ก Suitable for production environments and containerized deployment

๐Ÿ“‹ 1. Prepare Configuration

git clone https://github.com/AIDC-AI/Pixelle-MCP.git cd Pixelle-MCP # Create environment configuration file cp .env.example .env # Edit .env file to configure your ComfyUI address and LLM settings

๐Ÿš€ 2. Start Container

# Start all services in background docker compose up -d # View logs docker compose logs -f

๐ŸŒ Access Services

Regardless of which method you use, after startup you can access via:

๐Ÿ’ก Port Configuration: Default port is 9004, can be customized via environment variable PORT=your_port.

โš™๏ธ Initial Configuration

On first startup, the system will automatically detect configuration status:

  1. ๐Ÿš€ Execution Engine Selection: Choose between local ComfyUI or RunningHub cloud service

  2. ๐Ÿค– LLM Configuration: Configure at least one LLM provider (OpenAI, Ollama, etc.)

  3. ๐Ÿ“ Workflow Directory: System will automatically create necessary directory structure

๐ŸŒ RunningHub Cloud Mode Advantages

  • โœ… Zero Hardware Requirements: No need for local GPU or high-performance hardware

  • โœ… No Environment Setup: No need to install and configure ComfyUI locally

  • โœ… Ready to Use: Register and get API key to start immediately

  • โœ… Stable Performance: Professional cloud infrastructure ensures stable execution

  • โœ… Auto Scaling: Automatically handles concurrent requests and resource allocation

๐Ÿ  Local ComfyUI Mode Advantages

  • โœ… Full Control: Complete control over execution environment and model versions

  • โœ… Privacy Protection: All data processing happens locally, ensuring data privacy

  • โœ… Custom Models: Support for custom models and nodes not available in cloud

  • โœ… No Network Dependency: Can work offline without internet connection

  • โœ… Cost Control: No cloud service fees for high-frequency usage

๐Ÿ†˜ Need Help? Join community groups for support (see Community section below)

๐Ÿ› ๏ธ Add Your Own MCP Tool

โšก One workflow = One MCP Tool, supports two addition methods:

๐Ÿ“‹ Method 1: Local ComfyUI Workflow - Export API format workflow files ๐Ÿ“‹ Method 2: RunningHub Workflow ID - Use cloud workflow IDs directly

๐ŸŽฏ 1. Add the Simplest MCP Tool

  • ๐Ÿ“ Build a workflow in ComfyUI for image Gaussian blur (Get it here), then set the LoadImage node's title to $image.image! as shown below:

  • ๐Ÿ“ค Export it as an API format file and rename it to i_blur.json. You can export it yourself or use our pre-exported version (Get it here)

  • ๐Ÿ“‹ Copy the exported API workflow file (must be API format), input it on the web page, and let the LLM add this Tool

  • โœจ After sending, the LLM will automatically convert this workflow into an MCP Tool

  • ๐ŸŽจ Now, refresh the page and send any image to perform Gaussian blur processing via LLM

๐Ÿ”Œ 2. Add a Complex MCP Tool

The steps are the same as above, only the workflow part differs (Download workflow: UI format and API format)

Note: When using RunningHub, you only need to input the corresponding workflow ID, no need to download and upload workflow files.

๐Ÿ”ง ComfyUI Workflow Custom Specification

๐ŸŽจ Workflow Format

The system supports ComfyUI workflows. Just design your workflow in the canvas and export it as API format. Use special syntax in node titles to define parameters and outputs.

๐Ÿ“ Parameter Definition Specification

In the ComfyUI canvas, double-click the node title to edit, and use the following DSL syntax to define parameters:

$<param_name>.[~]<field_name>[!][:<description>]

๐Ÿ” Syntax Explanation:

  • param_name: The parameter name for the generated MCP tool function

  • ~: Optional, indicates URL parameter upload processing, returns relative path

  • field_name: The corresponding input field in the node

  • !: Indicates this parameter is required

  • description: Description of the parameter

๐Ÿ’ก Example:

Required parameter example:

  • Set LoadImage node title to: $image.image!:Input image URL

  • Meaning: Creates a required parameter named image, mapped to the node's image field

URL upload processing example:

  • Set any node title to: $image.~image!:Input image URL

  • Meaning: Creates a required parameter named image, system will automatically download URL and upload to ComfyUI, returns relative path

๐Ÿ“ Note: LoadImage, VHS_LoadAudioUpload, VHS_LoadVideo and other nodes have built-in functionality, no need to add ~ marker

Optional parameter example:

  • Set EmptyLatentImage node title to: $width.width:Image width, default 512

  • Meaning: Creates an optional parameter named width, mapped to the node's width field, default value is 512

๐ŸŽฏ Type Inference Rules

The system automatically infers parameter types based on the current value of the node field:

  • ๐Ÿ”ข int: Integer values (e.g. 512, 1024)

  • ๐Ÿ“Š float: Floating-point values (e.g. 1.5, 3.14)

  • โœ… bool: Boolean values (e.g. true, false)

  • ๐Ÿ“ str: String values (default type)

๐Ÿ“ค Output Definition Specification

๐Ÿค– Method 1: Auto-detect Output Nodes

The system will automatically detect the following common output nodes:

  • ๐Ÿ–ผ๏ธ SaveImage - Image save node

  • ๐ŸŽฌ SaveVideo - Video save node

  • ๐Ÿ”Š SaveAudio - Audio save node

  • ๐Ÿ“น VHS_SaveVideo - VHS video save node

  • ๐ŸŽต VHS_SaveAudio - VHS audio save node

๐ŸŽฏ Method 2: Manual Output Marking

Usually used for multiple outputs Use $output.var_name in any node title to mark output:

  • Set node title to: $output.result

  • The system will use this node's output as the tool's return value

๐Ÿ“„ Tool Description Configuration (Optional)

You can add a node titled MCP in the workflow to provide a tool description:

  1. Add a String (Multiline) or similar text node (must have a single string property, and the node field should be one of: value, text, string)

  2. Set the node title to: MCP

  3. Enter a detailed tool description in the value field

โš ๏ธ Important Notes

  1. ๐Ÿ”’ Parameter Validation: Optional parameters (without !) must have default values set in the node

  2. ๐Ÿ”— Node Connections: Fields already connected to other nodes will not be parsed as parameters

  3. ๐Ÿท๏ธ Tool Naming: Exported file name will be used as the tool name, use meaningful English names

  4. ๐Ÿ“‹ Detailed Descriptions: Provide detailed parameter descriptions for better user experience

  5. ๐ŸŽฏ Export Format: Must export as API format, do not export as UI format

๐Ÿ’ฌ Community

Scan the QR codes below to join our communities for latest updates and technical support:

Discord Community

WeChat Group

๐Ÿค How to Contribute

We welcome all forms of contribution! Whether you're a developer, designer, or user, you can participate in the project in the following ways:

๐Ÿ› Report Issues

  • ๐Ÿ“‹ Submit bug reports on the Issues page

  • ๐Ÿ” Please search for similar issues before submitting

  • ๐Ÿ“ Describe the reproduction steps and environment in detail

๐Ÿ’ก Feature Suggestions

  • ๐Ÿš€ Submit feature requests in Issues

  • ๐Ÿ’ญ Describe the feature you want and its use case

  • ๐ŸŽฏ Explain how it improves user experience

๐Ÿ”ง Code Contributions

๐Ÿ“‹ Contribution Process

  1. ๐Ÿด Fork this repo to your GitHub account

  2. ๐ŸŒฟ Create a feature branch: git checkout -b feature/your-feature-name

  3. ๐Ÿ’ป Develop and add corresponding tests

  4. ๐Ÿ“ Commit changes: git commit -m "feat: add your feature"

  5. ๐Ÿ“ค Push to your repo: git push origin feature/your-feature-name

  6. ๐Ÿ”„ Create a Pull Request to the main repo

๐ŸŽจ Code Style

  • ๐Ÿ Python code follows PEP 8 style guide

  • ๐Ÿ“– Add appropriate documentation and comments for new features

๐Ÿงฉ Contribute Workflows

  • ๐Ÿ“ฆ Share your ComfyUI workflows with the community

  • ๐Ÿ› ๏ธ Submit tested workflow files

  • ๐Ÿ“š Add usage instructions and examples for workflows

๐Ÿ™ Acknowledgements

โค๏ธ Sincere thanks to the following organizations, projects, and teams for supporting the development and implementation of this project.

License

This project is released under the MIT License (LICENSE, SPDX-License-identifier: MIT).

Related MCP Servers

  • A
    security
    -
    license
    A
    quality
    Enables the generation of images using Together AI's models through an MCP server, supporting customizable parameters such as model selection, image dimensions, and output directory.
    Last updated -
    7
    MIT License
    • Apple
    • Linux
  • -
    security
    -
    license
    -
    quality
    The Comfy MCP Server uses the FastMCP framework to generate images from prompts by interacting with a remote Comfy server, allowing automated image creation based on workflow configurations.
    Last updated -
    31
    MIT License
  • Glifofficial

    A
    security
    -
    license
    A
    quality
    Run AI workflows hosted on Glif.app via MCP, including ComfyUI-based image generators, meme generators, selfies, chained LLM calls, and more
    Last updated -
    31
    26
    MIT License
    • Apple
  • A
    security
    -
    license
    A
    quality
    A server that integrates ComfyUI with MCP, allowing users to generate images and download them through natural language interactions.
    Last updated -
    4
    9
    Apache 2.0

View all related MCP servers

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/AIDC-AI/Pixelle-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server