🚀 MCPO - MCP Pollinations Proxy
A Docker-containerized MCP (Model Context Protocol) proxy that combines the mcpo CLI tool with the Pollinations MCP server, providing AI image, text, and audio generation plus vision analysis through standard REST endpoints.
🌟 Features
🎨 Multimodal AI Capabilities
- Image Generation: Create stunning images from text prompts with 1024x1024 default resolution
- Image-to-Image: Transform existing images using text descriptions
- Vision Analysis: Analyze, describe, compare images and extract text (OCR)
- Text Generation: Simple and advanced text generation with system prompts
- Text-to-Speech: Convert text to speech with multiple voice options
- Audio Generation: Create contextual audio responses
🔧 Technical Features
- OpenAPI REST Endpoints: Standard HTTP/REST interface for all MCP capabilities
- Docker Containerized: Easy deployment and consistent environment
- Real-time Processing: Direct API integration with Pollinations services
- Multiple Model Support: Access various AI models for different tasks
🚀 Quick Start
Prerequisites
- Docker and Docker Compose
- Port 7777 available
Installation & Usage
- Clone the repository
- Build and run the container
- Access the service
  - Service runs on: http://localhost:7777
  - OpenAPI docs: http://localhost:7777/docs
  - API endpoints: http://localhost:7777/api/...
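Once the container is up, the three entry points above can be derived from a single base URL, which is handy when the service runs on another host. The following is a minimal sketch using only the Python standard library; `service_urls` and `is_reachable` are illustrative helper names, not part of this project.

```python
import urllib.error
import urllib.request
from urllib.parse import urljoin

BASE_URL = "http://localhost:7777"  # default from the Quick Start section


def service_urls(base_url: str = BASE_URL) -> dict:
    """Return the documented entry points for a given base URL."""
    base = base_url.rstrip("/") + "/"
    return {
        "service": base.rstrip("/"),
        "docs": urljoin(base, "docs"),
        "api": urljoin(base, "api/"),
    }


def is_reachable(url: str, timeout: float = 3.0) -> bool:
    """Return True if the URL answers with a 2xx/3xx status, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False
```

Calling `is_reachable(service_urls()["docs"])` after starting the container gives a quick readiness check before sending real requests.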
Development Commands
🎯 API Endpoints
The service exposes Pollinations MCP server functionality through REST endpoints:
🖼️ Image Generation
- POST /api/generateImage - Generate image from text prompt
- POST /api/generateImageUrl - Get image generation URL
- POST /api/generateImageToImage - Transform image with text prompt
- GET /api/listImageModels - List available image models
📝 Text Generation
- POST /api/generateText - Simple text generation
- POST /api/generateAdvancedText - Advanced text with system prompts
- GET /api/listTextModels - List available text models
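As a rough illustration of the text endpoints, the sketch below builds JSON POST requests with the Python standard library. The endpoint names come from the list above; the body fields `prompt` and `system` are assumptions, so check the generated schema at http://localhost:7777/docs for the real parameter names before relying on them.

```python
import json
import urllib.request

BASE_URL = "http://localhost:7777"


def build_post(endpoint: str, payload: dict, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for /api/<endpoint>."""
    return urllib.request.Request(
        url=f"{base_url}/api/{endpoint}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Hypothetical payloads: "prompt" and "system" are assumed field names.
simple = build_post("generateText", {"prompt": "Explain MCP in one sentence."})
advanced = build_post(
    "generateAdvancedText",
    {"system": "You are a concise assistant.", "prompt": "Explain MCP in one sentence."},
)
```

With the container running, `send(simple)` would perform the call; building and sending are kept separate so the request can be inspected first.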
👁️ Vision & Analysis
- POST /api/analyzeImageFromUrl - Analyze image from URL
- POST /api/analyzeImageFromData - Analyze base64 image data
- POST /api/compareImages - Compare two images
- POST /api/extractTextFromImage - OCR text extraction
🎵 Audio Generation
- POST /api/sayText - Text-to-speech conversion
- POST /api/respondAudio - Generate contextual audio responses
- GET /api/listAudioVoices - List available voices
🏗️ Architecture
Container Stack
- Base: Node.js 18 Alpine Linux
- Python: Installed for mcpo CLI tool
- Port: 7777 exposed for HTTP access
- Host: Configured to bind to 0.0.0.0
Service Flow
- Container starts with the mcpo CLI tool
- mcpo proxies the pollinations-model-context-protocol MCP server
- MCP server capabilities become available via OpenAPI endpoints
- External applications use standard HTTP/REST calls
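Because the MCP tools are republished as OpenAPI operations, a client can discover them at runtime instead of hard-coding the endpoint list. The sketch below assumes the OpenAPI document is served at `/openapi.json` under the base URL; that path is an assumption (it may live under a sub-path in this setup), and `list_operations` and `fetch_spec` are illustrative names.

```python
import json
import urllib.request

HTTP_METHODS = {"get", "post", "put", "patch", "delete"}


def list_operations(spec: dict) -> list:
    """Return sorted (METHOD, path) pairs from an OpenAPI document."""
    operations = []
    for path, item in sorted(spec.get("paths", {}).items()):
        for method in sorted(item):
            # Path items may also hold non-method keys such as "parameters".
            if method in HTTP_METHODS:
                operations.append((method.upper(), path))
    return operations


def fetch_spec(base_url: str = "http://localhost:7777", timeout: float = 10.0) -> dict:
    """Download the OpenAPI document (assumed location: <base_url>/openapi.json)."""
    with urllib.request.urlopen(f"{base_url}/openapi.json", timeout=timeout) as resp:
        return json.load(resp)
```

With the container running, `list_operations(fetch_spec())` should yield entries matching the API Endpoints section if the assumed path is correct.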
📁 Project Structure
🔧 Configuration
Default Settings
- Image Resolution: 1024x1024 pixels
- Image Quality: Private=true, NoLogo=true, Enhance=true
- Text Generation: OpenAI-compatible models
- Audio Format: MP3 with Alloy voice
- Vision Models: GPT-4o for image analysis
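For clients that want to make these defaults explicit or override them per request, the values above can be mirrored in code. This is only a sketch: the key names (`width`, `height`, `private`, `nologo`, `enhance`, `format`, `voice`) are assumptions about how the options are spelled and should be verified against the schema at http://localhost:7777/docs.

```python
from typing import Optional

# Values taken from the Default Settings list; key names are assumed.
IMAGE_DEFAULTS = {
    "width": 1024,
    "height": 1024,
    "private": True,
    "nologo": True,
    "enhance": True,
}
AUDIO_DEFAULTS = {"format": "mp3", "voice": "alloy"}


def with_defaults(defaults: dict, overrides: Optional[dict] = None) -> dict:
    """Return a new dict where overrides take precedence over defaults."""
    merged = dict(defaults)  # copy so the module-level defaults stay untouched
    merged.update(overrides or {})
    return merged


# Example: request a smaller image while keeping the other defaults.
small_image_options = with_defaults(IMAGE_DEFAULTS, {"width": 512, "height": 512})
```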
Environment Variables
No additional environment variables are needed; the container configures the MCP proxy automatically.
🎨 Usage Examples
Image Generation
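A minimal sketch of calling the image endpoint from Python. The path `/api/generateImage` is taken from the API Endpoints section; the `prompt` field name and the shape of the response are assumptions, so the response is returned as raw bytes for the caller to inspect.

```python
import json
import urllib.request

BASE_URL = "http://localhost:7777"


def build_image_request(prompt: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Prepare a POST to /api/generateImage without sending it."""
    payload = {"prompt": prompt}  # assumed field name
    return urllib.request.Request(
        f"{base_url}/api/generateImage",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 120.0) -> bytes:
    """Send the request and return the raw response body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return resp.read()
```

With the service running, `send(build_image_request("a lighthouse at dusk"))` performs the call; image size and the other options fall back to the defaults listed under Configuration.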
Vision Analysis
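The `/api/analyzeImageFromData` endpoint is described as taking base64 image data, so a local file has to be encoded before it is sent. The sketch below shows that step; the field names `imageData` and `prompt` are assumptions and should be checked against http://localhost:7777/docs.

```python
import base64
import json
import urllib.request

BASE_URL = "http://localhost:7777"


def encode_image(raw: bytes) -> str:
    """Base64-encode raw image bytes as an ASCII string."""
    return base64.b64encode(raw).decode("ascii")


def build_analyze_request(
    image_bytes: bytes, question: str, base_url: str = BASE_URL
) -> urllib.request.Request:
    """Prepare a POST to /api/analyzeImageFromData without sending it."""
    payload = {
        "imageData": encode_image(image_bytes),  # assumed field name
        "prompt": question,                      # assumed field name
    }
    return urllib.request.Request(
        f"{base_url}/api/analyzeImageFromData",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

A typical call reads the file first, for example `build_analyze_request(open("photo.png", "rb").read(), "What is in this image?")`, and then passes the request to `urllib.request.urlopen`.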
Text-to-Speech
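A sketch of a text-to-speech call against `/api/sayText`. The default voice `alloy` comes from the Configuration section; the field names `text` and `voice` are assumptions, and since the response format is not documented here, the helper returns the content type alongside the raw bytes so the caller can decide whether it received audio or JSON.

```python
import json
import urllib.request

BASE_URL = "http://localhost:7777"


def build_say_text_request(
    text: str, voice: str = "alloy", base_url: str = BASE_URL
) -> urllib.request.Request:
    """Prepare a POST to /api/sayText without sending it."""
    payload = {"text": text, "voice": voice}  # assumed field names
    return urllib.request.Request(
        f"{base_url}/api/sayText",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch(request: urllib.request.Request, timeout: float = 120.0) -> tuple:
    """Send the request and return (content_type, body_bytes)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return resp.headers.get_content_type(), resp.read()
```

If the returned content type starts with `audio/`, the bytes can be written straight to an `.mp3` file; otherwise the body is most likely JSON describing where the audio lives.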
🤝 Contributing
- Fork the repository
- Create a feature branch (git checkout -b feature/amazing-feature)
- Commit your changes (git commit -m 'Add amazing feature')
- Push to the branch (git push origin feature/amazing-feature)
- Open a Pull Request
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🙏 Acknowledgments
- Pollinations.AI for the amazing AI APIs
- Model Context Protocol for the MCP standard
- mcpo CLI tool for MCP to OpenAPI conversion
🔗 Links
Built with ❤️ using Docker, Node.js, and Python