Enables image generation and multimodal tasks using Google Gemini 2 API, allowing AI agents to create images from text prompts with configurable output formats and aspect ratios.
GeminiMcpServer
GeminiMcpServer 是一個 Model Context Protocol (MCP) 伺服器,可將 LM Studio(或其他支援 MCP 的客戶端)與 Google Gemini API 無縫連接,用於影像生成與多模態任務處理。
✨ 功能特色
- 🔗 完整 MCP 支援 – 與 LM Studio 等 MCP 客戶端即時整合。
- 🎨 影像生成 – 使用 Google Gemini 2 模型產生圖片。
- 🖼 多模態輸入 – (選用)同時處理文字與圖片輸入。
- ⚡ 本地與雲端混合流程 – 本地執行 LM Studio,同時利用 Gemini API 提供的雲端運算能力。
📦 系統需求
- Node.js v20
- Google Gemini API Key(可於 Google AI Studio 申請)
- MCP 相容客戶端(如 LM Studio)
🚀 安裝與設定
建立 Google API Key:
- 前往 Google AI Studio
- 建立新的 API 金鑰
設定環境變數:
在專案根目錄建立 .env
檔,並加入:
▶ 使用方式
啟動伺服器:
執行測試:
此測試會使用 Gemini 2 的實驗性影像生成 API 建立圖片。
📡 MCP 請求範例
📡 LM Studio設定範例
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
Tools
Enables seamless integration between MCP-compatible clients (like LM Studio) and Google Gemini API for image generation and multimodal tasks. Provides a hybrid local-cloud workflow combining local LM Studio execution with Gemini's cloud-powered image generation capabilities.
Related MCP Servers
- -securityAlicense-qualityThe ultimate Gemini API interface for MCP hosts, intelligently selecting models for the task at hand—delivering optimal performance, minimal token cost, and seamless integration.Last updated -17TypeScriptMIT License
- AsecurityAlicenseAqualityA server that enables Claude Desktop to generate images using Google's Gemini AI models through the Model Context Protocol (MCP).Last updated -714JavaScriptMIT License
- -security-license-qualityAn MCP server implementation that allows using Google's Gemini AI models (specifically Gemini 1.5 Pro) through Claude or other MCP clients via the Model Context Protocol.Last updated -1JavaScript
- AsecurityAlicenseAqualityA dedicated server that wraps Google's Gemini AI models in a Model Context Protocol (MCP) interface, allowing other LLMs and MCP-compatible systems to access Gemini's capabilities like content generation, function calling, chat, and file handling through standardized tools.Last updated -1630TypeScriptMIT License