local_ai_gen
Provides text-to-image, text-to-audio, text-to-speech, and image-to-3D generation using models hosted on Hugging Face.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@local_ai_gengenerate an image of a futuristic city skyline at night"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Local AI Generation MCP Server (local_ai_gen)
This project runs a local MCP (Model Context Protocol) server that exposes tools for:
Text to Image
Text to Music / Audio
Text to Speech
Image/Text to 3D Model
Models Used
Image Generation:
segmind/SSD-1BA fast, distilled version of SDXL that works well on consumer GPUs.Music Generation:
stabilityai/stable-audio-open-1.0A high-quality open model for generating sound effects, short music tracks, and ambient audio. (Note: Requires accepting license terms)Speech Generation:
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoiceA robust, multilingual text-to-speech model.3D Generation:
stabilityai/TripoSRA fast feed-forward model for image-to-3D reconstruction.
Requirements
Python 3.10+
NVIDIA GPU (8GB+ VRAM recommended for running everything smoothly)
Hugging Face User Access Token (for
stable-audio-open-1.0)
Complete Setup Guide
1. Preparation and Hugging Face Authenication
The Stable Audio Open model is gated and requires you to accept its license before downloading.
Go to stabilityai/stable-audio-open-1.0 on Hugging Face.
Log in and click to Accept the Terms/License.
Generate a User Access Token in your Hugging Face Settings.
Run the Hugging Face CLI login locally:
pip install -U "huggingface_hub" hf auth(Paste your token when prompted. You do not need to add it as a git credential).
2. Environment Installation
python -m venv .venv
source .venv/bin/activate
python setup.pyWhat python setup.py does:
Creates output directories (
generated_images,generated_audio,generated_models,models)Clones
third_party/TripoSRif it is missingInstalls everything from
requirements.txtCompiles and installs
torchmcubesagainst your active torch version (needed for 3D generation)Installs
flash-attn.
Run the MCP server
To run the MCP server manually (via standard I/O):
python mcp_server/main.pyNote: The first run of any specific tool will be slower because it has to download the weights for that model into your Hugging Face cache.
Smoke Test
You can run the included smoke test script to verify all models are working correctly:
python smoke_test_generate.pyMCP Tools Exposed
generate_imagegenerate_audiogenerate_speechgenerate_3d_modelhealth_check
Notes
Generated files are written to
generated_images,generated_audio, andgenerated_modelsby default.For
generate_audioandgenerate_speech, you can override the destination with:tool arg
output_dir(highest priority), orenv var
GENAI_OUTPUT_AUDIO_DIRFor
generate_image, override destination with tool argoutput_diror env varGENAI_OUTPUT_IMAGE_DIRFor
generate_3d_model, override destination with tool argoutput_diror env varGENAI_OUTPUT_MODEL_DIR
Using with MCP Clients (Cursor, Claude Desktop, etc.)
To use this server in an MCP-compatible client, add the following to your mcp.json (or the respective MCP configuration file for your client). Make sure to replace <YOUR_PROJECT_PATH> with the absolute path to where you cloned this repository:
{
"mcpServers": {
"local_ai_gen": {
"command": "<YOUR_PROJECT_PATH>/.venv/bin/python",
"args": [
"<YOUR_PROJECT_PATH>/mcp_server/main.py"
],
"env": {}
}
}
}This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/davidemodolo/local-asset-gen-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server