Model Runner
Provides integration with OpenAI's APIs for text generation (completions), vector embeddings, and image generation (DALL-E 2 and DALL-E 3), as well as zero-shot text classification with confidence scores.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Model Runnerclassify this feedback as positive, negative, or neutral"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Model Runner
Stop copy-pasting boilerplate every time you need to call a different AI model.
Model Runner is an MCP server that gives any AI assistant a unified interface to run completions, embeddings, image generation, and classification across all major providers. One tool call instead of per-provider API clients.
Quick Start
Add to your mcpServers config:
{
"mcpServers": {
"model-runner": {
"url": "https://your-cloud-run-url/mcp"
}
}
}Or run locally:
npm install
npm startBefore / After
Before: Your assistant wants to classify customer feedback into sentiment buckets. It cannot call the OpenAI API natively, does not know the exact endpoint shape, and cannot handle auth headers.
// 30 lines of fetch boilerplate. Per provider. Per project.
// Auth headers, message array format, error shapes, all different.After: One tool call:
{
"tool": "run_classification",
"arguments": {
"text": "Waited 40 minutes for support and got no answer",
"labels": ["positive", "negative", "neutral"],
"api_key": "sk-..."
}
}Output:
{
"label": "negative",
"confidence": 0.97,
"reasoning": "The customer experienced a long wait with no resolution, indicating a clearly negative experience."
}Tools
Tool | What it does |
| Browse the full catalog of models by provider and capability |
| Text generation via OpenAI, Anthropic, Groq, or Mistral |
| Vector embeddings via OpenAI or Cohere |
| Image generation via DALL-E 2 or DALL-E 3 |
| Estimate token count before making expensive API calls |
| Zero-shot text classification with confidence scores |
Who is this for?
AI assistant builders who want their agent to invoke ML models without hardcoding provider-specific API logic into every project
Developers prototyping who need a quick way to compare outputs across OpenAI, Anthropic, Groq, and Mistral without writing multiple API clients
Data teams running pipelines who want a single MCP endpoint to classify, embed, or summarize records at scale without managing provider SDKs
Health Check
Both endpoints return the same response and require no authentication:
GET /
GET /healthResponse:
{
"status": "ok",
"server": "model-runner",
"version": "1.0.0",
"tools": 6
}Built by
Mastermind HQ - AI tools built for builders.
License
MIT
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Appeared in Searches
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/josephtandle/replicate-mcp-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server