Best ONNX MCP Servers
ONNX (Open Neural Network Exchange) is an open format for representing machine learning models, allowing models to be transferred between different frameworks and tools.
Why this server?
Utilizes bundled ONNX embeddings for semantic code search capabilities that work offline without requiring API keys.
AlicenseBqualityAmaintenanceFramework-aware code intelligence MCP server that builds a cross-language dependency graph from source code. 53 integrations (Laravel, Django, Rails, Spring, NestJS, Next.js, and more) across 68 languages. 100+ tools for navigation, impact analysis, refactoring, security scanning, session memory, and CI/PR reports — up to 97% token reduction.Last updated1001,99766Why this server?
Utilizes ONNX model files for text-to-speech processing, specifically loading the Kokoro model weights for voice generation.
AlicenseBquality-maintenanceA server that generates MP3 audio files from text using Kokoro TTS technology with optional S3 upload capabilities.Last updated176Why this server?
Supports ONNX model format for AI accelerator inference workloads including MemryX MX3, Coral TPU, Hailo-8, and Intel NCS2.
AlicenseBquality-maintenanceEnables AI assistants to manage homelab infrastructure through automated service installation (Jellyfin, Pi-hole, Ollama, Home Assistant, Frigate NVR), VM operations, AI accelerator support (MemryX, Coral TPU, Hailo-8), and Terraform state management with SSH-based discovery and deployment.Last updated494Why this server?
Uses the ONNX runtime to run the Kokoro TTS model, enabling high-quality text-to-speech conversion without requiring an API key.
AlicenseBqualityCmaintenanceA Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.Last updated4111MITWhy this server?
Uses ONNX runtime for the Kokoro neural voice models, providing high-quality text-to-speech synthesis with multiple voices and emotional expressions
AlicenseAqualityCmaintenanceProvides high-quality text-to-speech synthesis with 10 natural voices, emotion control, and dynamic pacing for professional applications requiring expressive speech output.Last updated52Why this server?
Supports quantized ONNX models for faster inference, providing INT8 quantized judges with smaller memory footprint and improved performance for local semantic validation.
Alicense-qualityBmaintenanceSemantix-Verify is an MCP server for semantic validation of AI/LLM outputs. It exposes a single tool, verify_text_intent(text, intent_description, threshold), which uses a local quantized NLI cross-encoder (INT8 ONNX) to return a 0.0–1.0 probability that the text satisfies the given intent — and, when it doesn't, a structured correction prompt for agent retry loops. Useful for building comLast updated2Why this server?
Uses ONNXRuntime for efficient machine learning model execution to power OCR capabilities in the MCP server.
Alicense-qualityCmaintenanceMCP server that provides computer control capabilities including mouse movements, keyboard actions, screenshot capture with OCR, and window management through a unified API.Last updated140MITWhy this server?
Supports ONNX model integration for machine learning capabilities in Android applications
Alicense-qualityFmaintenanceKotlin MCP Server for Android app development using OpenAI, Gemini, or OpenRouter. Enables AI-assisted coding via Aider, Gradle build/test integration, Kotlin LSP, and Docker-based portability.Last updated30AGPL 3.0Why this server?
Enables exporting trained neural network models to ONNX format for deployment through the Neural MCP server.
Alicense-qualityBmaintenanceProvides GPU-accelerated scientific computing capabilities including symbolic mathematics, quantum wave mechanics simulations, molecular dynamics, and neural network training through four specialized MCP servers.Last updated2MIT