Best NVIDIA MCP Servers
NVIDIA is a technology company that specializes in designing graphics processing units (GPUs) for gaming, professional visualization, data centers, and artificial intelligence applications. They are known for their cutting-edge hardware and software solutions that power everything from gaming PCs to supercomputers and AI systems.
Why this server?
Used in examples for analyzing earnings call transcripts, particularly statements from NVIDIA's CEO about AI chip demand.
Grades: license A, quality B, maintenance C
Deliver real-time investment research with extensive private and public market data.
License: MIT
Why this server?
Integrates with NVIDIA's cloud API platform to access specialized AI models, such as Qwen for coding tasks and DeepSeek for analysis, through intelligent backend routing.
Grades: license A, quality B, maintenance B
Intelligent AI routing and integration platform for seamless provider switching.
License: Apache 2.0
Why this server?
Provides tools to trace CUDA Runtime and Driver API calls, monitor kernel launches from libraries like cuBLAS and cuDNN, and diagnose GPU stalls and synchronization issues.
Grades: license A, quality A, maintenance B
eBPF-based GPU causal observability agent with MCP server. Traces CUDA Runtime and Driver APIs via kernel uprobes and host events via tracepoints to build causal chains explaining GPU latency. 7 tools: get_check, get_trace_stats, get_causal_chains, get_stacks, run_demo, get_test_report, run_sql. Telegraphic compression reduces token usage ~60%. Supports stdio and HTTPS (TLS 1.3) transport.
Why this server?
Supports control of simulated robots in the NVIDIA Isaac Sim environment, demonstrated with the MOCA mobile manipulator.
Grades: license A, quality B, maintenance C
Facilitates robotic movement control by providing functions that enable precise manipulation of linear and angular velocities through natural language commands, compatible with both ROS and ROS2.
Why this server?
Supports NVIDIA CUDA hardware acceleration through the cuda-kernels flag, enabling BVH O(log N) indexing and NVMe parallel kernel computation for geometric memory operations on NVIDIA GPUs.
Grades: license A, quality A, maintenance B
Headless geometric memory engine for AI agents: no Vector DB, no cloud, no API key. Store and retrieve by meaning using native Vector Symbolic Architecture (NVSA) math over O_DIRECT NVMe mapping. Runs entirely on your machine via MCP.
License: AGPL 3.0
Why this server?
Supports management of RunPod resources with NVIDIA GPUs, including creating and configuring pods with specific NVIDIA GPU types and counts.

RunPod MCP Server (official)
Grades: license A, quality C, maintenance B
This Model Context Protocol server enables interaction with RunPod's REST API through Claude or other MCP-compatible clients, providing tools for managing pods, endpoints, templates, network volumes, and container registry authentications.
License: Apache 2.0
Why this server?
Enables management of NVIDIA GPU-powered computing resources through RunPod's platform.

RunPod MCP Server (official)
Grades: license A, quality C, maintenance B
Enables interaction with the RunPod REST API through Claude or other MCP-compatible clients, providing tools for managing pods, endpoints, templates, network volumes, and container registry authentications.
License: Apache 2.0
Why this server?
Allows text generation using NVIDIA's AI models through their API.
Grades: license A, quality A, maintenance C
An MCP server that enables AI applications to access 20+ model providers (including OpenAI, Anthropic, Google) through a unified interface for text and image generation.
License: MIT
Why this server?
Provides real-time pricing data, cost estimation, and model comparisons for NVIDIA AI models.
Grades: license A, quality A, maintenance C
An MCP (Model Context Protocol) server that provides real-time LLM token pricing data for 60+ AI models across 15 providers.
License: MIT