260,964 tools. Last updated 2026-07-05 10:01

"Understanding Inference Models" matching MCP tools:

rlm_ollama_statusA
Massive Context MCP
Check Ollama server status and available models to see if free local inference is available for processing large contexts.
MIT
list_inference_modelsA
Tuning Engines
List models available for inference via Tuning Engines API. Includes platform models and your deployed trained models.
MIT
spraay_compute_text_inferenceA
Spraay x402 MCP Server
Send chat messages to run text inference using multiple LLMs. Choose from 11 models priced between $0.003-$0.10 USDC per request.
MIT
groq_chat_completionB
UnClick MCP Server
Perform high-speed chat completions using Groq's inference with open models such as Llama 3, Mixtral, and Gemma.
AGPL 3.0
boj_ml_huggingfaceC
BoJ-server
Search and use Hugging Face models, datasets, and spaces through inference, information retrieval, and authentication operations.
Mozilla Public 2.0
exploreA
Wiro MCP Server
Browse curated AI models organized by category. View featured and popular models across sections like 'Recently Added', 'Image Generation', and 'Video'.
MIT

Matching MCP Servers

MiMo Multimodal Understanding
Multimedia Processing AI & Machine Learning
ChanthMiao
F
license
A
quality
A
maintenance
Integrates Xiaomi MiMo's multimodal API to enable understanding of images, audio, and video through natural language prompts.
Last updated 2026-07-04
3
Doubao Vision MCP Server
Image & Video Processing AI & Machine Learning
kira4094
F
license
B
quality
C
maintenance
Enables image understanding using Doubao vision models via MCP, supporting local file paths and URLs with customizable prompts.
Last updated 2026-06-18
1

Matching MCP Connectors

foundrynet-inference
x402 LLM proxy + data-enriched analysis (17 sources) + TimesFM predictive IoT intelligence.
Local Model Suitability MCP
Check if a task runs locally vs cloud. Save money on calls that don't need cloud inference.

get_agent_usage_detailA
vibops-mcp
Retrieve daily LLM usage, cost trend, and model breakdown for a specific agent to optimize inference spending.
MIT
get_agent_usageA
vibops-mcp
Retrieve LLM inference usage per agent with token consumption, GPU cost, and request counts. Pinpoint which agents and teams consume the most resources and cost.
MIT
aichat_list_modelsA
AiChat MCP Server
Retrieve a comprehensive list of supported AI models from the AiChat API, organized by provider including GPT-4/5, DeepSeek, Grok, and GLM models.
MIT
search_modelsA
Wiro MCP Server
Find AI models for generation, editing, and analysis. Search by keyword, category, or owner, and filter by sort options to discover models before running them.
MIT
list_inference_schedulesA
Latent Defense MCP Server
Retrieve all available JEPA inference schedules for analyzing attack paths in your infrastructure graph.
Apache 2.0
ddg_list_modelsA
io.github.daedalusdevelopmentgroup/ddg-agent-services-mcp
List local Ollama models and retrieve queryable paid route labels for accessing AI models with payment options.
MIT
download_inference_dataA
CrowdCent MCP Server
Download prediction challenge inference data for specified periods to a local .parquet file, with optional polling for availability.
MIT
blockrun_modelsA
BlockRun MCP
List available AI models and pricing. Filter by category or provider to compare costs and discover models.
MIT
get_inference_runB
Latent Defense MCP Server
Retrieve the status and results of a specific inference run using its run ID.
Apache 2.0
list_registered_modelsB
SAS MCP Server
Retrieve a list of models from the SAS Model Repository. Specify a maximum number of models to return.
Apache 2.0
testB
dbt-mcp
Run data tests on models, sources, snapshots, and seeds, and execute unit tests on SQL models to ensure data quality and accuracy in your dbt projects.
Apache 2.0
get_inference_data_infoB
CrowdCent MCP Server
Retrieve detailed information about specific inference data periods for CrowdCent prediction challenges, enabling analysis of datasets and submission requirements.
MIT
network_statusB
parallelix-mcp
Check which open-source models are currently served by the ParalleliX network to select the appropriate model for parallel inference tasks.
MIT
get_agent_budgetB
vibops-mcp
Retrieve an agent's inference budget, showing monthly limit, current spend, and whether overages are rejected or warned.
MIT

"Understanding Inference Models" matching MCP tools:

rlm_ollama_statusA

list_inference_modelsA

spraay_compute_text_inferenceA

groq_chat_completionB

boj_ml_huggingfaceC

exploreA

Matching MCP Servers

MiMo Multimodal Understanding

Doubao Vision MCP Server

Matching MCP Connectors

get_agent_usage_detailA

get_agent_usageA

aichat_list_modelsA

search_modelsA

list_inference_schedulesA

ddg_list_modelsA

download_inference_dataA

blockrun_modelsA

get_inference_runB

list_registered_modelsB

testB

get_inference_data_infoB

network_statusB

get_agent_budgetB