180,147 tools. Last updated 2026-06-06 01:32

"llama.cpp" matching MCP tools:

list_models (A)
mcp-llama-swap
View available llama.cpp model configurations and check their current load status.
Apache 2.0
list_llamacpp_models (B)
Msty Admin MCP
Retrieve a list of available llama.cpp models configured for Msty Studio.
MIT
chat_with_llamacpp_model (D)
Msty Admin MCP
Chat with llama.cpp models by providing a model name and a list of messages. Ideal for interactive AI conversations.
MIT
quantize (A)
mcp-turboquant
Quantize a Hugging Face model to GGUF, GPTQ, or AWQ format with a selectable bit width (2-8). Reduces model size for deployment on Ollama, vLLM, LM Studio, or llama.cpp.
MIT
swap_model (A)
mcp-llama-swap
Swap to a different llama.cpp model in a running session while preserving conversation context. Unloads the current model, loads the requested one, and waits for readiness.
Apache 2.0
octave_compile_grammar (A)
octave-mcp
Compile an OCTAVE schema or contract into a GBNF or JSON Schema constraint grammar to constrain AI model outputs.
Apache 2.0

Matching MCP Servers

ask_repo (A)
instagit
Analyze Git repository codebases by asking questions to understand architecture, debug issues, review security, or evaluate code quality with AI-powered insights.
MIT
get_current_model (A)
mcp-llama-swap
Identify the currently loaded llama.cpp model in an active Claude Code session to verify which model is active for inference.
Apache 2.0
check_llm_status (A)
Excalidraw MCP Server
Check whether your local llama.cpp server is running and reachable, so that offline diagram generation from natural language can work.
switch_backend (B)
Delia
Change the active LLM backend for AI task routing. Specify a backend ID to switch between backends such as Ollama, llama.cpp, or Gemini for processing tasks.