Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
LLAMA_SWAP_CONFIGNoPath to configuration JSON file. Overrides default config path.

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
list_modelsA

List available llama.cpp model configurations and their load status.

get_current_modelA

Get the currently loaded llama.cpp model.

swap_modelA

Swap to a different llama.cpp model.

Unloads any currently loaded model, loads the requested one, and waits for the health endpoint to confirm readiness.

Args: model: Alias of the model to load

create_model_configA

Generate a new service config for a llama-server model.

Creates a launchd plist (macOS) or systemd unit (Linux) that can be used with swap_model.

Args: name: Short alias for the model (e.g., "coder", "planner") model_path: Absolute path to the GGUF model file context_size: Context window size (default: 4096) gpu_layers: Number of GPU layers, -1 for all (default: -1) port: Port for llama-server (default: 8000)

Prompts

Interactive templates invoked by user choice

NameDescription
swap-workflowGuided workflow for planning with a reasoning model, then implementing with a coding model.

Resources

Contextual data attached and managed by the client

NameDescription
config_resourceCurrent llama-swap configuration.
status_resourceCurrent model status and health.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/oussama-kh/mcp-llama-swap'

If you have feedback or need assistance with the MCP directory API, please join our Discord server