Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
AI_MCP_DB_PATHNoSQLite database path. Default: $AI_MCP_CONFIG_DIR/db.sqlite3$AI_MCP_CONFIG_DIR/db.sqlite3
AI_MCP_UI_TOKENNoAccess token for Web UI when exposed (--expose). Default: none
AI_MCP_CONFIG_DIRNoOverride data/config directory. Default: ~/.ai-mcp-server~/.ai-mcp-server
AI_MCP_MASTER_KEYNoFernet master key for api_key encryption. Auto-generated to system keyring if not set.

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
usage_guideA

Return current capability inventory and usage instructions.

Call this first whenever you connect.

list_modelsA

List models matching the filters.

Args: capability: capability tags the model must support (e.g. ["vision"]). min_context_length: minimum context window in tokens. endpoint: limit to a single endpoint name. include_unprobed: include models whose capabilities have not been probed yet (default True).

invoke_modelA

Forward a request to the selected (endpoint, model).

Args: endpoint: endpoint name registered via the CLI. model: model_id as returned by list_models. operation: one of chat / embedding / image_gen / tts / stt / rerank. payload: upstream-compatible body (OpenAI shape for openai-compat endpoints). The model field is set automatically. The response is passed through verbatim; errors are returned inside error.

model_performanceB

Return近3天 aggregated call metrics per model (background-updated).

Args: endpoint: limit to a single endpoint name. sort_by: one of call_count / success_count / avg_first_byte_ms / avg_prompt_tokens / avg_output_tokens. limit: max rows to return.

Each row includes call_count, success_count, success_rate, avg_first_byte_ms, avg_prompt_tokens, avg_output_tokens, window_days.

refresh_endpointA

Enqueue probe jobs. Server-internal worker will drain them.

Args: endpoint: endpoint name; if None, refresh every endpoint. capabilities: list of capability tags; if None, choose probes per model using known metadata. refresh_model_list: re-fetch /v1/models first (default True).

add_modelsA

Manually register models or user-confirmed model features.

Args: endpoint: endpoint name. model_ids: one or more model_id to register. context_length: optional context window in tokens. capabilities: optional capability tags to mark as supported (override source). Aliases tts/stt/asr are accepted. feature_overrides: optional key/value overrides. Keys may be capability tags or context_length; capability values must be booleans. Example: {"audio_tts": true, "context_length": 32000}.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/brianMacao/ai-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server