Glama

Qwen3-Coder MCP Server

by keithah

Server Configuration

Environment variables used to configure the server. All of them are optional; each falls back to the default shown.

Name                      Required  Description                                           Default
OLLAMA_KEEP_ALIVE         No        Keep models loaded for 24 hours                       24h
OLLAMA_NUM_PARALLEL       No        Number of parallel requests to handle                 8
OLLAMA_KV_CACHE_TYPE      No        High-quality 8-bit cache                              q8_0
OLLAMA_FLASH_ATTENTION    No        Enable efficient attention mechanism                  1
OLLAMA_MAX_LOADED_MODELS  No        Number of models to keep in memory simultaneously     4
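
As a minimal sketch, the defaults in the table can be applied from a shell before the Ollama process starts. It assumes the variables are read by Ollama at startup and that `ollama serve` is how the server is launched on your machine; neither detail is stated on this page.

```shell
# Apply the documented defaults unless a value is already set in the environment.
: "${OLLAMA_KEEP_ALIVE:=24h}"          # keep models loaded for 24 hours
: "${OLLAMA_NUM_PARALLEL:=8}"          # parallel requests
: "${OLLAMA_KV_CACHE_TYPE:=q8_0}"      # 8-bit cache
: "${OLLAMA_FLASH_ATTENTION:=1}"       # enable efficient attention
: "${OLLAMA_MAX_LOADED_MODELS:=4}"     # models kept in memory at once

# Export them so that child processes inherit the values.
export OLLAMA_KEEP_ALIVE OLLAMA_NUM_PARALLEL OLLAMA_KV_CACHE_TYPE \
       OLLAMA_FLASH_ATTENTION OLLAMA_MAX_LOADED_MODELS

# Assumption: Ollama is started from this same shell afterwards, for example:
# ollama serve
```

The `: "${VAR:=value}"` form only assigns when the variable is unset or empty, so any value you have already exported takes precedence over the defaults.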

Schema

Prompts

Interactive templates invoked by user choice

No prompts

Resources

Contextual data attached and managed by the client

No resources

Tools

Functions exposed to the LLM to take actions

Name                 Description
qwen3_code_review    Review code using Qwen3-Coder
qwen3_code_explain   Explain code using Qwen3-Coder
qwen3_code_generate  Generate code using Qwen3-Coder
qwen3_code_fix       Fix bugs in code using Qwen3-Coder
qwen3_code_optimize  Optimize code using Qwen3-Coder
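
MCP clients normally invoke these tools for you, but it can help to see roughly what a request looks like. The sketch below prints JSON-RPC 2.0 messages for the MCP `tools/list` and `tools/call` methods. The helper names `mcp_tools_list` and `mcp_tool_call` are hypothetical, and the argument schema of each tool is not documented on this page, so the `code` argument in the usage comment is an illustrative assumption only.

```shell
# Print a JSON-RPC 2.0 request that asks an MCP server to list its tools.
mcp_tools_list() {
  printf '{"jsonrpc":"2.0","id":1,"method":"tools/list"}\n'
}

# Print a JSON-RPC 2.0 "tools/call" request.
#   $1: tool name, e.g. qwen3_code_review
#   $2: arguments as a JSON object (assumed; the real schema is not shown here)
mcp_tool_call() {
  tool_name=$1
  tool_args=${2:-}
  # Fall back to an empty JSON object when no arguments are given.
  [ -n "$tool_args" ] || tool_args='{}'
  printf '{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"%s","arguments":%s}}\n' \
    "$tool_name" "$tool_args"
}

# Hypothetical usage; "code" is an assumed argument name:
# mcp_tool_call qwen3_code_review '{"code":"print(1)"}'
```

Calling `tools/list` against the running server first is the safer route, since the response describes the input schema that each tool actually accepts.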

MCP directory API

We provide all the information about MCP servers via our MCP directory API. For example, the following request retrieves the entry for this server:

curl -X GET 'https://glama.ai/api/mcp/v1/servers/keithah/qwen3-coder-mcp'
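
The request above can be wrapped in a small helper if you want to look up other servers. This sketch assumes that the URL pattern `servers/<owner>/<name>` generalizes from the single example on this page; the function name `glama_server_url` is hypothetical and the response format is not described here.

```shell
# Build the directory API URL for a server.
#   $1: owner (e.g. keithah), $2: server name (e.g. qwen3-coder-mcp)
# Assumption: every server follows the same path layout as the example above.
glama_server_url() {
  printf 'https://glama.ai/api/mcp/v1/servers/%s/%s\n' "$1" "$2"
}

# Usage (performs a network request, so it is left commented out):
# curl -sS -X GET "$(glama_server_url keithah qwen3-coder-mcp)"
```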

If you have feedback or need assistance with the MCP directory API, please join our Discord server.