Qwen3-Coder MCP Server

Describes the environment variables required to run the server.

Name	Required	Description	Default
`OLLAMA_KEEP_ALIVE`	No	Keep models loaded for 24 hours	24h
`OLLAMA_NUM_PARALLEL`	No	Handle parallel requests	8
`OLLAMA_KV_CACHE_TYPE`	No	High-quality 8-bit cache	q8_0
`OLLAMA_FLASH_ATTENTION`	No	Enable efficient attention mechanism	1
`OLLAMA_MAX_LOADED_MODELS`	No	Keep models in memory simultaneously	4

Functions exposed to the LLM to take actions

Name	Description
qwen3_code_review	Review code using Qwen3-Coder
qwen3_code_explain	Explain code using Qwen3-Coder
qwen3_code_generate	Generate code using Qwen3-Coder
qwen3_code_fix	Fix bugs in code using Qwen3-Coder
qwen3_code_optimize	Optimize code using Qwen3-Coder

Interactive templates invoked by user choice

Name	Description
No prompts

Contextual data attached and managed by the client

Name	Description
No resources

Latest Blog Posts

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/keithah/qwen3-coder-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server