system-media-control-mcp
Supports Cloudflare AI API for using AI models to query and control Windows PC resources and media.
Supports GitHub Models API for using AI models to control Windows PC via natural language commands.
Supports Google Gemini API to enable natural language control of Windows PC, including system monitoring, media control, and clipboard operations.
Supports NVIDIA NIM API for AI model inference to control Windows PC through natural language.
Supports local AI models via Ollama for controlling Windows PC with natural language, including system and media tools.
Supports OpenAI API for using GPT models to control Windows PC via natural language, offering system monitoring, media control, and more.
Supports Perplexity AI API for using their models to control Windows PC through natural language commands.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@system-media-control-mcpcheck system status"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
System & Media Control MCP Client/Server (Windows)
A Node.js-based Model Context Protocol (MCP) automation system that enables Artificial Intelligence (Google Gemini, OpenAI, Anthropic Claude, or Local AI models like Ollama/LM Studio) to query and control your Windows PC using natural language.
📖 Table of Contents
Related MCP server: Windows-MCP
🌟 Features
💻 System & Resource Monitoring
get_system_status: CPU load, RAM usage, primary disk (C:) free capacity, and system uptime.
get_system_info: Retrieve detailed static system hardware specifications (CPU, motherboard, RAM slots, OS, BIOS).
get_disk_space: Returns storage space metrics (total, used, free) for all active system drives.
get_top_processes: Top 5 CPU-intensive running processes with PIDs and CPU percentages.
get_battery_status: Battery health percentage, charging state, and estimated remaining minutes (laptops).
get_network_info: Retrieves local IPv4 addresses, active network adapters, and external IP.
get_dns_servers: Retrieves configured DNS server IP addresses for the active network interfaces.
get_network_latency: Test latency (ping response time) to major targets (e.g. google.com).
get_wifi_networks: Scans and lists nearby Wi-Fi network SSIDs and signal strengths.
get_wifi_status: Retrieves status details of the currently active Wi-Fi connection (SSID, Signal quality, Transmission rate).
get_active_window: Retrieves the title, process name, and PID of the currently active focused foreground window.
get_gpu_info: GPU graphics card details, driver versions, and VRAM memory.
get_audio_devices: Lists available system output and input audio hardware controllers.
🎛 PC & Media Control
media_control: Simulates keyboard media keys (Play/Pause, Next Track, Previous Track, Stop).
get_volume / set_volume / set_mute: Checks system volume level, sets level (0-100), and mutes/unmutes audio.
get_brightness / set_brightness: Reads or changes monitor brightness (0-100%).
system_power_control: Locks the screen, puts the PC to sleep, schedules a shutdown/restart (with a 60s warning), or aborts active power schedules.
close_process: Force terminates a running background/foreground process by name or process ID.
📋 Automation & Clipboard Utilities
get_clipboard: Reads text contents currently on the Windows clipboard.
set_clipboard: Copies a text string to the system clipboard.
clear_clipboard: Clears all text contents currently on the Windows clipboard.
open_url: Launches a website URL in the default web browser.
launch_app: Spawns a desktop application by command name (e.g.
notepad,calc,explorer).take_screenshot: Captures a PNG screenshot of the primary screen and saves it locally.
empty_recycle_bin: Empties the Windows Recycle Bin.
show_desktop: Minimizes all active GUI windows instantly.
⚙️ How It Works
graph TD
User([User Prompt]) --> Client[MCP Client - client.js]
Client -->|1. Prompt + Tools| LLM[AI Provider: Gemini/OpenAI/Claude/Ollama]
LLM -->|2. Tool Call Request| Client
Client -->|3. JSON-RPC over stdin| Server[MCP Server - server.js]
Server -->|4. Runs Native Powershell/C#| WinAPI[Windows Core Audio / CIM / Clipboard]
WinAPI -->|5. Output| Server
Server -->|6. JSON-RPC over stdout| Client
Client -->|7. Tool Response| LLM
LLM -->|8. Natural Language Answer| Client
Client --> Display([Console Display])Background Spawn: The client runs as a Node.js process and spawns the MCP server as a background process (
child_process.spawn).JSON-RPC handshake: During startup, they perform a line-by-line handshake over
stdinandstdoutusing JSON-RPC 2.0.AI routing: The client prompts your chosen AI provider with the user prompt and the server's tools list.
Tool Call Interception: When the AI responds with a Tool Call request, the client intercepts it, translates it to a
tools/callJSON-RPC message, and forwards it to the server'sstdin.Windows Integration: The server runs native PowerShell command scripts, compiling on-the-fly C# interfaces to bypass the COM binder inside PowerShell, interacting directly with core APIs and user32 keyboard emulation.
Result Loop: The tool result is returned over
stdoutto the client, which feeds it back to the AI. The AI then formulates a final user response.Interactive loop: When executed without argument, the client enters a continuous loop where you can chat with the PC Agent.
💻 Prerequisites
Operating System: Windows (required for CIM, user32, and COM audio objects).
Runtime: Node.js v18 or higher.
📦 Installation & Setup
Clone this repository:
git clone https://github.com/noackjona-hash/system-media-control-mcp.git cd system-media-control-mcpInstall npm packages:
npm install
⚙️ Configuration
Create a file named .env in the root folder (automatically ignored by git) and set your preferred AI provider:
# Choose AI Provider: gemini, openai, anthropic, groq, github, deepseek, mistral, together, openrouter, watsonx, perplexity, nvidia, local, mock, etc.
AI_PROVIDER=gemini
# Google Gemini API
GEMINI_API_KEY=AIzaSy...
GEMINI_MODEL=gemini-2.5-flash
# OpenAI API
OPENAI_API_KEY=sk-...
OPENAI_MODEL=gpt-4o-mini
# Anthropic Claude API
ANTHROPIC_API_KEY=sk-ant-...
ANTHROPIC_MODEL=claude-3-5-sonnet-20241022
# Groq AI API
GROQ_API_KEY=gsk_...
GROQ_MODEL=llama-3.3-70b-versatile
# GitHub Models API
GITHUB_TOKEN=ghp_...
GITHUB_MODEL=gpt-4o
# Other supported providers (DeepSeek, Mistral, Together, OpenRouter, Watsonx, Perplexity, Cloudflare, Nebius, NVIDIA NIM, Upstage, Moonshot, etc.)
DEEPSEEK_API_KEY=sk-...
MISTRAL_API_KEY=sk-...
PERPLEXITY_API_KEY=pplx-...
# Local AI / Ollama / LM Studio (OpenAI-compatible)
LOCAL_API_BASE=http://localhost:11434/v1
LOCAL_MODEL=llama3.2:1b(If no API keys are found or configured, it defaults to Mock Mode, running keyword analysis locally to simulate tool execution. Perfect for offline testing!)
🚀 Usage
Continuous Interactive Chat Loop (Recommended)
Run the client with no arguments to start a persistent shell session:
npm startYou can chat continuously:
Ask PC Agent > How busy is my PC?
Ask PC Agent > Open github.com
Ask PC Agent > Mute the volume
Ask PC Agent > exitCLI Command Mode (Single-Shot)
Pass your instruction directly as a command-line argument:
npm start "Show my desktop"
npm start "Copy 'Antigravity' to my clipboard"
npm start "Check the battery status"🛠 Available Tools
Tool Name | Parameters | Description |
| None | Returns CPU load, RAM usage (GB/%), C: disk space, and uptime. |
| None | Returns the top 5 CPU consuming active processes. |
| None | Returns charge level (%), charging status, and remaining time. |
| None | Gets current screen brightness percentage. |
|
| Sets screen brightness to the specified level. |
| None | Gets master volume (0-100) and mute status. |
|
| Sets master volume level. |
|
| Mutes or unmutes system audio. |
|
| Simulates keypress: |
|
| Performs power action: |
| None | Returns text content on the Windows clipboard. |
|
| Copies the text to the Windows clipboard. |
|
| Opens the URL in the default browser. |
|
| Launches the application (e.g. |
| None | Returns local IPs, network adapter name, external IP, and SSID. |
| None | Minimizes all active windows to show the desktop. |
| None | Returns GPU name, driver version, memory, and status. |
| None | Lists all active audio output and input devices. |
|
| Force closes process by name or PID. |
| None | Empties the Windows Recycle Bin. |
| None | Gets storage metrics (total, used, free) for all active drives. |
|
| Captures primary display screenshot and saves it locally. |
| None | Scans and lists nearby Wi-Fi network SSIDs and signal strengths. |
| None | Retrieves static system specs (Motherboard, CPU, RAM modules info). |
| None | Retrieves details of current Wi-Fi SSID, rate, and signal quality. |
|
| Tests latency (ping response time) to custom or default addresses. |
| None | Clears all text contents currently on the Windows clipboard. |
| None | Retrieves configured DNS server IP addresses for the network interface. |
| None | Retrieves the title, process name, and PID of the focused foreground window. |
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/noackjona-hash/system-media-control-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server