MolmoWeb MCP Server
Supports using Ollama as an OpenAI-compatible endpoint for the orchestrator LLM, enabling local LLM usage for autonomous task decomposition and planning in web automation workflows.
Supports using OpenAI models (GPT-4o, GPT-4-turbo, GPT-4) as the orchestrator LLM for the run_web_task tool, enabling autonomous decomposition of complex web tasks into step-by-step browser actions.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MolmoWeb MCP Serversearch google for recent ai breakthroughs and open the first result"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
molmoweb-mcp
MCP server that exposes MolmoWeb web automation as tools for Claude (or any MCP client). Uses Playwright for browser control.
Architecture
Claude / MCP Client
↓ stdio (MCP protocol)
molmoweb-mcp (this server)
↓ ↓
Playwright browser MolmoWeb API (localhost:8001)Tools
Tool | Description |
| Health check for MolmoWeb backend |
| Open URL in Playwright browser |
| Capture JPEG screenshot (returns base64 image) |
| Get current URL and title |
| Execute click/type/scroll/press_key/hover/navigate/wait |
| Ask MolmoWeb vision model what action to perform |
| Full autonomous agent loop (orchestrator + MolmoWeb + execution) |
Setup
npm install
npx playwright install chromiumStart the MolmoWeb backend
The MolmoWeb vision model must be running at http://127.0.0.1:8001. On Windows with WSL:
# Using the provided script:
run_molmoweb.batConfigure in Claude Code
Add to your ~/.mcp.json (global) or project .mcp.json:
{
"mcpServers": {
"molmoweb": {
"command": "node",
"args": ["/path/to/molmoweb-mcp/server.js"]
}
}
}Run standalone
npm startOrchestrator LLM Support
The run_web_task tool uses an LLM orchestrator to decompose tasks into step-by-step browser actions. Supported providers:
OpenAI: gpt-4o, gpt-4o-mini, gpt-4-turbo, gpt-4
Anthropic: claude-opus-4-6, claude-sonnet-4-6, claude-haiku-4-5-20251001
Custom: Any OpenAI-compatible endpoint (e.g., Ollama)
How It Works
User provides a high-level task (e.g., "Search Google for AI news")
The orchestrator LLM decomposes it into atomic browser instructions
MolmoWeb vision model translates each instruction into pixel-level actions
Playwright executes the actions in a visible Chromium browser
Loop repeats until the task is complete or max steps reached
License
MIT
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/ArashNasr/molmoweb-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server