Skip to main content
Glama

VOZ Knowledge MCP

Local Python MCP server for crawling public or logged-in VOZ threads, archiving posts to SQLite, and exposing search/summarization tools to an AI agent.

MCP SDK needs Python 3.10+. In this Codex workspace, use the bundled Python runtime if the system python3 is 3.9.

Install

python3 -m venv .venv
. .venv/bin/activate
python -m pip install -r requirements.txt

Use any Python 3.10+ runtime. Verify with:

python --version

Related MCP server: Crawl4AI+SearXNG MCP Server

Run As MCP

python -m voz_knowledge_mcp.server

Companion skill for agents: skills/voz-knowledge/SKILL.md.

Tools exposed:

  • read_thread(url, mode="auto", max_pages=None)

  • summarize_thread(url, mode="auto")

  • search_archive(query, limit=20)

  • extract_links(url, mode="auto")

  • crawl_threads(urls, mode="auto", max_pages=None)

  • setup_browser_cdp()

CLI

python -m voz_knowledge_mcp.cli read-thread "https://voz.vn/t/example.123/" --mode public --max-pages 2
python -m voz_knowledge_mcp.cli summarize-thread "https://voz.vn/t/example.123/" --mode public
python -m voz_knowledge_mcp.cli search-archive "youtube reup"

Read Order

mode="auto" reads public first as a baseline, then still tries browser mode. If a browser returns readable posts, MCP uses the browser result and stops. If every browser endpoint fails, MCP keeps the public result when public worked.

mode="public" only reads content that anonymous visitors can see.

Browser Mode and CDP

Browser mode uses CDP, short for Chrome DevTools Protocol. CDP is a local control port exposed by Chromium-family browsers such as Brave, Chrome, Edge, Chromium, Arc, Vivaldi, Opera, and Coc Coc.

Normal browser windows do not expose CDP. Browser mode first uses configured CDP endpoints, then automatically tries to launch installed Chromium-family browsers with local CDP ports. You can also run setup_browser_cdp() explicitly to prepare browser fallback before crawling.

Use 127.0.0.1 endpoints only. Do not expose the CDP port to a public network, because anything that can reach that port can control that browser session.

Automatic launch uses dedicated local browser profiles under archive/browser-profiles/. Log into VOZ once in the launched profile if browser fallback needs authenticated content.

Manual override example with Brave:

/Applications/Brave\ Browser.app/Contents/MacOS/Brave\ Browser --remote-debugging-port=9222
export VOZ_BROWSER_CDP_URL=http://127.0.0.1:9222
python -m voz_knowledge_mcp.cli read-thread "https://voz.vn/t/example.123/" --mode browser

Example with multiple logged-in browsers, tried in order:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9223
/Applications/Brave\ Browser.app/Contents/MacOS/Brave\ Browser --remote-debugging-port=9222
export VOZ_BROWSER_CDP_URLS=http://127.0.0.1:9223,http://127.0.0.1:9222
python -m voz_knowledge_mcp.cli read-thread "https://voz.vn/t/example.123/" --mode browser

VOZ_BROWSER_CDP_URLS wins first. After that, the finite fallback order is:

VOZ_CHROME_CDP_URL
VOZ_BRAVE_CDP_URL
VOZ_EDGE_CDP_URL
VOZ_CHROMIUM_CDP_URL
VOZ_ARC_CDP_URL
VOZ_VIVALDI_CDP_URL
VOZ_OPERA_CDP_URL
VOZ_COCCOC_CDP_URL
VOZ_BROWSER_CDP_URL

MCP stops at the first browser endpoint that returns readable posts. If none work, it returns an error instead of retrying forever, or keeps the public baseline when running in auto.

Set VOZ_AUTO_LAUNCH_BROWSERS=0 to disable automatic browser launch and use only configured CDP endpoints.

Do not put passwords in git. archive/ is ignored.

F
license - not found
-
quality - not tested
B
maintenance

Maintenance

Maintainers
Response time
Release cycle
1Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/hbui290/voz-knowledge-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server