
claw-site

by mymacmeet-crypto


local-mcp

local-mcp is a small Python MCP server with tools for SearXNG web search, fetching/browsing/scraping web pages, summarizing multiple web pages, extracting site URLs, extracting text from images, parsing PDFs/documents, and generating local Markdown or PDF files.

The extract_urls tool follows this flow:

robots.txt
  -> find sitemap URLs
  -> parse sitemap XML
  -> collect URLs
  -> fetch page
  -> httpx
  -> content found?
     -> yes: extract URLs
     -> no: crawl4ai fallback
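The first half of this flow (robots.txt, sitemap URLs, sitemap XML, collected URLs) can be sketched with the standard library alone. This is an illustrative sketch, not the server's actual code: the function names and the sample robots.txt and sitemap text are assumptions, and the network fetch is left out so the example stays self-contained.

```python
import xml.etree.ElementTree as ET


def sitemap_urls_from_robots(robots_txt: str) -> list[str]:
    """Return the sitemap URLs declared by `Sitemap:` lines in robots.txt."""
    urls = []
    for line in robots_txt.splitlines():
        # Split at the first colon only, so the URL's own "://" is preserved.
        key, _, value = line.partition(":")
        if key.strip().lower() == "sitemap" and value.strip():
            urls.append(value.strip())
    return urls


def urls_from_sitemap(sitemap_xml: str) -> list[str]:
    """Collect every <loc> value, ignoring the XML namespace prefix."""
    root = ET.fromstring(sitemap_xml)
    return [
        el.text.strip()
        for el in root.iter()
        if el.tag.rsplit("}", 1)[-1] == "loc" and el.text
    ]


# Hypothetical sample inputs standing in for fetched files.
ROBOTS = """User-agent: *
Disallow: /private/
Sitemap: https://example.com/sitemap.xml
"""

SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/blog/first-post</loc></url>
</urlset>"""

print(sitemap_urls_from_robots(ROBOTS))
print(urls_from_sitemap(SITEMAP))
```

Matching on the local tag name rather than a fixed namespace lets the same helper read both urlset files and sitemap index files, which also use loc elements.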

Setup

Requires Python 3.10+.

For an interactive control panel, run:

python setup_and_run.py

The menu is shown before any dependency installation. Use option 3 for core dependencies, option 9 for the recommended bundle, option 10 to see which optional tools and parser backends are installed, or option 12 to restart the local SearXNG Docker container on http://127.0.0.1:8888.

The extract_image_text tool also requires the native Tesseract OCR executable:

Windows: install Tesseract OCR. The tool auto-detects the standard C:\Program Files\Tesseract-OCR\tesseract.exe install path; set TESSERACT_CMD if it is installed elsewhere.
macOS/Linux: install tesseract with your system package manager.
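The lookup order described above can be sketched as follows. This is only an illustration under stated assumptions: resolve_tesseract_cmd is a hypothetical helper, it does not check that the file exists, and falling back to the bare tesseract command on macOS/Linux is an assumption about how a PATH install would be found.

```python
import os
import sys

# Standard Windows install path named in this README.
WINDOWS_DEFAULT = r"C:\Program Files\Tesseract-OCR\tesseract.exe"


def resolve_tesseract_cmd(env=None, platform=None) -> str:
    """Pick the Tesseract executable: TESSERACT_CMD first, then a platform default."""
    env = os.environ if env is None else env
    platform = sys.platform if platform is None else platform

    override = env.get("TESSERACT_CMD", "").strip()
    if override:
        return override  # an explicit setting always wins
    if platform.startswith("win"):
        return WINDOWS_DEFAULT
    return "tesseract"  # assumed to be on PATH after a package-manager install


print(resolve_tesseract_cmd({}, "win32"))
print(resolve_tesseract_cmd({"TESSERACT_CMD": "/opt/tesseract/bin/tesseract"}, "linux"))
```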

cd local-mcp
python -m venv .venv
.venv\Scripts\activate                # Windows; on macOS/Linux use: source .venv/bin/activate
pip install -r requirements.txt
crawl4ai-setup

Optional document parser engines can be installed as needed:

pip install ".[document-fast]"        # PyMuPDF4LLM + pdfplumber
pip install ".[document-structured]"  # Docling
pip install ".[document-deep-marker]" # Marker
pip install ".[document-deep-mineru]" # MinerU

document-deep-marker and document-deep-mineru cannot be installed together in the same environment: marker-pdf requires Pillow<11 while mineru requires Pillow>=11. Pick whichever backend you need.

Run

python -m local_mcp
python -m local_mcp --http

HTTP mode listens on 127.0.0.1:3002 by default.

For the full package layout and request flow, see docs/ARCHITECTURE.md.

SearXNG search

Run SearXNG locally and enable JSON output in its settings.yml:

search:
  formats:
    - html
    - json

Then point this MCP server at it:

export SEARXNG_BASE_URL=http://127.0.0.1:8888

For failover, set a comma-separated list:

export SEARXNG_URLS=http://127.0.0.1:8888,https://your-backup-searxng.example

LOCAL_MCP_SEARXNG_URLS is also supported as an alias. Individual web_search calls can override the base URL with the searxng_url parameter.
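A minimal sketch of how these variables might be combined into an ordered failover list. The helper name and the precedence (SEARXNG_URLS, then LOCAL_MCP_SEARXNG_URLS, then SEARXNG_BASE_URL) are assumptions for illustration; the README does not state which variable wins when several are set.

```python
import os


def resolve_searxng_urls(env=None) -> list[str]:
    """Return SearXNG base URLs in failover order, without duplicates."""
    env = os.environ if env is None else env
    # Assumed precedence: the list variables first, then the single base URL.
    raw = (
        env.get("SEARXNG_URLS")
        or env.get("LOCAL_MCP_SEARXNG_URLS")
        or env.get("SEARXNG_BASE_URL")
        or ""
    )
    urls = []
    for part in raw.split(","):
        url = part.strip().rstrip("/")  # tolerate spaces and trailing slashes
        if url and url not in urls:
            urls.append(url)
    return urls


print(resolve_searxng_urls({
    "SEARXNG_URLS": "http://127.0.0.1:8888, https://your-backup-searxng.example/",
}))
```

A caller would try each URL in order and move to the next one when a request fails.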

The setup menu can run the included Docker config for you. Choose option 12, or run the same commands manually:

docker rm -f local-searxng
docker run -d `
  --name local-searxng `
  -p 8888:8080 `
  -v "${PWD}\searxng-settings.yml:/etc/searxng/settings.yml:ro" `
  searxng/searxng:latest

Claude Desktop config

{
  "mcpServers": {
    "local-mcp": {
      "command": "D:\\MCP\\local-mcp\\.venv\\Scripts\\python.exe",
      "args": ["-m", "local_mcp"]
    }
  }
}

Smaller model compatibility

Smaller local models often fail MCP calls because they see too many tools, too many optional arguments, or free-form string options where they should choose from a small set. For Qwen-class and similar local models, use the simple profile:

LOCAL_MCP_TOOL_PROFILE=simple

For the full explanation of these compatibility changes, see docs/low_model_compatibility.md.

The simple profile registers simpler wrapper tools only:

fetch_web_page
list_page_urls
read_document
read_image_text
write_markdown_file
write_report_file
search_web_to_file

For PDF or Markdown reports, prefer write_report_file. It rejects content below its min_words threshold, so smaller models are forced to expand the report before a half-page PDF is written. The default is min_words=900, which is usually closer to a 2-3 page PDF than a short answer.
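The threshold check can be pictured roughly like this. It is a sketch only: check_min_words is a hypothetical name, and counting whitespace-separated tokens is an assumption about how the tool counts words.

```python
def check_min_words(content: str, min_words: int = 900) -> tuple[bool, int]:
    """Return (accepted, word_count) for a report draft."""
    # Assumption: a "word" is any whitespace-separated token.
    count = len(content.split())
    return count >= min_words, count


accepted, count = check_min_words("A short draft that is far too brief.")
print(accepted, count)
```

When a draft is rejected, the word count gives the model a concrete target for how much more to write before retrying.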

The default profile is full, which keeps the original tool surface. Use the both profile only for clients and models that handle larger tool lists well.

You can also set the profile directly in a desktop MCP config:

{
  "mcpServers": {
    "local-mcp": {
      "command": "D:\\MCP\\local-mcp\\.venv\\Scripts\\python.exe",
      "args": ["-m", "local_mcp"],
      "env": {
        "LOCAL_MCP_TOOL_PROFILE": "simple"
      }
    }
  }
}

Tools

`web_search`

Parameters:

query: search query to send to SearXNG.
limit: maximum number of URLs to return. Default: 8. Allowed range: 1 to 20.

web_search is a discovery tool, not an answer tool. It returns a minimal JSON envelope: stage, query, requires_fetch, workflow, agent_guidance, next_action, and a urls list of candidate source URLs (in SearXNG order). The URLs alone are not evidence: the intended next step is to call web_fetch on one or more of the urls, then synthesize an answer from the fetched content.
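On the client side, handling this envelope might look like the sketch below. The field names come from the description above; the sample values, the helper name, and the assumption that urls is a list of plain strings are illustrative only, and the remaining envelope fields are omitted for brevity.

```python
import json

# Hypothetical envelope; the real response carries more fields.
SAMPLE = json.dumps({
    "query": "python asyncio tutorial",
    "requires_fetch": True,
    "urls": [
        "https://docs.python.org/3/library/asyncio.html",
        "https://example.com/asyncio-guide",
        "https://docs.python.org/3/library/asyncio.html",
        "https://example.org/async-tips",
    ],
})


def urls_to_fetch(envelope_json: str, max_fetch: int = 3) -> list[str]:
    """Pick the first few unique candidate URLs to pass to web_fetch."""
    envelope = json.loads(envelope_json)
    picked = []
    for url in envelope.get("urls", []):
        if url not in picked:  # keep SearXNG order, drop repeats
            picked.append(url)
        if len(picked) >= max_fetch:
            break
    return picked


print(urls_to_fetch(SAMPLE, max_fetch=2))
```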

`smart_search`

Parameters:

query: the question or topic to research and answer.
max_sources: maximum number of pages to crawl and summarize. Default: 3. Allowed range: 1 to 10.
time_range: optional SearXNG time range: day, month, or year. Empty means any time.
model: optional model override for the configured LLM provider. Empty uses OLLAMA_MODEL or GEMINI_MODEL, whichever LLM_PROVIDER selects.

smart_search is a one-shot answer tool that runs the whole research pipeline internally: it searches SearXNG for candidate sources, asks an LLM to rank them by relevance, crawls the ranked pages (falling through to lower-ranked sources when a page times out or blocks the request) until max_sources load successfully, then asks the LLM to write a synthesized, inline-cited summary. It returns plain text — the summary followed by a numbered Sources: list of the URLs actually used — unlike web_search, which returns intermediate JSON for the model to process itself.

smart_search uses a local Ollama model by default (LLM_PROVIDER=ollama, OLLAMA_MODEL=qwen2.5:7b). No API key is needed, only a running ollama serve with the model pulled. Set LLM_PROVIDER=gemini and GEMINI_API_KEY in .env (see .env.example) to use Google Gemini instead. Neither backend needs an extra Python dependency, because the client calls the REST API directly over httpx.
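The fall-through crawl step of smart_search (keep trying lower-ranked sources until max_sources pages load) can be sketched as a simple loop. This is an illustration, not the server's code: crawl_until_enough and the injected fetch callable are hypothetical, and the fake fetcher stands in for real network access.

```python
from typing import Callable, Optional


def crawl_until_enough(
    ranked_urls: list[str],
    fetch: Callable[[str], Optional[str]],
    max_sources: int = 3,
) -> dict[str, str]:
    """Fetch ranked URLs in order, skipping failures, until max_sources succeed."""
    pages: dict[str, str] = {}
    for url in ranked_urls:
        if len(pages) >= max_sources:
            break
        try:
            content = fetch(url)
        except Exception:
            continue  # timeout or blocked request: fall through to the next source
        if content:
            pages[url] = content
    return pages


def fake_fetch(url: str) -> Optional[str]:
    """Stand-in fetcher: one URL times out, one returns nothing."""
    if url == "https://b.example":
        raise TimeoutError("simulated timeout")
    if url == "https://c.example":
        return None
    return f"content of {url}"


ranked = [f"https://{name}.example" for name in "abcde"]
print(list(crawl_until_enough(ranked, fake_fetch, max_sources=2)))
```

Injecting the fetcher keeps the loop testable without a network and makes the fallback behavior easy to see.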

`deep_research`

Parameters:

query: the research question or topic to investigate in depth.
breadth: new sources to crawl per research round. Default: 4. Allowed range: 1 to 10.
max_iterations: how many reflect → re-search rounds to run (research depth). Default: 2. Allowed range: 1 to 4.
max_sources: hard cap on total pages crawled across all rounds. Default: 12. Allowed range: 1 to 30.
time_range: optional SearXNG time range: day, month, or year. Empty means any time.
verify: run a fact-checking pass that flags report claims the sources do not support. Default: true.
output_file: optional relative Markdown/PDF filename. When set, the report is also written to a file and its path is returned.
model: optional model override for the configured LLM provider.

deep_research is an iterative, deeper version of smart_search. It plans sub-questions and an outline, runs several rounds of search + crawl, takes compact per-source notes (an evidence ledger, rather than concatenating whole pages), reflects on what is still missing to open follow-up searches, then synthesizes a long-form, sectioned Markdown report with inline [n] citations and a claim-verification pass. It returns the report followed by a numbered Sources list, and can write it to a file. Prefer smart_search for a quick one-shot answer; prefer deep_research for broad or high-stakes questions worth reading many sources and cross-checking. It uses the same pluggable LLM backend as smart_search. See docs/deep_research.md.
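How breadth, max_iterations, and max_sources might interact can be sketched as a per-round crawl budget. The function below is hypothetical and rests on one assumption in particular: that every round, including the first, counts toward max_iterations. See docs/deep_research.md for the actual behavior.

```python
def plan_rounds(breadth: int = 4, max_iterations: int = 2, max_sources: int = 12) -> list[int]:
    """Return how many new sources each research round may crawl."""
    rounds = []
    remaining = max_sources  # hard cap across all rounds
    for _ in range(max_iterations):
        take = min(breadth, remaining)
        if take <= 0:
            break  # the cap is exhausted before the last round
        rounds.append(take)
        remaining -= take
    return rounds


print(plan_rounds())          # defaults
print(plan_rounds(5, 3, 12))  # the cap trims the final round
```

Under this reading, the defaults crawl at most 8 pages, so max_sources only becomes the binding limit when breadth or max_iterations is raised.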

`web_search_to_file`

Parameters:

query: search query to send to SearXNG.
filename: output Markdown or PDF filename or relative path. The matching extension is appended when omitted.
limit: maximum number of search results to write. Default: 8.
categories: SearXNG categories, for example general, news, images, or general,news. Default: general.
language: SearXNG language code. Default: auto.
pageno: SearXNG result page number. Default: 1.
safesearch: safe-search level, where 0 is off, 1 is moderate, and 2 is strict. Default: 0.
time_range: optional SearXNG time range: day, month, or year.
engines: optional comma-separated SearXNG engines override.
searxng_url: optional SearXNG base URL for this request.
write_mode: append adds a search section to the target file, write creates/replaces content. Default: append.
overwrite: replace an existing file when write_mode is write. Default: false.
ensure_trailing_newline: append a trailing newline to the generated Markdown section. Ignored for PDF output. Default: true.
file_type: output type. Supports md/markdown and pdf. A .pdf filename also selects PDF output. Default: md.

This runs the search server-side and writes the formatted results directly into the generated Markdown or PDF file, so smaller models do not need to pass large search output through a content argument. PDF output requires write_mode="write" because append/chunk mode is Markdown-only.

`web_fetch`

Parameters:

url: page URL to fetch. Scheme-less input like example.com is allowed.
max_chars: maximum content characters before truncation. Use 0 for no truncation. Default: 120000.

web_fetch is an evidence tool. It fetches pages with httpx (with automatic Crawl4AI browser rendering for JavaScript-heavy pages) and returns a minimal JSON envelope: stage, url, requires_analysis, workflow, agent_guidance, next_action, and the Markdown content. The content field is raw source material and carries an agent_guidance string instructing the model to analyze it and write its own cited answer rather than pasting the content to the user.
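The two parameters can be illustrated with a short sketch. Both helpers are hypothetical, and defaulting scheme-less input to https is an assumption; the README only says such input is allowed.

```python
def normalize_url(url: str) -> str:
    """Add a scheme to input such as example.com (https is assumed here)."""
    url = url.strip()
    if "://" not in url:
        url = "https://" + url
    return url


def truncate(content: str, max_chars: int = 120000) -> tuple[str, bool]:
    """Cut content to max_chars; 0 disables truncation. Returns (text, was_truncated)."""
    if max_chars == 0 or len(content) <= max_chars:
        return content, False
    return content[:max_chars], True


print(normalize_url("example.com"))
print(truncate("abcdefghij", max_chars=4))
```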

`extract_urls`

Parameters:

url: page or site URL.
same_domain: only return URLs on the input hostname. Default: true.
same_path: only return URLs under the input URL path prefix, for example https://example.com/blogs returns /blogs and /blogs/... URLs. Default: true.
limit: maximum unique URLs to return. Default: 500.

The response includes URL stats by source, then a Markdown bullet list of absolute URLs with the source that found each URL, such as robots.txt, sitemap.xml, httpx, or Crawl4AI. If no URLs are found, the tool returns the stats and a short message.
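The same_domain and same_path filters can be sketched with urllib.parse. This is an illustrative sketch rather than the tool's implementation; filter_urls is a hypothetical name, and treating the path prefix as segment-aligned (so /blogs does not match /blogsphere) is an assumption consistent with the /blogs example above.

```python
from urllib.parse import urlparse


def filter_urls(
    base_url: str,
    candidates: list[str],
    same_domain: bool = True,
    same_path: bool = True,
    limit: int = 500,
) -> list[str]:
    """Keep unique candidate URLs that match the hostname and path-prefix rules."""
    base = urlparse(base_url)
    prefix = base.path.rstrip("/")
    kept: list[str] = []
    for url in candidates:
        parsed = urlparse(url)
        if same_domain and parsed.hostname != base.hostname:
            continue
        if same_path and prefix:
            # Accept the prefix itself or anything below it, on a segment boundary.
            if not (parsed.path == prefix or parsed.path.startswith(prefix + "/")):
                continue
        if url not in kept:
            kept.append(url)
        if len(kept) >= limit:
            break
    return kept


CANDIDATES = [
    "https://example.com/blogs",
    "https://example.com/blogs/post-1",
    "https://example.com/blogsphere",
    "https://example.com/about",
    "https://other.example/blogs/x",
    "https://example.com/blogs/post-1",
]
print(filter_urls("https://example.com/blogs", CANDIDATES))
```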

`extract_image_text`

Parameters:

image: image file path, image URL, data URL, or base64-encoded image content.
lang: Tesseract language code. Default: eng.

The response is only the text recognized from the image.

`parse_document`

Parameters:

document: document file path, file:// URI, HTTP(S) URL, data URL, or base64 document content.
parser: backend to use: auto, pypdf, pymupdf4llm, pdfplumber, docling, marker, mineru, or text. Default: auto.
output_format: markdown, text, or json. Default: markdown.
pages: optional 1-based page range such as 1-3,5. Empty parses all pages.
include_metadata: include parser/source metadata before Markdown or text output. Default: true.
max_chars: maximum content characters returned before truncation. Default: 120000.

The response is parsed document content. auto prefers PyMuPDF4LLM when installed for fast digital PDFs, falls back to lightweight pypdf, and can use optional engines for structured OCR, deep-learning parsing, CJK/scientific documents, or table coordinates.
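The pages syntax (1-based, comma-separated, with ranges such as 1-3,5) can be expanded as in this sketch. parse_pages is a hypothetical helper; rejecting reversed or zero-based ranges with ValueError is an assumption about sensible validation, not documented behavior.

```python
def parse_pages(spec: str) -> list[int]:
    """Expand a spec like "1-3,5" into [1, 2, 3, 5]; an empty spec yields []."""
    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        start, sep, end = part.partition("-")
        first = int(start)
        last = int(end) if sep else first
        if first < 1 or last < first:
            raise ValueError(f"invalid page range: {part!r}")
        for page in range(first, last + 1):
            if page not in pages:  # overlapping ranges do not repeat pages
                pages.append(page)
    return pages


print(parse_pages("1-3,5"))
```

An empty result corresponds to the documented default of parsing all pages.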

`generate_file`

Parameters:

filename: output Markdown or PDF filename or relative path. The matching extension is appended when omitted.
content: Markdown-like content to write.
file_type: output type. Supports md/markdown and pdf. A .pdf filename also selects PDF output. Default: md.
overwrite: replace an existing file at the target path. Default: false.
write_mode: write creates/replaces content, append adds the content as a chunk. Default: write.
ensure_trailing_newline: append a trailing newline to non-empty Markdown content. Ignored for PDF output. Default: true.
min_words: minimum word count required before writing. Use 700-1200 for 2-3 page reports, or 0 for short notes. Default: 0.

The response reports the generated file path, write mode, byte count, character count, and whether an existing file was overwritten. PDF output is generated from Markdown-like text. append/chunk mode is supported for Markdown only; generate PDFs with write_mode="write" and the complete content.

For large files, call generate_file once with write_mode="write" for the first chunk, then call it again with write_mode="append" for later chunks.
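The write-then-append sequence can be planned ahead of time, as in the sketch below. The argument names (filename, content, write_mode) come from the parameter list above; the helper itself, the max_chars budget, and splitting on blank lines are assumptions chosen so that no chunk ends mid-paragraph. How the tool joins appended chunks is not specified here.

```python
def plan_chunked_writes(filename: str, content: str, max_chars: int = 4000) -> list[dict]:
    """Split Markdown at paragraph boundaries and build generate_file arguments."""
    chunks: list[str] = []
    current = ""
    for paragraph in content.split("\n\n"):
        candidate = paragraph if not current else current + "\n\n" + paragraph
        if current and len(candidate) > max_chars:
            chunks.append(current)  # flush before the budget is exceeded
            current = paragraph
        else:
            current = candidate
    if current:
        chunks.append(current)
    return [
        {
            "filename": filename,
            "content": chunk,
            # First call creates the file; later calls add to it.
            "write_mode": "write" if index == 0 else "append",
        }
        for index, chunk in enumerate(chunks)
    ]


REPORT = "A" * 30 + "\n\n" + "B" * 30 + "\n\n" + "C" * 30
for call in plan_chunked_writes("report.md", REPORT, max_chars=70):
    print(call["write_mode"], len(call["content"]))
```

Because append mode is Markdown-only, a plan like this applies to .md output; a PDF needs the complete content in a single write call.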

You must define a download location in .env; otherwise file-writing tools return Download path not defined.

LOCAL_MCP_FILE_OUTPUT_DIR=D:\Downloads\local-mcp
