mcp-web-calc
This server provides web search, URL content extraction, and Wikipedia lookup capabilities without requiring API keys.
Web Search (
search_web): Search using multiple providers (DuckDuckGo, Bing, SearXNG) with automatic fallback. Supports configurable result limits (1–50), language selection, and fast/deep/auto modes.Fetch URL Content (
fetch_url): Retrieve readable content from URLs (HTML or PDF) with configurable truncation modes (compact ~3000 chars, standard ~8000 chars, full unlimited) and output formats (markdown, text, html).Summarize a URL (
summarize_url): Fetch content from a URL and generate a concise summary.Wikipedia Summary (
wiki_get): Retrieve a Wikipedia article summary by title, with multi-language support.Multi-language Wikipedia (
wiki_multi): Retrieve Wikipedia summaries for a term across multiple languages simultaneously using inter-language links for accurate title mapping.
Additional features: Rotating user agents for anti-bot protection, SSRF protection blocking localhost/private network access, and customizable configuration via environment variables (default providers, timeouts, custom SearXNG instances).
Enables web searching through DuckDuckGo's HTML interface to retrieve search results, titles, and snippets.
Provides tools to retrieve page summaries and cross-reference information across multiple languages from Wikipedia.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@mcp-web-calcSearch for 'OpenAI Sora' and summarize the most recent news article"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Web Search
MCP server for web search and URL/resource loading. It works without API keys by default and stays local-first: search uses free providers, fetch_url extracts useful content from URLs, and binary/media downloads only happen when explicitly requested.
Features
search_web- multi-provider web search with automatic fallback across DuckDuckGo, Bing, and SearXNG.fetch_url- universal URL/resource loader for HTML, PDF, text, Markdown, JSON, XML, CSV, media metadata, and supported site-specific URLs.Clean normalized output with one
contentfield plus metadata, pagination, links, media, attachments, and warnings.Reddit thread extraction through Reddit JSON endpoints instead of brittle Reddit HTML scraping.
Long-resource pagination with
max_length,start_index, andnext_start_index.Optional HTML link/media summaries.
Optional local download artifacts with
download: true.SSRF protection for localhost, private IPs, link-local ranges, IPv6 private ranges, and unsafe redirects.
No paid API required.
Requirements
Node.js 18+
Chrome/Chromium only if you use the Bing provider
MCP Configuration
Claude Code
{
"mcpServers": {
"web-search": {
"command": "npx",
"args": ["-y", "@zhafron/mcp-web-search"]
}
}
}OpenCode
{
"mcp": {
"web-search": {
"type": "local",
"command": ["npx", "@zhafron/mcp-web-search"]
}
}
}Custom Configuration
{
"mcpServers": {
"web-search": {
"command": "npx",
"args": ["-y", "@zhafron/mcp-web-search"],
"env": {
"DEFAULT_SEARCH_PROVIDER": "duckduckgo",
"SEARXNG_URL": "http://localhost:8099"
}
}
}
}Tools
search_web
Search the web through one provider or through the fallback chain.
Input:
{
"q": "openai codex reddit review",
"limit": 10,
"lang": "en",
"provider": "duckduckgo"
}Options:
Option | Description |
| Search query |
| Number of results, 1-50 |
| Search language, default |
| Optional provider: |
Output:
{
"items": [
{
"title": "Example Result",
"url": "https://example.com",
"snippet": "Result summary...",
"source": "duckduckgo"
}
],
"providerUsed": "duckduckgo",
"fallbackUsed": false,
"triedProviders": ["duckduckgo"]
}Fallback order:
DuckDuckGo → SearXNG → Bing
SearXNG → DuckDuckGo → Bing
Bing → DuckDuckGo → SearXNG
fetch_url
Fetch a URL and return extracted content plus metadata in a normalized envelope.
Input:
{
"url": "https://example.com/article",
"format": "markdown",
"max_length": 8000,
"start_index": 0,
"include_links": true,
"include_media": true
}Options:
Option | Description |
| URL to fetch |
|
|
| Maximum returned content characters, default 25000 |
| Start content from this character index |
|
|
| Include extracted links for HTML pages |
| Include extracted image/video/audio references for HTML pages |
| Include comments for site adapters that support comments, default true for Reddit |
| Maximum comments for comment-capable adapters, max 100 |
|
|
| Maximum comment nesting depth |
| Request timeout override |
| Bypass in-memory cache |
| Save original fetched bytes to a managed local file and return it in |
| Optional output directory for downloads; defaults to the system temp directory |
| Cleanup TTL for managed downloads, default 86400 seconds |
| Response/download byte cap override, additionally capped by |
Output:
{
"url": "https://example.com/article",
"final_url": "https://example.com/article",
"title": "Example Article",
"content_type": "text/html",
"resource_type": "html",
"format": "markdown",
"content": "# Example Article\n\n...",
"metadata": {
"status": 200,
"content_type": "text/html",
"byte_length": 12345,
"extractor": "html",
"fetched_at": "2026-05-03T00:00:00.000Z"
},
"links": [],
"media": {
"images": [],
"videos": [],
"audio": []
},
"truncated": false,
"original_length": 1200,
"start_index": 0,
"next_start_index": null,
"warnings": []
}Supported Resources
Resource | Behavior |
HTML pages | Extracts readable article content, title, metadata, optional links, and optional media references |
Text and Markdown | Returns text directly with pagination support |
JSON | Pretty-prints JSON when |
XML and CSV-like text | Returns as text/data content |
Extracts text and PDF metadata | |
Images | Returns metadata by default; saves the file only with |
Audio and video | Returns metadata by default; saves the file only with |
Archives and binary files | Returns metadata by default; downloads only when explicitly requested; archives are not auto-extracted |
Reddit threads | Uses Reddit JSON endpoints and can include comments with limits |
Local Downloads
fetch_url does not download binary/media files to disk by default. This avoids surprise disk usage and persistent local copies of arbitrary web content.
Use download: true when you need the original file available to another tool:
{
"url": "https://httpbin.org/image/png",
"format": "metadata",
"download": true,
"download_ttl_seconds": 86400
}Download attachments look like this:
{
"kind": "download",
"path": "/tmp/mcp-web-search/downloads/mcp-fetch-id-image.png",
"filename": "mcp-fetch-id-image.png",
"original_filename": "image.png",
"content_type": "image/png",
"resource_type": "image",
"byte_length": 8090,
"sha256": "...",
"expires_at": "2026-05-04T00:00:00.000Z"
}Download safety behavior:
Downloads are opt-in only.
Files are written with
0600permissions.Filenames are sanitized and prefixed with a managed artifact ID.
SHA-256 is returned for verification.
Expired managed artifacts are cleaned up through sidecar metadata.
Cleanup only touches managed artifacts inside the configured download directory.
Archives are never auto-extracted.
Reddit Thread Extraction
Reddit thread URLs are handled by a site adapter and fetched through Reddit JSON endpoints.
Input example:
{
"url": "https://www.reddit.com/r/codex/comments/abc123/gpt55_is_so_good/",
"include_comments": true,
"comment_limit": 30,
"comment_sort": "top",
"max_depth": 2
}The output uses resource_type: "site" and metadata.extractor: "reddit-thread".
Reddit public JSON can still rate-limit or return 403/429 depending on Reddit, subreddit rules, and request frequency. When that happens, retry later or reduce request frequency.
Providers
Provider | API Key Required | Notes |
DuckDuckGo | No | Default, simple, no browser required |
Bing | No | Uses Chrome/Chromium through Puppeteer |
SearXNG | No | Best option for self-hosted high-volume usage |
Environment Variables
Variable | Default | Description |
|
| Default search provider: |
|
| SearXNG instance URL |
|
| Request timeout in milliseconds |
|
| Maximum fetched response/download size |
| unset | Set to |
SearXNG Setup
SearXNG is a free self-hosted meta-search engine.
Quick setup with Docker:
mkdir -p ~/docker/searxngCreate ~/docker/searxng/settings.yml with JSON enabled, then run the SearXNG container. The important setting is search.formats containing both html and json.
Example relevant setting:
search:
formats:
- html
- jsonThen set:
export SEARXNG_URL="http://localhost:8099"Chrome Setup for Bing Provider
OS | Command |
Ubuntu/Debian |
|
Fedora |
|
Arch |
|
macOS |
|
Custom path:
export CHROME_PATH="/path/to/chrome"MCP Discovery Compatibility
Some MCP clients have weak schema parsers and fail during discovery on array-valued JSON Schema nodes such as enum or required.
If discovery fails, set:
export MCP_COMPAT_MODE="legacy"This only simplifies advertised tools/list schemas. Tool execution behavior stays the same.
URL Safety
fetch_url blocks unsafe targets before fetching and before following redirects.
Blocked targets include:
localhost hostnames
.localhostand.localhostnamesprivate IPv4 ranges
IPv4 loopback, link-local, carrier-grade NAT, benchmark, multicast, and selected special-use ranges
IPv4-mapped IPv6 addresses that resolve to blocked IPv4 ranges
IPv6 loopback, unspecified, unique-local, multicast, and link-local ranges
redirects that resolve to blocked addresses
The HTTP transport resolves and validates addresses before connecting, then connects to the vetted address while preserving the original host/SNI for normal HTTPS behavior.
Repository Structure
src/server.ts- MCP server and tool schemassrc/providers/- search providerssrc/fetch/- URL/resource loading pipelinesrc/fetch/content/- shared content helpers such as Markdown conversion and readability fallbacksrc/fetch/extractors/- resource extractors for HTML, text/data, PDF, and media metadatasrc/fetch/site-adapters/- domain-specific extractors such as Reddit threadssrc/utils/- shared utilitiestest/- Node test runner tests
Troubleshooting
Issue | Solution |
Chrome not found | Install Chrome/Chromium or set |
SearXNG 403 | Enable JSON API in |
Timeout | Increase |
MCP discovery error: | Set |
Reddit 429 or 403 | Reddit rate limited or blocked the JSON endpoint; retry later or reduce request frequency |
Download missing from output | Set |
Download rejected as too large | Increase |
License
MIT
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/tickernelz/mcp-web-search'
If you have feedback or need assistance with the MCP directory API, please join our Discord server