Which integrations are available for this server?

Integrates DuckDuckGo as a search engine source to retrieve clean, spam-free search results filtered by domain blocklists and content quality scoring.

How do I use clean-search-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@clean-search-mcp search for best practices in React hooks" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

clean-search-mcp

by lzmd66

Overview Schema Related Servers Score Discussions

Python

Local

clean-search-mcp 🧹

A lightweight MCP (Model Context Protocol) service that provides clean, spam-free search results for AI agents. Filters out content farms, SEO garbage, and low-quality sites before they reach your LLM.

Features

Three search engines — Yandex + Bing + DuckDuckGo auto-fallback
176K+ domain blocklist — auto-updated from 25+ community sources, covers malware/scam/ads/content farms
Three-layer filtering — domain blacklist → content rules → quality scoring
Content extraction — full page text via trafilatura + selectolax
Result scoring — 0-1 quality score (official docs 0.8 > tutorials 0.6 > garbage 0)
LRU cache — search cache 6h, fetch cache 24h, auto-cleanup
User blacklist — add domains on the fly, report bad results
Deep mode — optional Playwright fallback for JS-heavy pages
No heavy dependencies — pure HTTP, no browser required

Related MCP server: internet-context-mcp

Quick Start

pip install -r requirements.txt
python main.py

MCP Client Config

{
  "mcpServers": {
    "clean-search": {
      "command": "python",
      "args": ["/path/to/clean_search_mcp/main.py"]
    }
  }
}

Test Locally

python test_local.py "your search query" -n 5
python test_local.py "your query" -n 5 --no-content   # skip page content
python test_local.py "your query" --deep               # use Playwright fallback

API

`clean_search(query, max_results=5, with_content=True, deep_mode=False)`

Param	Default	Description
`query`	required	Search query
`max_results`	5	Results to return (max 10)
`with_content`	True	Include extracted page text
`deep_mode`	False	Use Playwright fallback for JS pages

Returns [{title, url, snippet, content, score}] sorted by quality.

`add_user_blacklist(domain)`

Add a domain to personal blocklist.

`report_bad_result(url)`

Report a low-quality URL (domain auto-blocked).

Configuration

Edit config.py to tune:

Search providers: enable/disable Yandex, Bing, DuckDuckGo
Blacklist sources: add or remove community blocklist URLs
Scoring weights: adjust domain authority, content quality bonuses
Caching: TTL, max files, cleanup interval
Proxy: set PROXY for HTTP/Playwright

Dependencies

mcp, httpx, selectolax, trafilatura, duckduckgo-search

All lightweight pip packages. Playwright is optional (deep mode only).

License

MIT

This server cannot be installed

license - not found

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
open source
OpenAI
Tool Definition Quality Score (TDQS)
By punkpeye on April 3, 2026.
mcp
The Hackers Who Tracked My Sleep Cycle
By punkpeye on March 26, 2026.
security

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/lzmd66/clean-search-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server