スクラップリングフェッチMCP

AI アシスタントがボット検出を実装した Web サイトのテキストコンテンツにアクセスできるようにし、ブラウザーで表示できるものと AI がアクセスできるもののギャップを埋める MCP サーバーです。

使用目的

このツールは、ボット検出機能を実装したウェブサイトから少量のドキュメントや参考資料（テキスト/HTMLのみ）を取得するために最適化されています。汎用的なサイトスクレイピングやデータ収集を目的として設計・テストされていません。

注: このプロジェクトは、 LLM Contextを使用して、Claude Sonnet 3.7 と共同で開発されました。

インストール

要件：
- Python 3.10以上
- UVパッケージマネージャー
依存関係とツールをインストールします。

uv tool install scrapling
scrapling install
uv tool install scrapling-fetch-mcp

クロードとのセットアップ

この構成を Claude クライアントの MCP サーバー構成に追加します。

{
  "mcpServers": {
    "Cyber-Chitta": {
      "command": "uvx",
      "args": ["scrapling-fetch-mcp"]
    }
  }
}

利用可能なツール

このパッケージは、2 つの異なるツールを提供します。

s-fetch-page : ページネーションサポート付きの完全なWebページを取得します
s-fetch-pattern : 正規表現パターンと周囲のコンテキストに一致するコンテンツを抽出します。

使用例

完全なページを取得する

Human: Please fetch and summarize the documentation at https://example.com/docs

Claude: I'll help you with that. Let me fetch the documentation.

<mcp:function_calls>
<mcp:invoke name="s-fetch-page">
<mcp:parameter name="url">https://example.com/docs</mcp:parameter>
<mcp:parameter name="mode">basic</mcp:parameter>
</mcp:invoke>
</mcp:function_calls>

Based on the documentation I retrieved, here's a summary...

パターンマッチングによる特定のコンテンツの抽出

Human: Please find all mentions of "API keys" on the documentation page.

Claude: I'll search for that specific information.

<mcp:function_calls>
<mcp:invoke name="s-fetch-pattern">
<mcp:parameter name="url">https://example.com/docs</mcp:parameter>
<mcp:parameter name="mode">basic</mcp:parameter>
<mcp:parameter name="search_pattern">API\s+keys?</mcp:parameter>
<mcp:parameter name="context_chars">150</mcp:parameter>
</mcp:invoke>
</mcp:function_calls>

I found several mentions of API keys in the documentation:
...

機能オプション

保護レベル:
- basic : 高速な取得（1～2秒）ですが、厳重に保護されたサイトでは成功率が低くなります。
- stealth ：ほとんどのサイトで機能するバランスの取れた保護（3〜8秒）
- max-stealth : 厳重に保護されたサイトに対する最大限の保護（10秒以上）
コンテンツターゲティングオプション:
- s-fetch-page : ページ区切りのサポートを使用してページ全体を取得します（ start_indexとmax_lengthを使用）
- s-fetch-pattern : 正規表現を使用して特定のコンテンツを抽出します（ search_patternとcontext_charsを使用）
  - 結果にはs-fetch-pageを使用したフォローアップクエリの位置情報が含まれます。

最良の結果を得るためのヒント

basicモードから開始し、必要に応じてより高い保護レベルにエスカレートします。
大きな文書の場合は、 s-fetch-pageでページ区切りパラメータを使用します。
大きなページで特定の情報を探すときはs-fetch-patternを使用します。
AIはサイトの保護レベルに応じて自動的にアプローチを調整します。

制限事項

テキストコンテンツ専用に設計：特にドキュメント、記事、参考資料向け
大量のスクレイピングやデータ収集には適していません
認証が必要なサイトでは動作しない場合があります
パフォーマンスはサイトの複雑さによって異なります

ライセンス

アパッチ2

Install Server

HTTP connection URL

security – no known vulnerabilities

license - permissive license

quality - confirmed to work

How are these scores calculated?

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

Tools

Related MCP Servers

SEO AI Assistant
ayushsinghvi92
-
security
F
license
-
quality
MCP server that enables AI assistants to perform SEO automation tasks including keyword research, SERP analysis, and competitor analysis through Google Ads API integration.
Last updated -
browser-use MCP Server
co-browser
-
security
A
license
-
quality
An MCP server that enables AI assistants to control a web browser through natural language commands, allowing them to navigate websites and extract information via SSE transport.
Last updated -
663
Python
MIT License
browser-mcp
djyde
-
security
F
license
-
quality
A MCP server that allows AI assistants to interact with the browser, including getting page content as markdown, modifying page styles, and searching browser history.
Last updated -
79
TypeScript
YouTube Toolbox
jikime
A
security
F
license
A
quality
An MCP server that provides AI assistants with powerful tools to interact with YouTube, including video searching, transcript extraction, comment retrieval, and more.
Last updated -
8
15
Python

View all related MCP servers

Scrapling Fetch MCP