MCP Firecrawl サーバー

これは、Firecrawl の API を使用して Web サイトをスクレイピングし、構造化データを抽出するためのツールを提供するシンプルな MCP サーバーです。

設定

依存関係をインストールします:

npm install

次の変数を含む.envファイルをルートディレクトリに作成します。

FIRECRAWL_API_TOKEN=your_token_here
SENTRY_DSN=your_sentry_dsn_here

FIRECRAWL_API_TOKEN (必須): Firecrawl APIトークン
SENTRY_DSN (オプション): エラー追跡とパフォーマンス監視用の Sentry DSN

サーバーを起動します。

npm start

あるいは、サーバーの実行時に環境変数を直接設定することもできます。

FIRECRAWL_API_TOKEN=your_token_here npm start

特徴

ウェブサイトスクレイピング：さまざまな形式でウェブサイトからコンテンツを抽出します
構造化データ抽出:カスタムスキーマに基づいて特定のデータポイントを抽出します
エラー追跡: エラー追跡とパフォーマンス監視のために Sentry と統合されています

使用法

サーバーは 2 つのツールを公開します。

scrape-website : 複数のフォーマットオプションを備えた基本的なウェブサイトスクレイピング
extract-data : プロンプトとスキーマに基づく構造化データ抽出

ツール: scrape-website

このツールは、Web サイトをスクレイピングし、要求された形式でそのコンテンツを返します。

パラメータ:

url (文字列、必須): スクレイピングするウェブサイトのURL
formats (文字列の配列、オプション): 希望する出力形式の配列。サポートされている形式は以下のとおりです。
- "markdown" （デフォルト）
- "html"
- "text"

MCP Inspector の使用例:

# Basic usage (defaults to markdown)
mcp-inspector --tool scrape-website --args '{
  "url": "https://example.com"
}'

# Multiple formats
mcp-inspector --tool scrape-website --args '{
  "url": "https://example.com",
  "formats": ["markdown", "html", "text"]
}'

ツール: データ抽出

このツールは、提供されたプロンプトとスキーマに基づいて、Web サイトから構造化データを抽出します。

パラメータ:

urls （文字列の配列、必須）: データを抽出するURLの配列
prompt （文字列、必須）：抽出するデータを説明するプロンプト
schema (オブジェクト、必須): 抽出するデータのスキーマ定義

スキーマ定義は、キーがフィールド名、値が型であるオブジェクトである必要があります。サポートされる型は次のとおりです。

"string" : テキストフィールドの場合
"boolean" : true/falseフィールド用
"number" : 数値フィールドの場合
配列: ["type"]として指定されます。type は上記のいずれかです。
オブジェクト: 独自の型定義を持つネストされたオブジェクト

MCP Inspector の使用例:

# Basic example extracting company information
mcp-inspector --tool extract-data --args '{
  "urls": ["https://example.com"],
  "prompt": "Extract the company mission, whether it supports SSO, and whether it is open source.",
  "schema": {
    "company_mission": "string",
    "supports_sso": "boolean",
    "is_open_source": "boolean"
  }
}'

# Complex example with nested data
mcp-inspector --tool extract-data --args '{
  "urls": ["https://example.com/products", "https://example.com/pricing"],
  "prompt": "Extract product information including name, price, and features.",
  "schema": {
    "products": [{
      "name": "string",
      "price": "number",
      "features": ["string"]
    }]
  }
}'

どちらのツールも、スクレイピングまたは抽出が失敗した場合に適切なエラーメッセージを返し、設定されている場合は自動的に Sentry にエラーを記録します。

トラブルシューティング

問題が発生した場合:

Firecrawl APIトークンが有効であることを確認してください
スクレイピングしようとしているURLにアクセスできることを確認してください
複雑なスキーマの場合は、サポートされている形式に従っていることを確認してください。
Sentry ログで詳細なエラー情報を確認する (構成されている場合)

Install Server

HTTP connection URL

security – no known vulnerabilities

license - not found

quality - confirmed to work

How are these scores calculated?

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Tools

Firecrawl の API を使用して Web サイトをスクレイピングし、そこから構造化データを抽出するためのツールを提供するサーバー。複数の形式での基本的な Web サイトスクレイピングと、カスタムスキーマベースのデータ抽出の両方をサポートします。

Related Resources

Reddit Discussion about this server

Related MCP Servers

mcp-server-firecrawl
mendableai
A
security
A
license
A
quality
A Model Context Protocol (MCP) server implementation that integrates with FireCrawl for advanced web scraping capabilities.
Last updated -
9
21,717
3,668
JavaScript
MIT License
MCP Server Firecrawl
Msparihar
A
security
A
license
A
quality
A server that provides web scraping and intelligent content searching capabilities using the Firecrawl API, enabling AI agents to extract structured data from websites and perform content searches.
Last updated -
5
2
TypeScript
MIT License
Firecrawl MCP Server
mcma123
-
security
A
license
-
quality
A Model Context Protocol server that enables AI assistants to perform advanced web scraping, crawling, searching, and data extraction through the Firecrawl API.
Last updated -
21,717
MIT License
Oxylabs MCP Serverofficial
oxylabs
A
security
A
license
A
quality
A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
Last updated -
2
33
Python
MIT License

View all related MCP servers

MCP Firecrawl Server