MCP ドキュメントサーバー

Docling ライブラリを使用してドキュメント処理機能を提供する MCP サーバー。

インストール

pip を使用してパッケージをインストールできます。

pip install -e .

使用法

stdio (デフォルト) または SSE トランスポートのいずれかを使用してサーバーを起動します。

# Using stdio transport (default) mcp-server-lls # Using SSE transport on custom port mcp-server-lls --transport sse --port 8000

uv を使用している場合は、インストールせずにサーバーを直接実行できます。

# Using stdio transport (default) uv run mcp-server-lls # Using SSE transport on custom port uv run mcp-server-lls --transport sse --port 8000

利用可能なツール

サーバーは次のツールを公開します。

convert_document : URL またはローカルパスからマークダウン形式にドキュメントを変換します
- source : ドキュメントへの URL またはローカルファイルパス (必須)
- enable_ocr : スキャンしたドキュメントのOCRを有効にするかどうか（オプション、デフォルト: false）
- ocr_language : OCRの言語コードのリスト、例: ["en", "fr"] (オプション)
convert_document_with_images : ドキュメントを変換し、埋め込まれた画像を抽出する
- source : ドキュメントへの URL またはローカルファイルパス (必須)
- enable_ocr : スキャンしたドキュメントのOCRを有効にするかどうか（オプション、デフォルト: false）
- ocr_language : OCRの言語コードのリスト（オプション）
extract_tables : ドキュメントから表を構造化データとして抽出する
- source : ドキュメントへの URL またはローカルファイルパス (必須)
convert_batch : 複数のドキュメントをバッチモードで処理する
- sources : ドキュメントへのURLまたはファイルパスのリスト（必須）
- enable_ocr : スキャンしたドキュメントのOCRを有効にするかどうか（オプション、デフォルト: false）
- ocr_language : OCRの言語コードのリスト（オプション）
qna_from_document : URL またはローカルパスから YAML 形式で Q&A ドキュメントを作成します
- source : ドキュメントへの URL またはローカルファイルパス (必須)
- no_of_qnas : 予想されるQ&Aの数（オプション、デフォルト: 5）
- 注: このツールでは、IBM Watson X の資格情報を環境変数として設定する必要があります。
  - WATSONX_PROJECT_ID : Watson X プロジェクト ID
  - WATSONX_APIKEY : IBM Cloud APIキー
  - WATSONX_URL : Watson X API URL (デフォルト: https://us-south.ml.cloud.ibm.com )
get_system_info : システム構成と加速ステータスに関する情報を取得します

ラマスタックの例

https://github.com/user-attachments/assets/8ad34e50-cbf7-4ec8-aedd-71c42a5de0a1

このサーバーをLlama Stackと併用することで、LLMアプリケーションにドキュメント処理機能を提供できます。Llama Stackサーバーが稼働していることを確認し、 INFERENCE_MODELを設定してください。

from llama_stack_client.lib.agents.agent import Agent from llama_stack_client.lib.agents.event_logger import EventLogger from llama_stack_client.types.agent_create_params import AgentConfig from llama_stack_client.types.shared_params.url import URL from llama_stack_client import LlamaStackClient import os # Set your model ID model_id = os.environ["INFERENCE_MODEL"] client = LlamaStackClient( base_url=f"http://localhost:{os.environ.get('LLAMA_STACK_PORT', '8080')}" ) # Register MCP tools client.toolgroups.register( toolgroup_id="mcp::docling", provider_id="model-context-protocol", mcp_endpoint=URL(uri="http://0.0.0.0:8000/sse")) # Define an agent with MCP toolgroup agent_config = AgentConfig( model=model_id, instructions="""You are a helpful assistant with access to tools to manipulate documents. Always use the appropriate tool when asked to process documents.""", toolgroups=["mcp::docling"], tool_choice="auto", max_tool_calls=3, ) # Create the agent agent = Agent(client, agent_config) # Create a session session_id = agent.create_session("test-session") def _summary_and_qna(source: str): # Define the prompt run_turn(f"Please convert the document at {source} to markdown and summarize its content.") run_turn(f"Please generate a Q&A document with 3 items for source at {source} and display it in YAML format.") def _run_turn(prompt): # Create a turn response = agent.create_turn( messages=[ { "role": "user", "content": prompt, } ], session_id=session_id, ) # Log the response for log in EventLogger().log(response): log.print() _summary_and_qna('https://arxiv.org/pdf/2004.07606')

キャッシング

サーバーは、繰り返しのリクエストのパフォーマンスを向上させるために、処理済みのドキュメントを~/.cache/mcp-docling/にキャッシュします。

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

モデルコンテキストプロトコルを使用してドキュメント処理機能を提供し、ドキュメントのマークダウンへの変換、表の抽出、ドキュメント画像の処理を可能にするサーバーです。

Related Resources

Reddit Discussion about this server

Related MCP Servers

MCP Server Fetch TypeScript
tatn
A
security
A
license
A
quality
A Model Context Protocol server that provides web content fetching and conversion capabilities.
Last updated -
4
66
3
MIT License
MarkItDown MCP Server
canlgz
A
security
A
license
A
quality
A Model Context Protocol server that converts various file formats (PDF, PowerPoint, Word, Excel, Images, etc.) to Markdown to make them accessible to LLMs.
Last updated -
1
MIT License
Better Fetch
flutterninja9
A
security
F
license
A
quality
A Model Context Protocol server that intelligently fetches and processes web content, transforming websites and documentation into clean, structured markdown with nested URL crawling capabilities.
Last updated -
2
25
5
DOCX MCP Server
zeph-gh
A
security
F
license
A
quality
A comprehensive Model Context Protocol server that processes Microsoft Word documents with full formatting support, enabling text extraction, HTML/Markdown conversion, structure analysis, and image extraction.
Last updated -
5
1

View all related MCP servers

MCP Docling Server

MCP ドキュメントサーバー

インストール

使用法

利用可能なツール

ラマスタックの例

キャッシング

Related Resources

Related MCP Servers

MCP Server Fetch TypeScript

MarkItDown MCP Server

Better Fetch

DOCX MCP Server

Appeared in Searches

New MCP Servers

MCP directory API