DuckDB-RAG-MCP-Sample

DuckDB RAG MCP Sample

markdown 문서를 포함 벡터화하여 MCP에서 RAG로 설명할 수 있는 샘플입니다.

벡터화에는 Plamo-Embedding-1B 를 사용합니다.

기능

markdown 파일에서 텍스트 추출 및 벡터화
DuckDB를 사용하여 벡터 검색
Parquet 파일로 벡터 데이터 지속성
MCP에서 벡터 검색

사용방법

벡터 데이터 생성

먼저 검색할 markdown 파일을 특정 디렉토리에 넣고 다음 명령을 사용하여 Parquet 파일로 변환하십시오.

uv run main.py --directory ~/path/to/markdown/files --parquet vectors.parquet

MCP 설정

빌드

다음 명령은 단일 바이너리를 dist/server 로 생성합니다.

uv run pyinstaller --clean --strip --noconfirm --onefile server.py

MCP 클라이언트 설정

사용하려는 클라이언트에 따라 설정하십시오.

Claude Desktop의 경우는 다음과 같은 느낌입니다.

VECTOR_PARQUET는 이전에 변환된 파일을 지정해야 합니다.

uv run mcp install server.py -v VECTOR_PARQUET=/path/to/vectors.parquet

다음과 같이 설정됩니다.

{
  "mcpServers": {
    "DuckDB-RAG-MCP-Sample": {
      "command": "/path/to/dist/server",
      "env": {
        "VECTOR_PARQUET": "/path/to/vectors.parquet"
      }
    }
  }
}

개발 서버 시작

uv run mcp dev server.py

라이센스

DuckDB RAG MCP Sample은 Apache License, Version 2.0에서 제공됩니다.

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

An MCP server that enables RAG (Retrieval-Augmented Generation) on markdown documents by converting them to embedding vectors and performing vector search using DuckDB.

Related MCP Servers

Duck Duck MCP
qwang07
A
security
A
license
A
quality
This MCP server utilizes DuckDuckGo for web searches, providing structured search results with metadata and features like smart content classification and language detection, facilitating easy integration with AI clients supporting the MCP protocol.
Last updated -
1
663
2
JavaScript
MIT License
Vectorizeofficial
vectorize-io
A
security
A
license
A
quality
Vectorize MCP server for advanced retrieval, Private Deep Research, Anything-to-Markdown file extraction and text chunking.
Last updated -
3
56
83
JavaScript
MIT License
Library MCP
lethain
A
security
F
license
A
quality
An MCP server that enables interaction with Markdown knowledge bases, allowing users to search and retrieve content by tags, text, URL, or date range from their local markdown files.
Last updated -
7
71
Python
RAG Memory MCP
ttommyth
-
security
F
license
-
quality
An advanced MCP server providing RAG-enabled memory through a knowledge graph with vector search capabilities, enabling intelligent information storage, semantic retrieval, and document processing.
Last updated -
35
16
TypeScript

View all related MCP servers

DuckDB-RAG-MCP-Sample