Skip to main content
Glama

MCP Data Server

by hkonda015

MCP Data Server — Local file search you can call from Claude (or CLI)

MCP Data Server indexes files on your machine (PDF, TXT, CSV, Markdown, etc.) and lets you search them with embeddings. You can use it from:

  • a friendly CLI (ls, index, search)
  • an MCP server over stdio (so Claude Desktop/Cursor can call your tools)

Works great on Windows 11. Also tested on macOS/Linux (see notes).


Table of contents


Features

  • 🔎 Local search with SentenceTransformers embeddings (cosine similarity)
  • ⚡ Optional FAISS index for fast Top-K search
  • 🧰 Simple CLI: ls, index, search
  • 🔌 MCP server so Claude Desktop can call tools: list_docs_tool, index_docs_tool, search_chunks_tool, read_doc_tool
  • 🧩 Extensible loaders/chunkers; add new formats easily
  • ✅ Batteries-included dev setup: Ruff, Black, MyPy, PyTest, pre-commit

Prerequisites

  • Python 3.11+ (3.11 recommended)
  • Windows 11 (PowerShell) macOS/Linux are fine too (bash)
  • ~3 GB free disk space on first run (model cache)
  • (Optional) FAISS CPU wheels installed automatically via faiss-cpu

Quick start (Windows)

Folder in this repo where you put files to index: ./data/

# 1) Clone and enter project git clone https://github.com/hkonda015/McpServer.git Set-Location .\McpServer\McpServer # 2) Create & activate venv (PowerShell) python -m venv .venv .\.venv\Scripts\Activate.ps1 # 3) Install runtime (or dev) dependencies pip install -r requirements.txt # or for contributors: pip install -r requirements-dev.txt # 4) (Optional) pre-commit hooks pre-commit install # 5) Put a few files in .\data\ (txt/pdf/csv/md), then: python -m mcp_data_server ls python -m mcp_data_server index python -m mcp_data_server search "your query" --k 5 ## Usage (CLI) The CLI lets you **list files**, **build/rebuild the index**, and **search** your local documents. > **Prereq:** open a terminal at your repo root and activate the venv > Windows (PowerShell): > ```powershell > Set-Location .\McpServer > .\.venv\Scripts\Activate.ps1 > ``` > macOS/Linux (bash): > ```bash > cd McpServer > source .venv/bin/activate > ``` --- ### 1) List files (`ls`) Lists all **supported documents** under `DATA_DIR` (defaults to `./data`). ```powershell python -m mcp_data_server ls # Contributing to MCP Data Server Thanks for your interest in contributing! This document explains how to set up your dev environment, the coding standards we use, how to run tests, and how to submit a good pull request. --- ## Ways to contribute - **Bug reports**: include steps to reproduce, expected vs actual behavior, OS, Python version, and logs. - **Feature requests**: explain the use case, not just the solution. Sketch CLI and/or MCP tool UX if relevant. - **Documentation**: improve READMEs, examples, and comments. - **Code**: bug fixes, new loaders, chunking strategies, performance improvements, tests. Good first issues will be labeled **good first issue** and **help wanted**. --- ## Development setup ### Prerequisites - Python **3.11+** (we recommend 3.11) - Git - ~3 GB free disk space for model cache on first run ### Clone and create a virtual environment #### Windows (PowerShell) ```powershell git clone https://github.com/hkonda015/McpServer.git Set-Location .\McpServer\McpServer python -m venv .venv .\.venv\Scripts\Activate.ps1 pip install -r requirements-dev.txt pre-commit install
-
security - not tested
F
license - not found
-
quality - not tested

local-only server

The server can only run on the client's local machine because it depends on local resources.

Indexes local files (PDF, TXT, CSV, Markdown) with embeddings for semantic search. Provides both CLI and MCP server interfaces so Claude Desktop can search and read your local documents.

  1. Table of contents
    1. Features
      1. Prerequisites
        1. Quick start (Windows)

          Related MCP Servers

          • A
            security
            A
            license
            A
            quality
            The Search MCP Server enables seamless integration of network and local search capabilities in tools like Claude Desktop and Cursor, utilizing the Brave Search API for high-concurrency and asynchronous requests.
            Last updated -
            1
            72
            MIT License
            • Linux
          • -
            security
            F
            license
            -
            quality
            An MCP server that integrates with Claude to provide smart documentation search capabilities across multiple AI/ML libraries, allowing users to retrieve and process technical information through natural language queries.
            Last updated -
          • -
            security
            A
            license
            -
            quality
            An MCP server that enables AI assistants like Claude Desktop to search and retrieve information from custom search indexes created with Searchcraft.
            Last updated -
            5
            Apache 2.0
            • Apple
          • A
            security
            A
            license
            A
            quality
            An MCP server that allows users to efficiently search and reference user-configured documents through document listing, grep searching, semantic searching with OpenAI Embeddings, and full document retrieval.
            Last updated -
            4
            3
            MIT License
            • Apple
            • Linux

          View all related MCP servers

          MCP directory API

          We provide all the information about MCP servers via our MCP API.

          curl -X GET 'https://glama.ai/api/mcp/v1/servers/hkonda015/McpServer'

          If you have feedback or need assistance with the MCP directory API, please join our Discord server