Skip to main content
Glama
MandarGavali

SecondBrain MCP Server

by MandarGavali

๐Ÿง  SecondBrain AI

Production-Ready AI Personal Knowledge Assistant

An intelligent knowledge management system powered by Agentic RAG, LangGraph, Long-Term Memory, and the Model Context Protocol (MCP).

Python FastAPI LangGraph Docker MongoDB Qdrant Redis


๐Ÿš€ Overview

SecondBrain AI is a production-ready AI knowledge assistant designed to transform personal documents into a searchable, conversational knowledge base.

Unlike a traditional chatbot, SecondBrain combines Retrieval-Augmented Generation (RAG), LangGraph-based agent orchestration, short-term and long-term memory, and the Model Context Protocol (MCP) to create an extensible AI assistant capable of understanding documents, remembering context across conversations, and exposing its capabilities to external AI clients.

The project follows a modular architecture with asynchronous document ingestion, background processing, vector search using Qdrant, persistent conversational memory through MongoDB, and containerized deployment using Docker Compose.


Related MCP server: idea-basin-mcp

โœจ Key Highlights

  • ๐Ÿง  Agentic RAG powered by LangGraph

  • ๐Ÿ“„ Upload and chat with PDF documents

  • ๐Ÿ” Semantic vector search using Qdrant

  • ๐Ÿง  Persistent long-term memory with Mem0

  • ๐Ÿ’ฌ Conversation state managed using MongoDB Checkpointer

  • โšก FastAPI REST API

  • ๐Ÿ”„ Redis + RQ background workers

  • ๐Ÿ”Œ Model Context Protocol (MCP) Server

  • ๐Ÿณ Fully Dockerized architecture

  • ๐Ÿ“œ Structured logging and centralized error handling

  • ๐Ÿงช Smoke-tested production workflow


๐Ÿ“– Project Overview

SecondBrain AI is a production-ready personal knowledge assistant that enables users to build an intelligent, searchable knowledge base from their own documents. Instead of relying solely on a large language model's built-in knowledge, SecondBrain retrieves relevant information from user-provided content and generates grounded, context-aware responses.

The system is built around an Agentic Retrieval-Augmented Generation (RAG) architecture powered by LangGraph. User requests are orchestrated through a multi-step workflow that performs intent routing, semantic retrieval, memory lookup, and response generation before returning a final answer.

To support real-world usage, SecondBrain combines multiple components into a modular backend:

  • FastAPI for serving REST APIs

  • Qdrant for semantic vector search

  • MongoDB for conversation checkpoints and persistence

  • Mem0 for long-term memory management

  • Redis + RQ for asynchronous background processing

  • Docker Compose for reproducible deployment

  • Model Context Protocol (MCP) for integration with external AI clients

The project follows production-oriented software engineering practices including modular architecture, centralized logging, environment-based configuration, containerization, background workers, and automated smoke testing.

Rather than being a simple chatbot, SecondBrain demonstrates how modern AI systems combine retrieval, reasoning, memory, orchestration, and external tools to build scalable intelligent applications.


โœจ Features

๐Ÿง  AI & Intelligence

  • Agentic Retrieval-Augmented Generation (RAG) powered by LangGraph

  • Semantic document retrieval using Qdrant Vector Database

  • Long-term memory using Mem0

  • Short-term conversational memory with MongoDB Checkpointer

  • Context-aware response generation using Google Gemini

  • Multi-step workflow orchestration for intelligent query handling


๐Ÿ“„ Document Processing

  • PDF document ingestion

  • Automatic text extraction and cleaning

  • Intelligent document chunking

  • Embedding generation

  • Vector indexing for semantic search


โš™๏ธ Backend & Infrastructure

  • FastAPI REST API

  • Asynchronous document processing using Redis + RQ

  • Modular service-oriented architecture

  • Centralized logging

  • Custom exception handling

  • Environment-based configuration

  • Production-ready Docker deployment


๐Ÿ”Œ Integrations

  • Model Context Protocol (MCP) Server

  • REST API endpoints

  • Command Line Interface (CLI)


๐Ÿงช Quality & Reliability

  • Smoke tests

  • Dockerized development environment

  • Persistent MongoDB storage

  • Persistent Qdrant storage

  • Modular project structure


๐Ÿ—๏ธ System Architecture

The following diagram illustrates the high-level architecture of SecondBrain AI.

SecondBrain follows a modular architecture where each component has a well-defined responsibility.

Component

Responsibility

FastAPI

Exposes REST APIs

LangGraph

Orchestrates the AI workflow

Gemini

Generates responses

Qdrant

Stores vector embeddings

MongoDB

Stores conversation state

Mem0

Manages long-term memory

Redis + RQ

Executes background jobs

MCP Server

Exposes tools for external AI clients


๐Ÿ”„ Detailed Workflow

The following diagram illustrates the complete execution flow inside SecondBrain AI.

Request Flow

User
   โ”‚
   โ–ผ
FastAPI
   โ”‚
   โ–ผ
LangGraph Workflow
   โ”‚
   โ”œโ”€โ”€ Route Request
   โ”œโ”€โ”€ Retrieve Documents
   โ”œโ”€โ”€ Retrieve Memory
   โ”œโ”€โ”€ Grade Documents
   โ”œโ”€โ”€ Rewrite Query (if needed)
   โ”œโ”€โ”€ Generate Response
   โ””โ”€โ”€ Store Conversation Memory
   โ”‚
   โ–ผ
Gemini LLM
   โ”‚
   โ–ผ
Response

Document Upload Flow

PDF Upload
     โ”‚
     โ–ผ
FastAPI
     โ”‚
     โ–ผ
Redis Queue
     โ”‚
     โ–ผ
RQ Worker
     โ”‚
     โ–ผ
PDF Loader
     โ”‚
     โ–ผ
Text Cleaning
     โ”‚
     โ–ผ
Chunking
     โ”‚
     โ–ผ
Gemini Embeddings
     โ”‚
     โ–ผ
Qdrant Vector Store

This architecture separates document ingestion from user interaction, enabling scalable background processing while keeping the API responsive.

โš™๏ธ Tech Stack

Category

Technologies

Programming Language

Python 3.13

Backend Framework

FastAPI

AI Framework

LangChain, LangGraph

LLM

Google Gemini

Embeddings

Gemini Embedding Model

Vector Database

Qdrant

Memory

Mem0, MongoDB Checkpointer

Database

MongoDB

Background Processing

Redis, RQ Worker

Document Processing

PyPDF

API Documentation

Swagger / OpenAPI

Protocol

Model Context Protocol (MCP)

Containerization

Docker, Docker Compose

Configuration

Python Dotenv

Testing

Smoke Tests

Version Control

Git, GitHub


๐Ÿ“‚ Project Structure

SecondBrain/
โ”‚
โ”œโ”€โ”€ secondbrain/
โ”‚   โ”œโ”€โ”€ agent/              # Agent orchestration
โ”‚   โ”œโ”€โ”€ agents/             # Specialized AI agents
โ”‚   โ”œโ”€โ”€ api/                # FastAPI endpoints
โ”‚   โ”œโ”€โ”€ cli/                # Command-line interface
โ”‚   โ”œโ”€โ”€ core/               # Logging & exceptions
โ”‚   โ”œโ”€โ”€ graph/              # LangGraph workflow
โ”‚   โ”œโ”€โ”€ mcp_server/         # MCP server implementation
โ”‚   โ”œโ”€โ”€ memory/             # Short & long-term memory
โ”‚   โ”œโ”€โ”€ models/             # Request & response models
โ”‚   โ”œโ”€โ”€ queues/             # Redis queue & worker
โ”‚   โ”œโ”€โ”€ rag/                # RAG pipeline
โ”‚   โ”œโ”€โ”€ tools/              # AI tools
โ”‚   โ”œโ”€โ”€ data/               # Runtime data
โ”‚   โ””โ”€โ”€ main.py             # FastAPI application
โ”‚
โ”œโ”€โ”€ tests/                  # Smoke tests
โ”œโ”€โ”€ assets/                 # README images
โ”œโ”€โ”€ logs/                   # Application logs
โ”‚
โ”œโ”€โ”€ Dockerfile
โ”œโ”€โ”€ docker-compose.yml
โ”œโ”€โ”€ requirements.txt
โ”œโ”€โ”€ pyproject.toml
โ”œโ”€โ”€ .env.example
โ””โ”€โ”€ README.md

๐Ÿ“ฆ Core Components

Module

Description

RAG Pipeline

Document ingestion, chunking, embeddings, retrieval

LangGraph

Agent workflow orchestration

Memory

Short-term & long-term conversational memory

FastAPI

REST API layer

Redis Worker

Background document processing

Qdrant

Vector similarity search

MongoDB

Conversation persistence

MCP Server

External AI tool integration


๐Ÿš€ Installation & Quick Start

Prerequisites

Before getting started, ensure you have the following installed:

  • Python 3.13+

  • Docker & Docker Compose

  • Git

  • Google Gemini API Key


Clone the Repository

git clone https://github.com/MandarGavali/SecondBrain.git
cd SecondBrain

Configure Environment Variables

Create a .env file in the project root.

GOOGLE_API_KEY=your_google_api_key

MONGODB_URI=mongodb://localhost:27017
QDRANT_URL=http://localhost:6333
REDIS_URL=redis://localhost:6379

Start the complete application stack:

docker compose up --build

This launches:

  • FastAPI Server

  • Redis

  • MongoDB

  • Qdrant

  • Background Worker

Swagger UI:

http://localhost:8000/docs

Run Locally (Without Docker)

Create a virtual environment:

python -m venv venv

Activate it.

Windows:

venv\Scripts\activate

Linux / macOS:

source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Start the FastAPI server:

uvicorn secondbrain.main:app --reload

Start the Redis worker in another terminal:

python -m secondbrain.queues.worker

The API will be available at:

http://localhost:8000

Swagger UI:

http://localhost:8000/docs

๐Ÿณ Docker Deployment

SecondBrain is fully containerized using Docker Compose, making the entire application stack reproducible with a single command.

Containers

Container

Purpose

secondbrain-api

FastAPI application

secondbrain-worker

Background document processing

secondbrain-mongodb

Conversation state & memory

secondbrain-qdrant

Vector database

secondbrain-redis

Background job queue


Start the Stack

docker compose up --build

Run in detached mode:

docker compose up -d

Stop the stack:

docker compose down

Rebuild after dependency changes:

docker compose up --build --force-recreate

Verify Services

docker ps

Expected running containers:

  • secondbrain-api

  • secondbrain-worker

  • secondbrain-mongodb

  • secondbrain-qdrant

  • secondbrain-redis


Persistent Storage

Docker volumes are used to persist application data.

Volume

Stores

mongodb_data

MongoDB data

qdrant_data

Vector embeddings

This ensures conversations and indexed documents remain available even after restarting the containers.

๐Ÿ“ก API Reference

Once the application is running, Swagger documentation is available at:

http://localhost:8000/docs

REST Endpoints

Method

Endpoint

Description

POST

/upload

Upload and index PDF documents

POST

/chat

Chat with indexed documents

GET

/jobs/{job_id}

Check background processing status


Example Upload Request

curl -X POST \
  "http://localhost:8000/upload" \
  -F "file=@document.pdf"

Example Chat Request

POST /chat

{
  "query": "Summarize the uploaded document."
}

๐Ÿ”Œ Model Context Protocol (MCP) Integration

SecondBrain includes a dedicated Model Context Protocol (MCP) Server, allowing external AI clients (such as Claude Desktop, Cursor, VS Code, and other MCP-compatible applications) to interact directly with the knowledge base.

Instead of exposing only REST APIs, MCP enables AI assistants to invoke tools, retrieve documents, and access memory through a standardized protocol.

Available MCP Tools

Tool

Description

ask_secondbrain

Query the complete RAG pipeline with memory support

search_documents

Perform semantic search across indexed documents

upload_document

Upload new documents to the knowledge base

memory

Access and manage long-term memory


MCP Capabilities

  • AI-assisted document search

  • Agentic RAG workflow execution

  • Long-term memory retrieval

  • Knowledge base interaction

  • Tool-based AI integration

  • Standardized MCP interface


MCP Architecture

AI Client
(Claude Desktop / Cursor / VS Code)
            โ”‚
            โ–ผ
      MCP Server
            โ”‚
            โ–ผ
    SecondBrain Tools
            โ”‚
            โ–ผ
 LangGraph Workflow
            โ”‚
     โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
     โ–ผ               โ–ผ
 MongoDB         Qdrant
     โ”‚               โ”‚
     โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
            โ–ผ
      Google Gemini

The MCP server allows external AI systems to securely access the capabilities of SecondBrain without directly interacting with the internal application components.

๐Ÿ–ผ๏ธ Screenshots

System Architecture


Engineering Workflow


FastAPI Swagger UI


๐Ÿšง Future Improvements

SecondBrain is designed with extensibility in mind. Some planned enhancements include:

  • ๐ŸŒ Web-based user interface

  • ๐Ÿ” User authentication and role-based access control

  • ๐Ÿ“š Support for additional document formats (DOCX, Markdown, HTML, TXT)

  • โ˜๏ธ Cloud storage integration (AWS S3, Google Cloud Storage)

  • ๐Ÿ” Hybrid Search (Semantic + Keyword Search)

  • โšก Streaming responses using Server-Sent Events (SSE)

  • ๐Ÿ“ˆ Observability with Prometheus & Grafana

  • ๐Ÿ“Š Monitoring and analytics dashboard

  • ๐Ÿงฉ Plugin architecture for custom tools

  • ๐Ÿค– Multi-agent collaboration workflows

  • ๐Ÿ—ฃ Voice input and speech synthesis

  • ๐ŸŒ Multi-language document support

  • ๐Ÿ“ฑ Web and mobile client applications

  • ๐Ÿš€ Kubernetes deployment for horizontal scaling


๐Ÿค Contributing

Contributions are welcome!

If you'd like to improve SecondBrain, please follow these steps:

  1. Fork the repository

  2. Create a feature branch

git checkout -b feature/your-feature
  1. Commit your changes

git commit -m "Add new feature"
  1. Push to your branch

git push origin feature/your-feature
  1. Open a Pull Request

Please ensure your code follows the existing project structure and coding style.


๐Ÿ“„ License

This project is licensed under the MIT License.

See the LICENSE file for more information.


๐Ÿ™ Acknowledgements

This project was built using several outstanding open-source technologies.

Special thanks to the teams behind:

  • LangChain

  • LangGraph

  • Google Gemini

  • FastAPI

  • Qdrant

  • MongoDB

  • Redis

  • Mem0

  • Docker

  • Model Context Protocol (MCP)

Their work makes projects like SecondBrain possible.


โญ If you found this project interesting, consider giving it a star!

Built with โค๏ธ by Mandar Gavali

A
license - permissive license
-
quality - not tested
B
maintenance

Maintenance

โ€“Maintainers
โ€“Response time
โ€“Release cycle
1Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/MandarGavali/SecondBrain'

If you have feedback or need assistance with the MCP directory API, please join our Discord server