RAGmonsters Custom PostgreSQL MCP Server

Integrations

  • Clever Cloud: mentioned as a hosting option, allowing deployment of the MCP server on Clever Cloud infrastructure

  • GitHub: hosts the repository for the RAGmonsters dataset, which serves as the foundation for the MCP server's functionality

  • LangChain.js: used for LLM interactions, enabling structured communication between the custom MCP server and language models

Custom PostgreSQL MCP Server for RAGmonsters

Overview

This repository demonstrates a more advanced approach to integrating Large Language Models (LLMs) with databases using the Model Context Protocol (MCP). While generic MCP PostgreSQL servers allow LLMs to explore databases through raw SQL queries, this project takes a different approach by creating a custom MCP server that provides a domain-specific API tailored to the application's needs.

This implementation uses FastMCP, a framework for building Model Context Protocol servers, to define and serve the tools that the LLM calls.

This project uses the RAGmonsters dataset as its foundation. RAGmonsters is an open-source project that provides a rich, fictional dataset of monsters with various attributes, abilities, and relationships - specifically designed for demonstrating and testing Retrieval-Augmented Generation (RAG) systems.

The Problem with Generic MCP Database Access

Generic MCP PostgreSQL servers provide LLMs with a query tool that allows them to:

  • Explore database schemas
  • Formulate SQL queries based on natural language questions
  • Execute those queries against the database

While this approach works, it has several limitations for real-world applications:

  • Cognitive Load: The LLM must understand the entire database schema
  • Inefficiency: Multiple SQL queries are often needed to answer a single question
  • Security Concerns: Raw SQL access lets the LLM run arbitrary statements, so protection against injected or destructive queries rests on careful prompt engineering and database permissions
  • Performance: Complex queries may be inefficient if the LLM doesn't understand the database's indexing strategy
  • Domain Knowledge Gap: The LLM lacks understanding of business rules and domain-specific constraints

About RAGmonsters Dataset

RAGmonsters is an open dataset specifically designed for testing and demonstrating Retrieval-Augmented Generation (RAG) systems. It contains information about fictional monsters with rich attributes, abilities, and relationships - making it perfect for natural language querying demonstrations.

The PostgreSQL version of RAGmonsters provides a well-structured relational database with multiple tables and relationships, including:

  • Monsters with various attributes (attack power, defense, health, etc.)
  • Abilities that monsters can possess
  • Elements (fire, water, earth, etc.) with complex relationships
  • Habitats where monsters can be found
  • Evolution chains and relationships between monsters

This rich, interconnected dataset is ideal for demonstrating the power of domain-specific APIs versus generic SQL access.

Our Solution: Domain-Specific MCP API

This project demonstrates how to build a custom MCP server that provides a higher-level, domain-specific API for the RAGmonsters dataset. Instead of exposing raw SQL capabilities, our MCP server offers purpose-built functions that:

  1. Abstract Database Complexity: Hide the underlying schema and SQL details
  2. Provide Domain-Specific Operations: Offer functions that align with business concepts
  3. Optimize for Common Queries: Implement efficient query patterns for frequently asked questions
  4. Enforce Business Rules: Embed domain-specific logic and constraints
  5. Improve Security: Limit the attack surface by removing direct SQL access
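Points 1 and 5 can be pictured as a tool handler that accepts structured arguments and builds a parameterized query itself, so no SQL text written by the LLM reaches the database. The sketch below is an illustration under assumptions, not the repository's code: the `monsters` table, the column names, and the `buildMonsterQuery` helper are all hypothetical. The `{ text, values }` return shape matches what node-postgres accepts as a query config, assuming that driver is used.

```javascript
// Hypothetical whitelist: maps API-level sort names to assumed column names.
const SORTABLE_COLUMNS = new Map([
  ["name", "name"],
  ["attackPower", "attack_power"],
  ["defense", "defense"],
]);

// Builds a parameterized query ({ text, values }) from structured arguments.
// Caller-supplied values only travel in `values`, never in the SQL text.
function buildMonsterQuery({ habitat, rarity, sortBy = "name", limit = 10 } = {}) {
  const conditions = [];
  const values = [];

  if (habitat !== undefined) {
    values.push(habitat);
    conditions.push(`habitat = $${values.length}`);
  }
  if (rarity !== undefined) {
    values.push(rarity);
    conditions.push(`rarity = $${values.length}`);
  }

  // Identifiers cannot be bound as parameters, so they are checked against the whitelist.
  const column = SORTABLE_COLUMNS.get(sortBy);
  if (!column) {
    throw new Error(`Unsupported sort field: ${sortBy}`);
  }

  // Clamp the limit to 1..100 so one call cannot request an unbounded result set.
  const safeLimit = Math.min(Math.max(Number.parseInt(limit, 10) || 10, 1), 100);
  values.push(safeLimit);

  const where = conditions.length ? ` WHERE ${conditions.join(" AND ")}` : "";
  const text = `SELECT * FROM monsters${where} ORDER BY ${column} LIMIT $${values.length}`;
  return { text, values };
}

console.log(buildMonsterQuery({ habitat: "Volcano", sortBy: "attackPower", limit: 3 }));
```

Because the sort field is looked up in a fixed map and everything else is bound as a parameter, a malformed or malicious argument results in an error rather than altered SQL.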

Example: Domain-Specific API vs. Generic SQL

Generic MCP PostgreSQL Approach:

User: "What are the top 3 monsters with the highest attack power that are vulnerable to fire?"
LLM: (Must understand schema, joins, and SQL syntax)
  1. First query to understand the schema
  2. Second query to find monsters with attack power
  3. Third query to find vulnerabilities
  4. Final query to join and filter results

Our Custom MCP Server Approach:

User: "What are the top 3 monsters with the highest attack power that are vulnerable to fire?"
LLM: (Uses our domain-specific API)
  1. Single call: getMonsters({ vulnerableTo: "fire", sortBy: "attackPower", limit: 3 })
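On the wire, that single call travels as an MCP `tools/call` request. The sketch below shows the JSON-RPC message a client would send, reusing the argument names from the example above. `buildToolCallRequest` is a hypothetical helper written for this illustration; MCP client libraries normally build this message for you.

```javascript
// Hypothetical helper: wraps a tool name and its arguments in an MCP
// `tools/call` request, which is a JSON-RPC 2.0 message.
function buildToolCallRequest(id, name, args) {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = buildToolCallRequest(1, "getMonsters", {
  vulnerableTo: "fire",
  sortBy: "attackPower",
  limit: 3,
});

console.log(JSON.stringify(request, null, 2));
```

The LLM only has to choose a tool name and fill in a small argument object; the schema, joins, and SQL stay on the server side.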

Project Structure

├── .env.example                # Example environment variables
├── package.json                # Node.js project configuration
├── README.md                   # This documentation
├── img/                        # Images for documentation
├── scripts/
│   ├── testMcpServer.js        # Test script for the MCP server
│   └── testLogger.js           # Logger for test script
├── src/
│   ├── index.js                # Main application server
│   ├── mcp-server/             # Custom MCP server implementation with FastMCP
│   │   ├── index.js            # Server entry point
│   │   ├── tools/              # Domain-specific tools
│   │   │   ├── index.js        # Tool registration
│   │   │   └── monsters.js     # Monster-related operations
│   │   └── utils/              # Helper utilities
│   │       └── logger.js       # Logging functionality
│   ├── llm.js                  # LangChain integration for LLM
│   └── public/                 # Web interface files
│       └── index.html          # Chat interface
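The split between `tools/index.js` (registration) and `tools/monsters.js` (operations) suggests a pattern along the following lines. This is only a sketch: the actual file contents are not shown in this README, and `ToolRegistry` is a stand-in for the FastMCP server object so that the example runs without any dependencies. The tool object shape (`name`, `description`, `execute`) is an assumption modelled on how MCP tools are commonly declared.

```javascript
// Stand-in for the MCP server object, so the sketch runs without dependencies.
class ToolRegistry {
  constructor() {
    this.tools = new Map();
  }

  addTool(tool) {
    if (this.tools.has(tool.name)) {
      throw new Error(`Duplicate tool: ${tool.name}`);
    }
    this.tools.set(tool.name, tool);
  }

  async call(name, args) {
    const tool = this.tools.get(name);
    if (!tool) {
      throw new Error(`Unknown tool: ${name}`);
    }
    return tool.execute(args);
  }
}

// What a module such as tools/monsters.js might export (hypothetical shape).
const monsterTools = [
  {
    name: "add",
    description: "Add two numbers (connectivity test)",
    execute: async ({ a, b }) => String(a + b),
  },
];

// What tools/index.js might do: register every exported tool on the server.
function registerTools(server, tools) {
  for (const tool of tools) {
    server.addTool(tool);
  }
  return server;
}

const registry = registerTools(new ToolRegistry(), monsterTools);
registry.call("add", { a: 2, b: 3 }).then((result) => console.log(result)); // prints "5"
```

Keeping registration separate from the operations means a new domain tool only requires adding an entry to the exported list.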

Features

  • Custom MCP Server with FastMCP: High-performance domain-specific API for RAGmonsters data
  • Optimized Queries: Pre-built efficient database operations
  • Business Logic Layer: Domain rules and constraints embedded in the API
  • Structured Response Format: Consistent JSON responses for LLM consumption
  • Comprehensive Logging: Detailed logging for debugging and monitoring
  • Test Suite: Scripts to verify server functionality
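As an illustration of the structured response format, the sketch below wraps both results and failures in one predictable envelope. The outer `content` array of `{ type: "text", text }` items and the `isError` flag follow the MCP tool result format; the inner `{ ok, data }` and `{ ok, error }` JSON layout and the helper names are assumptions made for this example, not the project's documented format.

```javascript
// Wraps a successful payload in an MCP tool result whose text content is JSON.
function toToolResult(data) {
  return {
    content: [{ type: "text", text: JSON.stringify({ ok: true, data }) }],
  };
}

// Wraps a failure in the same envelope, flagged with `isError` so the client
// can tell the LLM that the call failed.
function toToolError(error) {
  return {
    isError: true,
    content: [{ type: "text", text: JSON.stringify({ ok: false, error: error.message }) }],
  };
}

// Hypothetical record, used only to show the output shape.
console.log(toToolResult([{ id: 1, name: "Example Monster" }]));
console.log(toToolError(new Error("Monster not found")));
```

With a single envelope, the LLM can always parse the text content the same way and branch on `ok`.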

Planned Features

  • LangChain.js Integration: For LLM interactions
  • Web Interface: Simple chat interface to interact with the data
  • Deployment on Clever Cloud: Easy deployment instructions

Benefits of This Approach

  1. Improved Performance: Optimized queries and caching strategies
  2. Better User Experience: More accurate and faster responses
  3. Reduced Token Usage: LLM doesn't need to process complex SQL or schema information
  4. Enhanced Security: No direct SQL access means reduced risk of injection attacks
  5. Maintainability: Changes to the database schema stay behind the API, so the tool definitions and prompts the LLM sees don't need to change
  6. Scalability: Can handle larger and more complex databases

Getting Started

Installation

  1. Clone this repository
  2. Install dependencies: npm install
  3. Copy .env.example to .env and configure your PostgreSQL connection string
  4. Run the test script: node scripts/testMcpServer.js
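Before running the test script in step 4, it can help to check that the connection string from step 3 is well formed. The sketch below parses a PostgreSQL URI with the standard `URL` class and fails with a readable message. The helper name, the example URI, and the 5432 default port fallback are assumptions for illustration; check `.env.example` for the variable name the project actually reads.

```javascript
// Parses a PostgreSQL connection string and fails fast with a readable message.
function parseConnectionString(uri) {
  if (!uri) {
    throw new Error("Missing PostgreSQL connection string; check your .env file");
  }

  const url = new URL(uri); // throws a TypeError on malformed input
  if (!["postgres:", "postgresql:"].includes(url.protocol)) {
    throw new Error(`Expected a postgres:// or postgresql:// URI, got ${url.protocol}//`);
  }

  return {
    host: url.hostname,
    port: Number(url.port || 5432), // 5432 is PostgreSQL's default port
    database: url.pathname.replace(/^\//, ""),
    user: decodeURIComponent(url.username),
  };
}

// Example URI with made-up credentials; a real value would come from .env.
console.log(parseConnectionString("postgresql://monster_user:secret@localhost:5432/ragmonsters"));
```

The returned object deliberately omits the password so it is safe to print while debugging.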

Available Tools

The MCP server provides the following tools:

  1. getMonsters - Get a list of monsters with optional filtering, sorting, and pagination
    • Parameters: filters (category, habitat, rarity), sort (field, direction), limit, offset
  2. getMonsterById - Get detailed information about a specific monster by ID
    • Parameters: monsterId
  3. add - Simple utility to add two numbers (for testing)
    • Parameters: a, b
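To make the getMonsters parameter list concrete, the sketch below normalizes an argument object with the documented `filters`, `sort`, `limit`, and `offset` fields. The defaults (sort by `name` ascending, limit 10, maximum 100) and the `normalizeGetMonstersArgs` and `clampInteger` helpers are assumptions chosen for illustration; the server's real defaults and validation rules are not stated in this README.

```javascript
const ALLOWED_FILTERS = ["category", "habitat", "rarity"];
const ALLOWED_DIRECTIONS = ["asc", "desc"];

// Returns `value` clamped to [min, max], or `fallback` when it is not an integer.
function clampInteger(value, min, max, fallback) {
  const n = Number.isInteger(value) ? value : fallback;
  return Math.min(Math.max(n, min), max);
}

// Normalizes getMonsters arguments: drops unknown filters, validates the sort
// direction, and bounds the pagination values.
function normalizeGetMonstersArgs(args = {}) {
  const filters = {};
  for (const key of ALLOWED_FILTERS) {
    if (args.filters?.[key] !== undefined) {
      filters[key] = String(args.filters[key]);
    }
  }

  const direction = String(args.sort?.direction ?? "asc").toLowerCase();
  if (!ALLOWED_DIRECTIONS.includes(direction)) {
    throw new Error(`Invalid sort direction: ${direction}`);
  }

  return {
    filters,
    sort: { field: args.sort?.field ?? "name", direction },
    limit: clampInteger(args.limit, 1, 100, 10),
    offset: clampInteger(args.offset, 0, Number.MAX_SAFE_INTEGER, 0),
  };
}

console.log(
  normalizeGetMonstersArgs({
    filters: { habitat: "Volcano", color: "red" }, // `color` is not a documented filter
    sort: { field: "name", direction: "DESC" },
    limit: 500,
  })
);
```

Normalizing at the tool boundary keeps the query code simple, since it can rely on every field being present and within range.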

Prerequisites

  • Node.js 23 or later
  • PostgreSQL database with RAGmonsters data
  • Access to an LLM API (e.g., OpenAI)
  • FastMCP package (included in dependencies)

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments


A domain-specific MCP server that provides optimized API access to the RAGmonsters fictional monster dataset, enabling more efficient and secure interactions compared to generic SQL queries.
