Which integrations are available for this server?

Utilizes AWS Bedrock to generate vector embeddings for building the server's local searchable knowledge base. Integrates with Google Gemini models to provide both text embedding generation and AI chat completions. Processes Markdown content to seed a local vector store, treating header-delimited sections as searchable documents. Connects to OpenAI's API for generating text embeddings and providing chat completions to power the RAG pipeline.

How do I use mcp-ragchat?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@mcp-ragchat set up AI chat for mybakery.com using the menu and FAQ content" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

mcp-ragchat

by gogabrielordonez

Overview Schema Related Servers Score Discussions

TypeScript

Local

Tell Claude Code "add AI chat to mysite.com" and it will crawl your content, build a local vector store, spin up a chat server, and hand you an embed snippet. No cloud infra. No database. Just one API key.

Demo

You:    "Set up AI chat for mybakery.com using this FAQ and menu content"

Claude: I'll set up RAG-powered chat for mybakery.com.

        [1/4] ragchat_setup
              Splitting content into sections...
              Generating embeddings for 12 documents...
              Domain "mybakery.com" configured with 12 documents.

        [2/4] ragchat_test
              Query: "What are your hours?"
              Reply: "We're open Monday-Saturday 7am-6pm, and Sunday 8am-2pm."
              RAG Sources: mybakery.com-3 (0.91), mybakery.com-7 (0.84)
              Latency: 340ms

        [3/4] ragchat_serve
              Chat server running at http://localhost:3456
              POST /chat ready.

        [4/4] ragchat_widget
              Generated embed code. Paste this into your HTML:

              <script src="http://localhost:3456/widget.js"></script>

You:    Done. Live chat on my site in under 60 seconds.

Related MCP server: Crawl4AI MCP Server

Quick Start

1. Clone and build

git clone https://github.com/gogabrielordonez/mcp-ragchat
cd mcp-ragchat
npm install && npm run build

2. Configure Claude Code (~/.claude/mcp.json)

{
  "mcpServers": {
    "ragchat": {
      "command": "node",
      "args": ["/absolute/path/to/mcp-ragchat/dist/mcp-server.js"],
      "env": {
        "OPENAI_API_KEY": "sk-..."
      }
    }
  }
}

3. Use it

Open Claude Code and say:

"Add AI chat to mysite.com. Here's the content: [paste your markdown]"

Claude handles the rest.

Tools

Tool	What it does
`ragchat_setup`	Seed a knowledge base from markdown content. Each `##` section becomes a searchable document with vector embeddings.
`ragchat_test`	Send a test message to verify RAG retrieval and LLM response quality.
`ragchat_serve`	Start a local HTTP chat server with CORS and input sanitization.
`ragchat_widget`	Generate a self-contained `<script>` tag -- a floating chat bubble, no dependencies.
`ragchat_status`	List all configured domains with document counts and config details.

How It Works

                        +------------------+
                        |  Your Markdown   |
                        +--------+---------+
                                 |
                          ragchat_setup
                                 |
                    +------------v-------------+
                    |   Local Vector Store      |
                    |   ~/.mcp-ragchat/domains/ |
                    |     vectors.json          |
                    |     config.json           |
                    +------------+-------------+
                                 |
          User Question          |
               |                 |
        +------v------+  +------v------+
        |  Embedding  |  |  Cosine     |
        |  Provider   +->+  Similarity |
        +-------------+  +------+------+
                                |
                         Top 3 chunks
                                |
                    +----------v-----------+
                    |  System Prompt       |
                    |  + RAG Context       |
                    |  + User Message      |
                    +----------+-----------+
                               |
                    +----------v-----------+
                    |     LLM Provider     |
                    +----------+-----------+
                               |
                            Reply

Everything runs locally. No cloud infrastructure. Bring your own API key.

Supported Providers

LLM (chat completions)

Provider	Env Var	Default Model
OpenAI	`OPENAI_API_KEY`	`gpt-4o-mini`
Anthropic	`ANTHROPIC_API_KEY`	`claude-sonnet-4-5-20250929`
Google Gemini	`GEMINI_API_KEY`	`gemini-2.0-flash`

Embeddings (vector search)

Provider	Env Var	Default Model
OpenAI	`OPENAI_API_KEY`	`text-embedding-3-small`
Google Gemini	`GEMINI_API_KEY`	`text-embedding-004`
AWS Bedrock	`AWS_REGION` + IAM	`amazon.titan-embed-text-v2:0`

Override defaults with LLM_MODEL and EMBEDDING_MODEL environment variables.

Architecture

~/.mcp-ragchat/domains/
  mysite.com/
    config.json     -- system prompt, settings
    vectors.json    -- documents + embedding vectors

Vector store -- Local JSON files with cosine similarity search. Zero external dependencies.
Chat server -- Node.js HTTP server with CORS and input sanitization.
Widget -- Self-contained <script> tag. No frameworks, no build step.

Contributing

Issues and pull requests are welcome.

Found a bug? Open an issue
Want to add a feature? Fork, branch, PR.
Questions? Start a discussion

Star History

Enterprise

Need multi-tenancy, security guardrails, audit trails, and managed infrastructure? Check out Supersonic -- the enterprise AI platform built on the same RAG pipeline.

MIT License -- Gabriel Ordonez

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

1Releases (12mo)

Commit activity

Resources

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Related MCP Servers

MCP Web Docs
Web Scraping RAG Systems Documentation Access
cosmocoder
A
license
-
quality
A
maintenance
A self-hosted MCP server that crawls, indexes, and searches documentation from any website locally, including private sites requiring authentication. It provides hybrid search capabilities and local embedding generation to maintain privacy while keeping AI assistant knowledge up to date.
Last updated 2026-08-01
486
1
MIT
Crawl4AI MCP Server
Web Scraping Browser Automation Search
stevenzxs
F
license
-
quality
D
maintenance
A locally-hosted MCP server that provides AI assistants with advanced web crawling capabilities, including structured data extraction, deep site crawling, and page screenshots. It enables users to convert single or multiple URLs into clean Markdown content for processing by LLMs without requiring external API keys for basic features.
Last updated 2026-02-26
Crawl4AI RAG MCP Server
RAG Systems Web Scraping Vector Databases
Anshumaan031
A
license
-
quality
D
maintenance
An MCP server that integrates Crawl4AI with Supabase to enable AI agents to crawl websites, store content in a vector database, and perform RAG queries.
Last updated 2025-06-21
8
MIT
MCP Local Context
RAG Systems Search File Systems
SteedMonteiro
A
license
-
quality
D
maintenance
A simple MCP server for local documentation with RAG capabilities, enabling AI assistants to access and search local documents.
Last updated 2025-07-22
MIT

View all related MCP servers

Related MCP Connectors

Darwin RAG
Local-first RAG engine with MCP server for AI agent integration.
driflyte-mcp-server
Driflyte MCP server which lets AI assistants query topic-specific knowledge from web and GitHub.
anythingmcp
Self-hosted MCP gateway: turn any API, database or MCP server into AI connectors — no code.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/gogabrielordonez/mcp-ragchat'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Demo

Quick Start

Tools

How It Works

Supported Providers

LLM (chat completions)

Embeddings (vector search)

Architecture

Contributing

Star History

Enterprise

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

MCP Web Docs

Crawl4AI MCP Server

Crawl4AI RAG MCP Server

MCP Local Context

Related MCP Connectors

Latest Blog Posts

MCP directory API