RAG (Retrieval-Augmented Generation) system that can store papers and web pages

Search for:

RAG (Retrieval-Augmented Generation) system that can store papers and web pages

View all MCP Servers

Why this server?
This server is an excellent fit as it provides RAG capabilities by ingesting various document formats and also processes URLs, directly addressing the user's need to store both papers (documents) and web pages (URLs).
MCP RAG with ChromaDB
RAG Systems Vector Databases Search
CyprianFusi
A
license
-
quality
D
maintenance
Provides retrieval-augmented generation (RAG) capabilities by ingesting various document formats into a persistent ChromaDB vector store. It enables semantic search and retrieval using either OpenAI or Ollama embeddings for processing local files, directories, and URLs.
Last updated 2025-12-07
1
MIT
Why this server?
This server directly supports storing and searching 'documents' and 'web pages' with RAG capabilities, explicitly mentioning 'web scraping' for content acquisition, making it suitable for both parts of the user's request.
AI Ops Hub
RAG Systems File Systems Note Taking
Galiusbro
F
license
B
quality
D
maintenance
An MCP server with RAG capabilities that provides AI clients secure access to local notes, documents, web pages, and task management tools for developer operational tasks. Features file operations, web scraping with content cleaning, and personal knowledge corpus search functionality.
Last updated 2025-10-10
7
Why this server?
This server enables RAG by indexing 'documents from URLs' and 'local directories,' covering both web pages and locally stored documents/papers with flexible embedding options.
py-mcp-qdrant-rag
RAG Systems Vector Databases Web Scraping
amornpan
F
license
-
quality
F
maintenance
Enables semantic search and retrieval-augmented generation (RAG) using Qdrant vector database. Supports indexing documents from URLs and local directories, with flexible embedding options using Ollama or OpenAI.
Last updated 2025-07-04
2
Why this server?
This server integrates RAG with the Model Control Protocol to provide both 'web search capabilities' and 'document analysis,' aligning well with the need to handle both web content and papers.
RAG-MCP Server
RAG Systems Search
plaban1981
A
license
-
quality
D
maintenance
A server that integrates Retrieval-Augmented Generation (RAG) with the Model Control Protocol (MCP) to provide web search capabilities and document analysis for AI assistants.
Last updated 2025-06-01
4
Apache 2.0
Why this server?
Similar to its counterpart, this server provides 'advanced web crawling and RAG capabilities,' enabling the scraping of websites to leverage that knowledge, making it ideal for web pages.
Crawl4AI RAG MCP Server
Chillbruhhh
A
license
-
quality
C
maintenance
Provides AI agents and coding assistants with advanced web crawling and RAG capabilities, allowing them to scrape websites and leverage that knowledge through various retrieval strategies.
Last updated 2025-07-15
2
MIT
Why this server?
This server integrates 'web scraping' to build 'searchable knowledge bases from web content,' which directly fulfills the requirement to store and query web pages using RAG.
BerryRAG
RAG Systems Web Scraping Vector Databases
berrydev-ai
F
license
-
quality
D
maintenance
A local vector database RAG system that integrates with Playwright MCP for web scraping, enabling users to build searchable knowledge bases from web content with multiple embedding providers and Claude-optimized context formatting.
Last updated 2025-06-23
Why this server?
This server focuses on 'Web crawling and RAG implementation' to 'scrape websites and perform semantic search over the crawled content,' making it a strong match for handling web pages.
Crawl4AI RAG MCP Server
Web Scraping RAG Systems
coleam00
A
license
-
quality
F
maintenance
Web crawling and RAG implementation that enables AI agents to scrape websites and perform semantic search over the crawled content, storing everything in Supabase for persistent knowledge retrieval.
Last updated 2025-07-25
2,215
MIT
Why this server?
This server is a great fit for storing papers, as it explicitly mentions 'indexing and searching through documents (Markdown, text, PowerPoint, PDF)' using vector embeddings for RAG.
MCP RAG Server
RAG Systems Vector Databases Search
karaage0703
A
license
-
quality
D
maintenance
Enables retrieval-augmented generation (RAG) by indexing and searching through documents (Markdown, text, PowerPoint, PDF) using vector embeddings with multilingual-e5-large model and PostgreSQL pgvector. Supports contextual chunk retrieval and incremental indexing for efficient document management.
Last updated 2025-10-30
71
MIT
Why this server?
This local RAG server specifically enables 'document indexing and sentence window retrieval across multiple file formats like PDF, MD, and DOCX,' making it highly suitable for managing and querying papers.
MCP-RAGNAR
RAG Systems Vector Databases File Systems
bixentemal
A
license
-
quality
D
maintenance
A local RAG server that enables document indexing and sentence window retrieval across multiple file formats like PDF, MD, and DOCX. It supports both local Hugging Face models and OpenAI embeddings for efficient context-aware querying through the Model Context Protocol.
Last updated 2025-06-25
GPL 3.0