Model Context Protocol Server

Integrations

  • Integrates with python-dotenv for managing environment variables, facilitating the secure storage and access of API keys and configuration settings.

  • Provides repository access and version control integration through GitHub, allowing users to clone and contribute to the project.

  • Integrates with LangChain to provide agent-based search capabilities and facilitate the creation of agentic AI systems that can search the web and retrieve information.

Search Engine with RAG and MCP

A powerful search engine that combines LangChain, Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Ollama to create an agentic AI system capable of searching the web, retrieving information, and providing relevant answers.

Features

  • Web search capabilities using the Exa API
  • Web content retrieval using FireCrawl
  • RAG (Retrieval-Augmented Generation) for more relevant information extraction
  • MCP (Model Context Protocol) server for standardized tool invocation
  • Support for both local LLMs via Ollama and cloud-based LLMs via OpenAI
  • Flexible architecture supporting direct search, agent-based search, or server mode
  • Comprehensive error handling and graceful fallbacks
  • Python 3.13+ with type hints
  • Asynchronous processing for efficient web operations

Architecture

This project integrates several key components:

  1. Search Module: Uses Exa API to search the web and FireCrawl to retrieve content
  2. RAG Module: Embeds documents, chunks them, and stores them in a FAISS vector store
  3. MCP Server: Provides a standardized protocol for tool invocation
  4. Agent: LangChain-based agent that uses the search and RAG capabilities

Project Structure

search-engine-with-rag-and-mcp/ ├── LICENSE # MIT License ├── README.md # Project documentation ├── data/ # Data directories ├── docs/ # Documentation │ └── env_template.md # Environment variables documentation ├── logs/ # Log files directory (auto-created) ├── src/ # Main package (source code) │ ├── __init__.py │ ├── core/ # Core functionality │ │ ├── __init__.py │ │ ├── main.py # Main entry point │ │ ├── search.py # Web search module │ │ ├── rag.py # RAG implementation │ │ ├── agent.py # LangChain agent │ │ └── mcp_server.py # MCP server implementation │ └── utils/ # Utility modules │ ├── __init__.py │ ├── env.py # Environment variable loading │ └── logger.py # Logging configuration ├── pyproject.toml # Poetry configuration ├── requirements.txt # Project dependencies └── tests/ # Test directory

Getting Started

Prerequisites

  • Python 3.13+
  • Poetry (optional, for development)
  • API keys for Exa and FireCrawl
  • (Optional) Ollama installed locally
  • (Optional) OpenAI API key

Installation

  1. Clone the repository
git clone https://github.com/yourusername/search-engine-with-rag-and-mcp.git cd search-engine-with-rag-and-mcp
  1. Install dependencies
# Using pip pip install -r requirements.txt # Or using poetry poetry install
  1. Create a .env file (use docs/env_template.md as a reference)

Usage

The application has three main modes of operation:

1. Direct Search Mode (Default)
# Using pip python -m src.core.main "your search query" # Or using poetry poetry run python -m src.core.main "your search query"
2. Agent Mode
python -m src.core.main --agent "your search query"
3. MCP Server Mode
python -m src.core.main --server

You can also specify custom host and port:

python -m src.core.main --server --host 0.0.0.0 --port 8080

Using Ollama (Optional)

To use Ollama for local embeddings and LLM capabilities:

  1. Install Ollama: https://ollama.ai/
  2. Pull a model:
ollama pull mistral:latest
  1. Set the appropriate environment variables in your .env file:
OLLAMA_BASE_URL=http://localhost:11434 OLLAMA_MODEL=mistral:latest

Development

This project follows these best practices:

  • Code formatting: Black and isort for consistent code style
  • Type checking: mypy for static type checking
  • Linting: flake8 for code quality
  • Testing: pytest for unit and integration tests
  • Environment Management: python-dotenv for managing environment variables
  • Logging: Structured logging to both console and file

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

-
security - not tested
A
license - permissive license
-
quality - not tested

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

Provides a standardized protocol for tool invocation, enabling an AI system to search the web, retrieve information, and provide relevant answers through integration with LangChain, RAG, and Ollama.

  1. Features
    1. Architecture
      1. Project Structure
        1. Getting Started
          1. Prerequisites
          2. Installation
          3. Usage
          4. Using Ollama (Optional)
        2. Development
          1. License
            1. Acknowledgements

              Related MCP Servers

              • A
                security
                A
                license
                A
                quality
                This server enables AI systems to integrate with Tavily's search and data extraction tools, providing real-time web information access and domain-specific searches.
                Last updated -
                2
                4,691
                164
                JavaScript
                MIT License
                • Apple
                • Linux
              • A
                security
                A
                license
                A
                quality
                A production-ready Model Context Protocol server that enables language models to leverage AI-powered web scraping capabilities, offering tools for transforming webpages to markdown, extracting structured data, and executing AI-powered web searches.
                Last updated -
                3
                16
                Python
                MIT License
                • Apple
              • -
                security
                F
                license
                -
                quality
                Enables AI assistants to interact with Metabase, providing access to dashboards, questions, databases, and tools for executing queries and viewing data through natural language.
                Last updated -
                JavaScript
                • Apple
              • -
                security
                A
                license
                -
                quality
                This server implementation allows AI assistants to interact with Asana's API, enabling users to manage tasks, projects, workspaces, and comments through natural language requests.
                Last updated -
                65
                TypeScript
                MIT License

              View all related MCP servers

              ID: iqudnbc9bd