Skip to main content
Glama

Aurora-MCP

by ndaniel

🌌 Aurora-MCP

Model Context Protocol (MCP) server providing access to datasets of natural and synthetic small molecules, with a focus on identifying potential mitochondrial Complex I inhibitors that may occur in plant species.


πŸ” Overview

Aurora-MCP is a Model Context Protocol (MCP) server and data integration layer that connects natural-product, biodiversity, and mitochondrial-inhibitor datasets. It enables LLMs and users to query relationships between plant species, small molecules, and mitochondrial Complex I inhibitionβ€”bridging COCONUT, Laji.fi, GBIF, and AI-derived PubMed data through structured joins and metadata schemas.

Aurora-MCP is a lightweight MCP server + Hugging Face Space designed to bridge two complementary knowledge sources:

  1. 🌿 Aurora β€” natural-product and plant biodiversity data, mapping compounds to genera and species found in Nordic ecosystems.

  2. 🧬 Aurora-Mito-ETL β€” curated PubMed-derived corpus of small-molecule inhibitors of mitochondrial Complex I (NADH dehydrogenase).

Together they form a conversational dataset where ChatGPT (or any MCP-compatible LLM) can reason over structured biological data, ask questions, and perform targeted searches on small compounds, plants, and mechanistic links between them.


🧠 Concept

Goal: allow scientific dialogue with an LLM grounded in domain data, for example:

β€œShow me plant-derived compounds that inhibit mitochondrial Complex I.”
β€œFind PubMed evidence for arctigenin as a Complex I inhibitor.”
β€œList Nordic plants whose metabolites overlap with known ETC inhibitors.”

Aurora-MCP turns your static text/TSV data into an interactive semantic backend, exposing programmatic tools for searching, linking, and reasoning.


🧩 Key Features

Capability

Description

πŸ—‚οΈ

File introspection

List and read data files under

./data

(merged from Aurora + Aurora-Mito-ETL).

πŸ”Ž

Regex & keyword search

Query files using regex (e.g. *inhibit.

complex I

,

NADH oxidoreductase

).

🧬

PubMed integration

Auto-generate PubMed URLs from PMIDs for fast evidence lookup.

🧾

MeSH & compound tag extraction

Parse MeSH IDs, chemical codes, or compound names from TSV files.

🌿

Plant–compound linkage

Bridge plant genera/species (Aurora) with small-molecule inhibitors (Aurora-Mito-ETL).

🧩

Hugging Face Space

Streamlit demo UI to browse data and test queries visually.

πŸ€–

MCP tools for ChatGPT

Connect directly to ChatGPT via MCP for grounded conversational access.


🧱 Architecture

aurora-mcp/ β”œβ”€ data/ # Unified domain data (from Aurora + Aurora-Mito-ETL) β”‚ β”œβ”€ plants/ # e.g. plant_compounds.tsv, gbif_species.tsv β”‚ β”œβ”€ inhibitors/ # e.g. pubmed_gpt.txt, mito_complex_I.tsv β”‚ └─ reference/ β”‚ β”œβ”€ mcp_server/ # Core MCP server implementation β”‚ β”œβ”€ server.py # stdio entrypoint (for ChatGPT MCP) β”‚ β”œβ”€ http.py # optional FastAPI/uvicorn HTTP endpoint β”‚ └─ tools/ β”‚ β”œβ”€ files.py # list_files, read_text β”‚ β”œβ”€ search.py # regex grep β”‚ └─ bio.py # pubmed_url, mesh_tags, unique_values β”‚ β”œβ”€ app/ # Streamlit demo for Hugging Face Space β”‚ └─ app.py β”‚ β”œβ”€ configs/mcp.json # Example MCP manifest β”œβ”€ requirements.txt β”œβ”€ Dockerfile β”œβ”€ huggingface.yaml └─ README.md

πŸš€ Quick Start (local)

git clone https://github.com/ndaniel/aurora-mcp.git cd aurora-mcp # Optional: add Aurora + Aurora-Mito-ETL data under ./data/ mkdir -p data && cp -r ../aurora/data/* data/ mkdir -p data/inhibitors && cp -r ../aurora-mito-etl/data/* data/inhibitors/ python -m venv .venv source .venv/bin/activate pip install -r requirements.txt # Run the MCP server (stdio mode) python -m mcp_server.server

βž• Add to ChatGPT (MCP)

  1. In ChatGPT, open Settings β†’ Model Context Protocol β†’ Add server

  2. Command: python -m mcp_server.server

  3. Working directory: this repository root (aurora-mcp)

  4. Start chatting about compounds, plants, or PubMed evidence β€” all grounded in your data!


πŸ’» Hugging Face Space

Option 1 β€” Python Space (recommended):

  • Push this repo to a new Space.

  • Spaces runs app/app.py (Streamlit UI).

  • Use the console or HTTP /mcp endpoint for MCP access.

Option 2 β€” Docker Space:

  • Build via the included Dockerfile.

  • Exposes:

    • Streamlit UI on :7860

    • MCP HTTP gateway on :8000/mcp

# Local test docker build -t aurora-mcp . docker run -p 7860:7860 -p 8000:8000 aurora-mcp

🌍 Example MCP Tools

Tool

Purpose

list_files(root, pattern)

List domain files under

data/

.

read_text(path)

Read contents of a text file.

grep(pattern, paths, flags)

Regex search within selected files.

pubmed_url(pmid)

Build PubMed link for given PMID.

mesh_tags(paths)

Extract MeSH or chemical IDs.

unique_values(path, column)

List unique values from TSV column.

You can easily extend these to include:

  • Compound β†’ plant cross-references

  • Frequency summaries

  • Co-occurrence matrices

  • Filtered outputs for downstream ML models


🧩 Example Conversations (MCP-ChatGPT)

User:

List natural compounds found in Arctium lappa that have PubMed evidence of Complex I inhibition.

Aurora-MCP:


πŸ§ͺ Development

  • Extend tools in mcp_server/tools/.

  • Keep MCP responses lightweight (avoid full-file dumps).

  • Test new tools locally:

    pytest -v

βš–οΈ License

GNU General Public License v3.0 (GPL-3.0)
See LICENSE for details.


πŸ‘€ Author

Daniel N.
University of Helsinki β€” Department of Computer Science / Precision Medicine
Bioinformatics β€’ Machine Learning β€’ Mitochondrial Metabolism β€’ Natural-Product Discovery

-
security - not tested
A
license - permissive license
-
quality - not tested

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

Enables querying relationships between plant species, small molecules, and mitochondrial Complex I inhibitors by bridging natural-product, biodiversity, and PubMed datasets. Allows LLMs to perform structured searches and reasoning over biological data to identify potential plant-derived mitochondrial inhibitors.

  1. πŸ” Overview
    1. 🧠 Concept
      1. 🧩 Key Features
        1. 🧱 Architecture
          1. πŸš€ Quick Start (local)
            1. βž• Add to ChatGPT (MCP)
          2. πŸ’» Hugging Face Space
            1. 🌍 Example MCP Tools
              1. 🧩 Example Conversations (MCP-ChatGPT)
                1. πŸ§ͺ Development
                  1. βš–οΈ License
                    1. πŸ‘€ Author

                      MCP directory API

                      We provide all the information about MCP servers via our MCP API.

                      curl -X GET 'https://glama.ai/api/mcp/v1/servers/ndaniel/aurora-mcp'

                      If you have feedback or need assistance with the MCP directory API, please join our Discord server