Skip to main content
Glama

BioContextAI Knowledgebase MCP

Apache 2.0
2
  • Apple
  • Linux

BioContextAI - Knowledgebase MCP

A Model Context Protocol (MCP) server for biomedical research that provides a standardized connection layer between artificial intelligence systems and biomedical resources. Documentation and usage guides are available at: https://biocontext.ai

Overview

BioContextAI Knowledgebase MCP is an MCP server implementation for common biomedical resources, enabling agentic large language models (LLMs) to retrieve verified information and perform domain-specific tasks. Unlike previous approaches that required custom integration for each resource, BioContextAI KB MCP provides a unified access layer through the Model Context Protocol that enables interoperability between AI systems and domain-specific data sources.

The Knowledgebase MCP is available both as:

  • An open-source software package for local hosting (see Self-hosting)
  • A remote server for setup-free integration at https://mcp.biocontext.ai/mcp/ (subject to fair use)

Warning

If possible, we encourage you to run BioContextAI Knowledgebase MCP locally to avoid rate limits and ensure the service's availability for applications that rely on remote hosting.

The Knowledgebase MCP is part of the wider BioContextAI project. The BioContextAI Registry catalogues community servers that expose biomedical databases and analysis tools, providing the community with a resource for tool discovery and distribution. The registry index can be found at: https://biocontext.ai/registry.

Citation

If our work is useful to your research, please cite it as below.

TBA.

Implemented Tools

BioContextAI Knowledgebase MCP exposes a number of external biomedical APIs. You can think of BioContextAI as a browser for your LLM that allows it to find relevant information across these knowledge bases. Please make sure to adhere to the usage limits (e.g., rate limits) of the respective services when using BioContextAI Knowledgebase MCP. If you use data from these services in your research, please make sure to cite both BioContextAI as well as the respective data source/tool.

Warning

The data accessed through these APIs is not covered by the BioContextAI Knowledgebase MCP license. You are responsible for ensuring that your use of the data aligns with permitted practices.

Tools

  • Antibody Registry - Gene id conversion
  • bioRxiv/medRxiv - Recent preprint search and metadata access
  • Ensembl - Gene id conversion
  • EuropePMC - Literature search and full-text access
  • Google Scholar - Academic publication and author search (only available for local use due to rate limiting)
  • InterPro - Protein families, domains, and functional sites classification
  • KEGG - Pathways, gene and drug-drug interaction database (only available for local use due to licensing restrictions)
  • OpenTargets - Target-disease associations
  • PanglaoDB - Single-cell RNA-sequencing cell type markers
  • PRIDE - Proteomics data repository for mass spectrometry data
  • Protein Atlas - Protein expression data
  • Reactome - Pathways database
  • STRING - Protein-protein interaction networks
  • AlphaFold DB - Tertiary protein structure predictions

License overview

The above data is provided without warranty and may be inaccurate or out-of-date. Please verify your use case is allowed.

OpenAPI MCP servers

FastMCP allows for easy conversion of REST endpoints following the OpenAPI specification into MCP servers. We have added code to automatically create such servers based on schemas provided through a configuration file, so that users deploying their own version of BioContextAI can easily extend the list of available tools. The configuration file is located at src/biocontext_kb/openapi/config.yaml. By default, no OpenAPI servers are included, but you can edit the configuration file to add services.

Self-hosting

Clone the latest version of this repository:

git clone https://github.com/biocontext-ai/core-mcp-server.git cd core-mcp-server

Then run one of the following:

  1. Docker setup (gunicorn with multiple uvicorn workers):
# Build the docker container docker build -t biocontext_kb:latest . docker run -p 127.0.0.1:8000:8000 biocontext_kb:latest

This exposes your MCP server at: http://127.0.0.1:8000/mcp/

Warning

For public deployments, you should disabled unnecessary ports and access your MCP server through a reverse proxy, e.g., Nginx or Caddy. You may also want to configure the running user and the directory to have limited rights, use Docker or podman in a rootless setup and take additional security measures like DDOS protection with Cloudflare or fail2ban.

  1. Local setup with uv, with streamable HTTP and uvicorn:
uv build export MCP_ENVIRONMENT=PRODUCTION export PORT=8000 biocontext_kb
  1. Locall setup with uv, with stdio transport:
uv build export MCP_ENVIRONMENT=DEVELOPMENT biocontext_kb
  1. Locall setup with Claude Desktop (claude_desktop_config.json):
{ "mcpServers": { "biocontext_kb": { "command": "uvx", "args": [ "biocontext_kb@latest" ], "env": { "UV_PYTHON": "3.12" } } } }

Don't forget to restart Claude to apply the changes.

  1. Locally, with your coding agents in VS Code (.vscode/mcp.json) or Cursor (.cursor/mcp.json) or WindSurf (.codeium/windsurf/mcp_config.json):
{ // VS Code: "servers" "mcpServers": { "biocontext_kb": { // if it doesn't work, replace with the `which uvx` path (installation via Homebrew recommended on macOS) "command": "uvx", "args": [ "biocontext_kb@latest" ], } } }

When using Windows and WSL2 the above config needs to be adapted as follows:

{ // VS Code: "servers" "mcpServers": { "biocontext_kb": { "command": "wsl", "args": [ "~/.local/bin/uvx", // set to the path to your `uvx` "biocontext_kb" ] } } }
  1. Locally, with your own MCP clients:

Resources

  • Project documentation: https://biocontext.ai
  • API documentationt: https://docs.kb.biocontext.ai/
  • BioContextAI Registry: https://github.com/biocontext-ai/registry
  • Chat Interface: https://biocontext.ai/chat

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Warning

The Apache 2.0 License only applies to the code provided in this repository. For usage limitations and licenses of the individually integrated APIs, users should directly refer to the respective API providers. We provide an overview below.

Data Sources and Licensing

Data SourceLicenseURLNotes
AlphaFold (EMBL-EBI)CC BY 4.0https://alphafold.ebi.ac.uk/
Antibody Registry (RRIDs)CC0 (metadata: CC-NC)https://www.antibodyregistry.org/faqCommercial reuse restrictions on some metadata
bioRxiv/medRxivCC BY 4.0https://www.biorxiv.org/about/FAQPreprint content licenses vary
ClinicalTrials.gov APITerms of Servicehttps://clinicaltrials.gov/about-site/terms-conditionsAttribution required
EnsemblNo restrictions*https://www.ensembl.org/info/about/legal/disclaimer.html*Some third-party data may have restrictions
EuropePMCVarious/Copyright protectedhttps://europepmc.org/CopyrightIndividual article licenses vary
Google ScholarTerms of Servicehttps://scholar.google.com/intl/en/scholar/terms.htmlRate limiting; use responsibly
Grants.gov APITerms of Servicehttps://www.grants.gov/api/terms-conditionsAttribution required
Human Protein AtlasCC BY-SA 4.0https://www.proteinatlas.org/about/licence
InterProCC0 1.0 Universalhttps://www.ebi.ac.uk/interpro/Includes InterPro, Pfam, PRINTS, and SFLD data
KEGGProprietary (Free academic use)https://www.kegg.jp/kegg/legal.htmlCommercial services/remote hosting not permitted
Ontology Lookup Service (EMBL-EBI)Generally CC0/CC BYhttps://www.ebi.ac.uk/licencing/Refer to EMBL-EBI general licensing
Open TargetsCC0 1.0https://platform-docs.opentargets.org/licence
OpenFDACC0 1.0 Universal*https://open.fda.gov/license/*Some data may have restrictions
PanglaoDBPublic datahttps://panglaodb.se/about.htmlAll data are public
PRIDECC0/CC BY 4.0*https://www.ebi.ac.uk/pride/markdownpage/license*CC0 for datasets from June 2018+, CC BY 4.0 for derived resources
ReactomeCC0https://reactome.org/license
STRINGCC BY 4.0https://string-db.org/cgi/access?footer_active_subpage=licensing
UniProtCC BY 4.0https://www.uniprot.org/help/license

Disclaimer

Users are solely responsible for ensuring compliance with all applicable license terms and conditions when accessing data through this MCP server. The licenses and terms listed above are subject to change, and additional citation requirements may apply for specific datasets or publications. Before using any data for commercial purposes, redistribution, or publication, please review the current license terms directly from each data source. Some data sources may have additional restrictions not fully captured in this summary.

For KEGG data specifically, please note that while academic use is permitted, providing commercial services or remote hosting using KEGG data is not allowed under their proprietary license terms.

-
security - not tested
A
license - permissive license
-
quality - not tested

An MCP server that provides standardized access to biomedical knowledge bases and resources, enabling AI systems to retrieve verified information from sources like bioRxiv, EuropePMC, and various protein/gene databases.

  1. Overview
    1. Citation
      1. Implemented Tools
        1. Tools
      2. License overview
        1. OpenAPI MCP servers
      3. Self-hosting
        1. Resources
          1. License
            1. Data Sources and Licensing
            2. Disclaimer

          Related MCP Servers

          • -
            security
            F
            license
            -
            quality
            🔍 Enable AI assistants to search and access bioRxiv papers through a simple MCP interface. The bioRxiv MCP Server provides a bridge between AI assistants and bioRxiv's preprint repository through the Model Context Protocol (MCP). It allows AI models to search for biology preprints and access their
            Last updated -
            16
            Python
            • Linux
            • Apple
          • -
            security
            F
            license
            -
            quality
            🔍 Enable AI assistants to search and access medRxiv papers through a simple MCP interface. The medRxiv MCP Server provides a bridge between AI assistants and medRxiv's preprint repository through the Model Context Protocol (MCP). It allows AI models to search for health sciences preprints and acce
            Last updated -
            5
            Python
            • Linux
            • Apple
          • -
            security
            A
            license
            -
            quality
            An MCP server that enables language models to fetch protein information from the UniProt database, including protein details, sequences, functions, and structures.
            Last updated -
            Python
            MIT License
            • Linux
            • Apple
          • -
            security
            F
            license
            -
            quality
            An MCP server that provides access to Monarch Initiative's biomedical data through the Biolink API, enabling agents to query biological entities, associations, and knowledge graphs using natural language.
            Last updated -
            Python

          View all related MCP servers

          MCP directory API

          We provide all the information about MCP servers via our MCP API.

          curl -X GET 'https://glama.ai/api/mcp/v1/servers/biocontext-ai/knowledgebase-mcp'

          If you have feedback or need assistance with the MCP directory API, please join our Discord server