cBioPortal MCP Server

by pickleton89

A high-performance async Model Context Protocol (MCP) server that enables AI assistants to interact with cancer genomics data from cBioPortal, a platform for exploring multidimensional cancer genomics datasets. Built with modern asynchronous Python for significantly faster data retrieval.

Features

  • 🔍 Cancer Studies: Browse and search cancer studies available in cBioPortal
  • 🧬 Genomic Data: Access gene mutations, clinical data, and molecular profiles
  • 🔎 Search Capabilities: Find studies, genes, and samples with keyword search
  • 📊 Multiple Data Types: Retrieve mutations, clinical data, and study metadata
  • ⚡ Async Performance: Fully asynchronous implementation; concurrent study fetching benchmarked at up to 4.57x faster than sequential requests
  • 📚 Bulk Operations: Concurrent fetching of multiple studies and genes for enhanced performance
  • 🔄 FastMCP Integration: Built on the high-performance FastMCP framework

Installation

Prerequisites

  • Python 3.8 or higher
  • pip (Python package installer)
  • Git (optional, for cloning the repository)

Set Up Environment

Option 1: Using venv and pip (standard method)
# Create a virtual environment
python -m venv cbioportal-mcp-env

# Activate the environment
# On Windows:
cbioportal-mcp-env\Scripts\activate
# On macOS/Linux:
source cbioportal-mcp-env/bin/activate
Install Dependencies with pip
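# Install the MCP SDK and FastMCP framework
# (quote the requirement so the shell does not treat >= as a redirect)
pip install "mcp>=2.0.0"

# Install additional dependencies (asyncio is part of the standard library)
pip install httpx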
Option 2: Using UV (faster alternative)

UV is a modern, high-performance Python package manager and environment manager that's significantly faster than pip.

# Install UV if you don't have it yet
pipx install uv
# Or with Homebrew
# brew install uv

# Create and activate a virtual environment with UV
uv venv

# Activate the environment
# On Windows:
.venv\Scripts\activate
# On macOS/Linux:
source .venv/bin/activate
Install Dependencies with UV
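# Install the MCP SDK and FastMCP framework
# (quote the requirement so the shell does not treat >= as a redirect)
uv pip install "mcp>=2.0.0"

# Install additional dependencies (asyncio is part of the standard library)
uv pip install httpx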

Download the Server

Download the cbioportal_server.py script to your working directory or clone this repository:

git clone https://github.com/pickleton89/cbioportal-mcp.git
cd cbioportal-mcp

Make the Script Executable (Linux/macOS only)

chmod +x cbioportal_server.py

Usage

Starting the Server

To start the server with default settings:

python cbioportal_server.py

This launches the server using the public cBioPortal API at https://www.cbioportal.org/api.

Advanced Options

Customize server behavior with command-line arguments:

# Use a different cBioPortal API instance
python cbioportal_server.py --base-url https://your-cbioportal-instance.org/api

# Specify a different transport mechanism (only stdio supported currently)
python cbioportal_server.py --transport stdio
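
The flags above could be parsed along the following lines. This is a minimal sketch, not the project's actual code: `parse_args` and `DEFAULT_BASE_URL` are hypothetical names, and only the two flag names, the default URL, and the stdio-only restriction come from this README.

```python
import argparse
from typing import List, Optional

# Default taken from this README; the constant name is an assumption.
DEFAULT_BASE_URL = "https://www.cbioportal.org/api"


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse the two command-line flags documented above."""
    parser = argparse.ArgumentParser(description="cBioPortal MCP server (sketch)")
    parser.add_argument(
        "--base-url",
        default=DEFAULT_BASE_URL,
        help="Base URL of the cBioPortal API instance to query",
    )
    parser.add_argument(
        "--transport",
        default="stdio",
        choices=["stdio"],  # the README lists stdio as the only supported transport
        help="Transport mechanism used to talk to the MCP client",
    )
    return parser.parse_args(argv)


# Example: parse an explicit argument list instead of sys.argv.
args = parse_args(["--base-url", "https://your-cbioportal-instance.org/api"])
print(f"base_url={args.base_url} transport={args.transport}")
```

Restricting `--transport` with `choices` makes an unsupported value fail at startup with a usage message rather than later at runtime.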

Configuration

Using with Claude Desktop

  1. Install Claude Desktop
  2. Open Claude Desktop
  3. Click on the MCP Servers icon in the toolbar
  4. Add a new MCP server with the following configuration:
{
  "mcpServers": {
    "cbioportal": {
      "command": "python",
      "args": ["/path/to/cbioportal_server.py"],
      "env": {}
    }
  }
}

Replace /path/to/cbioportal_server.py with the actual path to your script.
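
Because a wrong or relative script path is a common cause of connection problems, it can help to generate the configuration with an absolute path. The sketch below assumes the script sits in the current directory; `build_claude_config` is a hypothetical helper, not part of this project.

```python
import json
from pathlib import Path


def build_claude_config(script_path: str, python_cmd: str = "python") -> dict:
    """Return an mcpServers entry that points at an absolute script path."""
    absolute = str(Path(script_path).expanduser().resolve())
    return {
        "mcpServers": {
            "cbioportal": {
                "command": python_cmd,
                "args": [absolute],
                "env": {},
            }
        }
    }


# Print JSON that can be pasted into the Claude Desktop configuration.
print(json.dumps(build_claude_config("cbioportal_server.py"), indent=2))
```

If the server was installed in a virtual environment, passing that environment's interpreter as `python_cmd` should ensure the installed dependencies are found.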

Using with VS Code

Configure the MCP server in your workspace settings:

{
  "mcp.servers": {
    "cbioportal": {
      "command": "python",
      "args": ["/path/to/cbioportal_server.py"]
    }
  }
}

Available Tools

The cBioPortal MCP server provides the following tools:

| Tool Name | Description |
| --- | --- |
| get_cancer_studies | List all available cancer studies in cBioPortal |
| get_cancer_types | Get a list of all cancer types |
| get_study_details | Get detailed information about a specific cancer study |
| get_samples_in_study | Get a list of samples associated with a study |
| get_genes | Get information about specific genes by their Hugo symbol or Entrez ID |
| search_genes | Search for genes by keyword in their symbol or name |
| get_mutations_in_gene | Get mutations in a specific gene for a given study |
| get_clinical_data | Get clinical data for patients in a study |
| get_molecular_profiles | Get a list of molecular profiles available for a study |
| search_studies | Search for cancer studies by keyword |
| get_multiple_studies | Fetch multiple studies concurrently for better performance |
| get_multiple_genes | Retrieve multiple genes concurrently with automatic batching |

Examples

Here are examples of questions you can ask AI assistants connected to this server:

  • "What cancer studies are available in cBioPortal?"
  • "Search for melanoma studies in cBioPortal"
  • "Get information about the BRCA1 gene"
  • "What mutations in TP53 are present in breast cancer studies?"
  • "Find studies related to lung cancer"
  • "Get clinical data for patients in the TCGA breast cancer study"

Performance

The server is fully asynchronous, so requests to the cBioPortal API can overlap instead of running one after another. This matters most when a single question requires several API calls.

Benchmark Results

In our testing, the async implementation showed:

  • 4.57x faster for concurrent study fetching compared to sequential operations
  • Efficient batched processing for retrieving multiple genes
  • Consistent data quality between sequential and concurrent operations
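
The source of the speedup can be illustrated with a small timing comparison. This sketch simulates network latency with `asyncio.sleep` instead of calling the real API, so it will not reproduce the 4.57x figure; `fake_fetch_study`, the study IDs, and the latency value are all assumptions made for illustration.

```python
import asyncio
import time
from typing import Dict, List

SIMULATED_LATENCY = 0.05  # seconds; stand-in for one API round trip (assumption)


async def fake_fetch_study(study_id: str) -> Dict[str, str]:
    """Pretend to fetch one study; only waits, no network access."""
    await asyncio.sleep(SIMULATED_LATENCY)
    return {"studyId": study_id}


async def fetch_sequential(study_ids: List[str]) -> List[Dict[str, str]]:
    # Each request waits for the previous one to finish.
    return [await fake_fetch_study(s) for s in study_ids]


async def fetch_concurrent(study_ids: List[str]) -> List[Dict[str, str]]:
    # All requests are in flight at once; gather preserves input order.
    return list(await asyncio.gather(*(fake_fetch_study(s) for s in study_ids)))


async def timed(coro) -> tuple:
    """Await a coroutine and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = await coro
    return result, time.perf_counter() - start


async def compare(study_ids: List[str]) -> Dict[str, float]:
    seq_result, seq_time = await timed(fetch_sequential(study_ids))
    con_result, con_time = await timed(fetch_concurrent(study_ids))
    assert seq_result == con_result  # same data either way
    return {"sequential_s": seq_time, "concurrent_s": con_time}


if __name__ == "__main__":
    print(asyncio.run(compare(["study_a", "study_b", "study_c", "study_d"])))
```

With simulated latency the sequential time grows with the number of studies while the concurrent time stays close to a single round trip; real-world gains depend on the API server and network.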

Bulk Operation Benefits

The server provides specialized tools for bulk operations that leverage concurrency:

  • get_multiple_studies: Fetches multiple studies in parallel using asyncio.gather
  • get_multiple_genes: Implements smart batching for efficient concurrent gene retrieval

These methods include detailed performance metrics, such as execution time and batch counts, to help you understand the efficiency gains.
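
One way batching plus `asyncio.gather` and the reported metrics might fit together is sketched below. It is an illustration under stated assumptions rather than the server's implementation: `chunked`, `fetch_gene_batch`, `get_multiple_genes_sketch`, the batch size, and the shape of the returned dictionary are all hypothetical, and the fetch function only simulates a request.

```python
import asyncio
import time
from typing import Any, Dict, List


def chunked(items: List[str], size: int) -> List[List[str]]:
    """Split items into consecutive batches of at most `size` elements."""
    return [items[i:i + size] for i in range(0, len(items), size)]


async def fetch_gene_batch(batch: List[str]) -> Dict[str, Dict[str, str]]:
    """Stand-in for one batched API request (no network access)."""
    await asyncio.sleep(0.01)
    return {symbol: {"hugoGeneSymbol": symbol} for symbol in batch}


async def get_multiple_genes_sketch(
    symbols: List[str], batch_size: int = 3
) -> Dict[str, Any]:
    start = time.perf_counter()
    batches = chunked(symbols, batch_size)
    # One coroutine per batch; all batches run concurrently.
    results = await asyncio.gather(*(fetch_gene_batch(b) for b in batches))
    genes: Dict[str, Dict[str, str]] = {}
    for batch_result in results:
        genes.update(batch_result)
    return {
        "genes": genes,
        "metadata": {
            "count": len(genes),
            "batches": len(batches),
            "execution_time": time.perf_counter() - start,
        },
    }


if __name__ == "__main__":
    symbols = ["TP53", "BRCA1", "BRCA2", "KRAS", "EGFR"]
    out = asyncio.run(get_multiple_genes_sketch(symbols))
    print(out["metadata"])
```

Batching keeps the number of simultaneous requests proportional to the number of batches rather than the number of genes, which is gentler on the API server for long gene lists.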

Troubleshooting

Server Fails to Start

  • Ensure you have Python 3.8+ installed: python --version
  • Verify all dependencies are installed: pip list | grep mcp (on Windows: pip list | findstr mcp)
  • Check for error messages in the console

Connection Issues with Claude Desktop

  • Verify the path to the script is correct in your configuration
  • Make sure the script has execute permissions
  • Check the Claude logs for detailed error messages

API Connection Issues

  • Ensure you have internet connectivity
  • Verify that the cBioPortal API is accessible: curl https://www.cbioportal.org/api/cancer-types
  • Try using a different API endpoint if available
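
The reachability check can also be scripted in Python, which is convenient on systems without curl. This sketch uses only the standard library; `check_api` and its injectable `fetch` parameter are hypothetical names, and it assumes the `/cancer-types` endpoint returns a JSON list.

```python
import json
import urllib.request
from typing import Callable, Tuple


def _http_get(url: str, timeout: float = 10.0) -> bytes:
    """Fetch a URL with the standard library and return the raw body."""
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


def check_api(
    base_url: str, fetch: Callable[[str], bytes] = _http_get
) -> Tuple[bool, str]:
    """Return (ok, message) after requesting <base_url>/cancer-types."""
    url = base_url.rstrip("/") + "/cancer-types"
    try:
        payload = json.loads(fetch(url))
    except Exception as exc:  # network errors, timeouts, invalid JSON
        return False, f"{url} failed: {exc}"
    if not isinstance(payload, list):
        kind = type(payload).__name__
        return False, f"{url} returned unexpected JSON of type {kind}"
    return True, f"{url} returned {len(payload)} cancer types"
```

Calling `check_api("https://www.cbioportal.org/api")` performs the real request; pass the same value you give to `--base-url` to test a private instance.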

Development

Extending the Server

You can extend the functionality of the server by adding new methods to the CBioPortalMCPServer class and registering them as tools:

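from typing import Dict

# Add a new method to the CBioPortalMCPServer class
def my_new_tool(self, parameter1: str, parameter2: int) -> Dict:
    # Implementation
    return {"result": "data"}

# Register the new tool (run this where the server's other tools are registered)
self.mcp.tool()(self.my_new_tool)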
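
The registration call works because `tool()` returns a decorator, and applying that decorator to a bound method has the same effect as writing `@tool()` above a function. The self-contained sketch below shows only this call pattern: `ToolRegistry` and `ExampleServer` are hypothetical stand-ins and do not reproduce FastMCP's behavior.

```python
from typing import Callable, Dict


class ToolRegistry:
    """Hypothetical stand-in that mimics the tool() call pattern only."""

    def __init__(self) -> None:
        self.tools: Dict[str, Callable] = {}

    def tool(self) -> Callable[[Callable], Callable]:
        def decorator(func: Callable) -> Callable:
            self.tools[func.__name__] = func  # remember the callable by name
            return func
        return decorator


class ExampleServer:
    def __init__(self) -> None:
        self.mcp = ToolRegistry()
        # Same shape as the snippet above: call tool(), then apply the result.
        self.mcp.tool()(self.my_new_tool)

    def my_new_tool(self, parameter1: str, parameter2: int) -> Dict:
        return {"result": f"{parameter1}:{parameter2}"}


server = ExampleServer()
print(sorted(server.mcp.tools))
```

Registering the bound method (`self.my_new_tool`) rather than the plain function means the registry can call it without supplying `self`.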

Future Improvements

Potential improvements for future versions:

  • Caching for frequently accessed data
  • Authentication support for private cBioPortal instances
  • Additional endpoints for more comprehensive data access
  • Fine-tuning concurrency limits based on server capabilities
  • Request retry mechanisms for more robust error handling
  • More concurrent bulk operation methods for other endpoints
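
As an illustration of two items from this list, the sketch below combines retries with a concurrency limit. None of this exists in the server today: `with_retries`, `bounded_gather`, and their default values are hypothetical, and a production version would probably retry only on specific network errors.

```python
import asyncio
from typing import Awaitable, Callable, List, TypeVar

T = TypeVar("T")


async def with_retries(
    make_call: Callable[[], Awaitable[T]],
    attempts: int = 3,
    base_delay: float = 0.01,
) -> T:
    """Retry an async call with exponential backoff between attempts."""
    for attempt in range(attempts):
        try:
            return await make_call()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            await asyncio.sleep(base_delay * (2 ** attempt))
    raise RuntimeError("attempts must be at least 1")


async def bounded_gather(
    calls: List[Callable[[], Awaitable[T]]], limit: int = 5
) -> List[T]:
    """Run calls concurrently, with at most `limit` in flight at once."""
    semaphore = asyncio.Semaphore(limit)

    async def run(call: Callable[[], Awaitable[T]]) -> T:
        async with semaphore:
            return await with_retries(call)

    return list(await asyncio.gather(*(run(c) for c in calls)))
```

Each call is passed as a zero-argument function so that a fresh coroutine can be created on every retry; a coroutine object itself can only be awaited once.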

Updates and Maintenance

To update to the latest version of the MCP SDK:

pip install -U mcp

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments
