UCSC Genome Browser MCP Server
A Model Context Protocol (MCP) server that provides comprehensive access to the UCSC Genome Browser API. This server enables LLM applications to query genomic data, sequences, tracks, and metadata from the UCSC Genome Browser.
Features
This MCP server exposes 12 tools that cover all major UCSC Genome Browser API endpoints:
Listing & Discovery Tools
find_genome - Search for genomes using keywords, accession IDs, or organism names
list_public_hubs - List all available public track hubs
list_ucsc_genomes - List all UCSC database genomes
list_genark_genomes - List GenArk assembly hub genomes
list_hub_genomes - List genomes from a specific hub
list_files - List downloadable files for a genome
list_tracks - List data tracks in a genome or hub
list_chromosomes - List chromosomes in a genome or track
list_schema - Get schema/field definitions for a track
Data Retrieval Tools
get_sequence - Retrieve DNA sequences from genomes
get_track_data - Get track data (genes, variants, annotations, etc.)
search_genome - Search within a genome assembly
Installation
Prerequisites
Python 3.10 or higher
pip
Install from source
Required Dependencies
mcp>=0.9.0- Model Context Protocol SDKhttpx>=0.27.0- HTTP client for API requests
Usage
Running the Server
The server communicates over stdio, following the MCP protocol:
Configuration with Claude Desktop
Add this to your Claude Desktop configuration file:
MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
Tool Examples
1. Find Genomes
Search for dog genomes:
Search with advanced operators:
2. List Available Tracks
List all tracks for human genome (hg38):
List tracks without container information:
3. Get DNA Sequence
Get entire mitochondrial chromosome:
Get specific region:
Get reverse complement:
4. Get Track Data
Get gene annotations for a region:
Get track data from an assembly hub:
5. Search Within a Genome
Search for BRCA1 in human genome:
Search only in help documentation:
API Details
Base URL
All requests go to: https://api.genome.ucsc.edu
Rate Limits
Recommended: Maximum 1 request per second
The API has a botDelay system to prevent overload
Excessive queries may result in restricted access
Coordinate Systems
Start coordinates: 0-based (first base is 0)
End coordinates: 1-based (exclusive)
Example:
start=0, end=10retrieves the first 10 bases
Supported Track Types
The get_track_data tool supports these track types:
bed, bigBed, bigWig
genePred, bigGenePred
bigChain, bigPsl, bigMaf
narrowPeak, bigNarrowPeak
wiggle/wig, barChart/bigBarChart
interact/bigInteract
And many more...
Advanced Features
Working with Large Datasets
For tracks with over 1 million items, use pagination:
Query by chromosome:
Use start/end coordinates for smaller regions:
Working with Track Hubs
Track hubs allow accessing external genomic data:
Error Handling
The server handles various error conditions:
HTTP errors (404, 500, etc.)
Invalid parameters
Non-existent genomes/tracks/chromosomes
Request timeouts (30 second default)
Errors are returned as text responses with descriptive messages.
Use Cases
This MCP server enables LLM applications to:
Genomic Research: Query gene locations, sequences, and annotations
Variant Analysis: Retrieve SNP and variant data
Comparative Genomics: Access data from multiple species
Sequence Analysis: Get DNA/RNA sequences for analysis
Data Discovery: Find available datasets and assemblies
Educational: Learn about genomics through interactive queries
API Documentation
For complete API documentation, visit: https://genome.ucsc.edu/goldenpath/help/api.html
License
This project interfaces with the UCSC Genome Browser, which has its own terms of use: https://genome.ucsc.edu/conditions.html
Support
For issues with the UCSC Genome Browser API, contact UCSC. For issues with this MCP server, please file an issue in the repository.
Development
Project Structure
Adding New Tools
To add new endpoints:
Add the tool definition in
list_tools()Add the handler in
call_tool()Update this README with examples
Testing
Test individual endpoints manually:
Changelog
Version 0.1.0
Initial release
Support for all major UCSC Genome Browser API endpoints
12 tools covering listing, discovery, and data retrieval
Full documentation and examples
This server cannot be installed
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
Provides comprehensive access to the UCSC Genome Browser API, enabling queries of genomic data, DNA sequences, gene annotations, variants, and metadata across multiple species and assemblies.