Skip to main content
Glama
microbiomedata

nmdc-mcp

Official

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault

No arguments

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
fetch_and_filter_gff_by_pfam_domainsA

Use this tool to analyze GFF files for PFAM domains and show genomic locations. Use AFTER get_samples_by_annotation to get the actual gene coordinates and genomic context. Takes a data_object_id from the previous search results and downloads the entire GFF file by default to show specific gene annotations containing the PFAM domains. Only use sample_bytes parameter if you specifically need to limit download size.

get_data_objects_by_pfam_domainsC

Use this tool to find biosamples that contain specific PFAM protein domains. Returns structured data about biosamples, their activities, and data objects. Perfect for finding samples with particular functional capabilities.

get_collection_namesA

Use this tool to discover what types of data are available in the NMDC database. Returns a list of collection names like 'biosample_set', 'study_set', etc.

get_collection_statsA

Use this tool to get statistics about NMDC collections including document counts. Helps understand the size and scope of available data.

get_all_collection_idsB

Use this tool to get lists of IDs from NMDC collections in batches. Useful for sampling or systematic analysis of large datasets.

get_samples_in_elevation_rangeB

Use this tool to find biosamples collected within a specific elevation range. Perfect for studying altitude-related microbial communities.

get_samples_within_lat_lon_bounding_boxC

Use this tool to find biosamples collected within a specific geographic bounding box defined by latitude and longitude coordinates.

get_samples_by_ecosystemC

Use this tool to find biosamples from specific ecosystem types, categories, or subtypes. Perfect for studying particular environments like soil, marine, or host-associated microbiomes.

get_entity_by_idC

Use this tool to retrieve any NMDC entity by its ID. Works with biosamples, studies, data objects, and other NMDC entities.

get_entity_by_id_with_projectionA

Use this tool to retrieve specific fields from an NMDC entity by ID. More efficient than getting full entities when you only need certain fields.

get_entities_by_ids_with_projectionA

Use this tool to retrieve specific fields from multiple NMDC entities by their IDs. Efficient for batch operations with field filtering.

get_study_for_biosampleC

Use this tool to find the study associated with a specific biosample. Returns study information and metadata.

get_biosamples_for_studyC

Use this tool to find all biosamples associated with a specific study. Returns a list of biosample IDs and metadata.

get_study_doi_detailsB

Use this tool to get detailed DOI information for a study, including publication DOIs, dataset DOIs, and award DOIs.

search_studies_by_doi_criteriaC

Use this tool to search for studies based on DOI criteria like provider, category, or DOI value patterns. Perfect for finding published studies or datasets from specific sources.

get_samples_by_annotationA

Use this tool to find biosamples with specific functional annotations. Returns COMPLETE biosample records including all data objects (GFF files, protein files, etc.) with their IDs and URLs. ALWAYS set max_records to match the user's request (e.g., if they ask for '1 sample' or 'a sample', set max_records=1). Use max_records, NOT limit, to control how many samples to return. Required formats: PFAM domains use 'PFAM:PF04183', KEGG use 'KEGG.ORTHOLOGY:K00001', COG use 'COG:COG0001', GO use 'GO:GO0000001'. When users want genomic locations of domains, use this first to find samples, then use fetch_and_filter_gff_by_pfam_domains with a GFF data_object_id from the results.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/microbiomedata/nmdc-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server