Schema | nmdc-mcp

nmdc-mcp

Official

by microbiomedata

Overview Schema Related Servers Score Discussions

Python

Remote

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
No arguments

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": true }
`prompts`	{ "listChanged": false }
`resources`	{ "subscribe": false, "listChanged": false }
`experimental`	{}

Tools

Functions exposed to the LLM to take actions

Name	Description
fetch_and_filter_gff_by_pfam_domainsA	Use this tool to analyze GFF files for PFAM domains and show genomic locations. Use AFTER get_samples_by_annotation to get the actual gene coordinates and genomic context. Takes a data_object_id from the previous search results and downloads the entire GFF file by default to show specific gene annotations containing the PFAM domains. Only use sample_bytes parameter if you specifically need to limit download size.
get_data_objects_by_pfam_domainsC	Use this tool to find biosamples that contain specific PFAM protein domains. Returns structured data about biosamples, their activities, and data objects. Perfect for finding samples with particular functional capabilities.
get_collection_namesA	Use this tool to discover what types of data are available in the NMDC database. Returns a list of collection names like 'biosample_set', 'study_set', etc.
get_collection_statsA	Use this tool to get statistics about NMDC collections including document counts. Helps understand the size and scope of available data.
get_all_collection_idsB	Use this tool to get lists of IDs from NMDC collections in batches. Useful for sampling or systematic analysis of large datasets.
get_samples_in_elevation_rangeB	Use this tool to find biosamples collected within a specific elevation range. Perfect for studying altitude-related microbial communities.
get_samples_within_lat_lon_bounding_boxC	Use this tool to find biosamples collected within a specific geographic bounding box defined by latitude and longitude coordinates.
get_samples_by_ecosystemC	Use this tool to find biosamples from specific ecosystem types, categories, or subtypes. Perfect for studying particular environments like soil, marine, or host-associated microbiomes.
get_entity_by_idC	Use this tool to retrieve any NMDC entity by its ID. Works with biosamples, studies, data objects, and other NMDC entities.
get_entity_by_id_with_projectionA	Use this tool to retrieve specific fields from an NMDC entity by ID. More efficient than getting full entities when you only need certain fields.
get_entities_by_ids_with_projectionA	Use this tool to retrieve specific fields from multiple NMDC entities by their IDs. Efficient for batch operations with field filtering.
get_study_for_biosampleC	Use this tool to find the study associated with a specific biosample. Returns study information and metadata.
get_biosamples_for_studyC	Use this tool to find all biosamples associated with a specific study. Returns a list of biosample IDs and metadata.
get_study_doi_detailsB	Use this tool to get detailed DOI information for a study, including publication DOIs, dataset DOIs, and award DOIs.
search_studies_by_doi_criteriaC	Use this tool to search for studies based on DOI criteria like provider, category, or DOI value patterns. Perfect for finding published studies or datasets from specific sources.
get_samples_by_annotationA	Use this tool to find biosamples with specific functional annotations. Returns COMPLETE biosample records including all data objects (GFF files, protein files, etc.) with their IDs and URLs. ALWAYS set max_records to match the user's request (e.g., if they ask for '1 sample' or 'a sample', set max_records=1). Use max_records, NOT limit, to control how many samples to return. Required formats: PFAM domains use 'PFAM:PF04183', KEGG use 'KEGG.ORTHOLOGY:K00001', COG use 'COG:COG0001', GO use 'GO:GO0000001'. When users want genomic locations of domains, use this first to find samples, then use fetch_and_filter_gff_by_pfam_domains with a GFF data_object_id from the results.

Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
No resources

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/microbiomedata/nmdc-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server