get_samples_by_annotation
Find biosamples with specific functional annotations (PFAM, KEGG, COG, GO) and retrieve complete records including data objects like GFF files with IDs and URLs.
Instructions
Use this tool to find biosamples with specific functional annotations. Returns COMPLETE biosample records including all data objects (GFF files, protein files, etc.) with their IDs and URLs. ALWAYS set max_records to match the user's request (e.g., if they ask for '1 sample' or 'a sample', set max_records=1). Use max_records, NOT limit, to control how many samples to return. Required formats: PFAM domains use 'PFAM:PF04183', KEGG use 'KEGG.ORTHOLOGY:K00001', COG use 'COG:COG0001', GO use 'GO:GO0000001'. When users want genomic locations of domains, use this first to find samples, then use fetch_and_filter_gff_by_pfam_domains with a GFF data_object_id from the results.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| gene_function_ids | Yes | ||
| max_records | No | ||
| limit | No | ||
| offset | No |