uniprot_fetch
Fetch specific protein entries by providing UniProt accession IDs. Supports multiple databases, custom fields, and JSON or TOON output.
Instructions
Fetch specific protein entries by their UniProt accession IDs.
Args: ids: List of UniProt accession IDs to fetch. Examples: - ["P62988"] - Single protein - ["A0A0C5B5G6", "A0A1B0GTW7"] - Multiple proteins
database: UniProt database to fetch from. One of: uniprotkb (default), uniparc, uniref
fields: Optional list of return fields to include. If not specified, all fields
are returned. Available return fields (see https://www.uniprot.org/help/return_fields):
Names & Taxonomy:
- accession, id, gene_names, gene_primary, gene_synonym, gene_oln, gene_orf
- organism_name, organism_id, protein_name, xref_proteomes
- lineage, lineage_ids, virus_hosts
Sequences:
- cc_alternative_products, ft_var_seq, cc_sc_epred, fragment, encoded_in
- length, mass, cc_mass_spectrometry, ft_variant, ft_non_cons, ft_non_std
- ft_non_ter, cc_polymorphism, cc_rna_editing, sequence, cc_sequence_caution
- ft_conflict, ft_unsure, sequence_version
Function:
- absorption, ft_act_site, cc_activity_regulation, ft_binding, cc_catalytic_activity
- cc_cofactor, ft_dna_bind, ec, cc_function, kinetics, cc_pathway
- ph_dependence, redox_potential, rhea, ft_site, temp_dependence
Miscellaneous:
- annotation_score, cc_caution, comment_count, feature_count, keywordid, keyword
- cc_miscellaneous, protein_existence, reviewed, tools, uniparc_id
Interaction:
- cc_interaction, cc_subunit
Expression:
- cc_developmental_stage, cc_induction, cc_tissue_specificity
Gene Ontology (GO):
- go_p, go_c, go, go_f, go_id
Pathology & Biotech:
- cc_allergen, cc_biotechnology, cc_disruption_phenotype, cc_disease
- ft_mutagen, cc_pharmaceutical, cc_toxic_dose
Subcellular location:
- ft_intramem, cc_subcellular_location, ft_topo_dom, ft_transmem
PTM / Processing:
- ft_chain, ft_crosslnk, ft_disulfid, ft_carbohyd, ft_init_met, ft_lipid
- ft_mod_res, ft_peptide, cc_ptm, ft_propep, ft_signal, ft_transit
Structure:
- structure_3d, ft_strand, ft_helix, ft_turn
Publications:
- lit_pubmed_id
Date:
- date_created, date_modified, date_sequence_modified, version
Family & Domains:
- ft_coiled, ft_compbias, cc_domain, ft_domain, ft_motif, protein_families
- ft_region, ft_repeat, ft_zn_fing
Cross-references:
- See https://www.uniprot.org/help/return_fields for cross-reference fields
response_format: Response format. One of: 'json' (default) or 'toon'.
- 'json': Returns response in JSON format
- 'toon': Returns response in TOON formatReturns: When response_format='json': JSON object with: - results: Array of fetched protein entries - found: Number of entries successfully retrieved - requested: Number of IDs that were requested
When response_format='toon': TOON-formatted string with:
- results: Array of fetched protein entries
- found: Number of entries successfully retrieved
- requested: Number of IDs that were requestedInput Schema
| Name | Required | Description | Default |
|---|---|---|---|
| ids | Yes | ||
| database | No | uniprotkb | |
| fields | No | ||
| response_format | No | json |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |