Skip to main content
Glama
cfahlgren1

HF Dataset MCP

by cfahlgren1

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
HF_TOKENNoHugging Face API token (required for private/gated datasets)
HF_DATASETS_SERVERNoCustom Dataset Viewer API URLhttps://datasets-server.huggingface.co

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}

Tools

Functions exposed to the LLM to take actions

NameDescription
search_datasetsA

Find datasets on the Hugging Face Hub by name, tag, or author

validate_datasetC

Check if a dataset is accessible and which viewer features are available

list_splitsC

Get all available configurations and splits for a dataset

get_dataset_infoB

Get the schema, metadata, and row counts for a dataset configuration

get_rowsC

Fetch a slice of rows from a dataset split

search_datasetB

Full-text search within a dataset split using BM25 ranking

filter_rowsC

Filter dataset rows using SQL-like WHERE conditions

get_dataset_sizeB

Get row counts and byte sizes for all configs and splits

list_parquet_filesA

Get URLs for the dataset's Parquet files for direct download or processing

get_statisticsB

Get descriptive statistics for each column in a dataset split

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/cfahlgren1/hf-dataset-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server