HF Dataset MCP
Server Configuration
Environment variables used to configure the server. Both are optional.
| Name | Required | Description | Default |
|---|---|---|---|
| HF_TOKEN | No | Hugging Face API token (required for private/gated datasets) | |
| HF_DATASETS_SERVER | No | Custom Dataset Viewer API URL | https://datasets-server.huggingface.co |
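As a rough illustration of how the two variables in the table could be consumed, the sketch below reads them from an environment mapping and falls back to the documented default URL. The helper names `load_settings` and `auth_headers` are hypothetical and not taken from the server's source; the `Authorization: Bearer` header is the usual way a Hugging Face token is sent.

```python
import os

# Default taken from the table above.
DEFAULT_VIEWER_URL = "https://datasets-server.huggingface.co"


def load_settings(env=None):
    """Return (token, base_url) from an environment mapping.

    Hypothetical helper: the real server may read its settings differently.
    """
    env = os.environ if env is None else env
    # An unset or empty HF_TOKEN means anonymous access (public datasets only).
    token = env.get("HF_TOKEN") or None
    # Fall back to the public Dataset Viewer API; drop any trailing slash.
    base_url = (env.get("HF_DATASETS_SERVER") or DEFAULT_VIEWER_URL).rstrip("/")
    return token, base_url


def auth_headers(token):
    """Build request headers; only send Authorization when a token is set."""
    return {"Authorization": f"Bearer {token}"} if token else {}
```

With neither variable set, `load_settings({})` yields no token and the default URL, which is sufficient for public datasets.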
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | `{"listChanged": true}` |
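In the MCP protocol, `listChanged: true` under `tools` signals that the server may send a `notifications/tools/list_changed` notification when its tool list changes. A minimal sketch of how a client might act on that flag follows; the function names are hypothetical and the dictionaries stand in for already-parsed JSON messages.

```python
def supports_tool_list_changes(capabilities):
    """True if the server advertises tools.listChanged in its capabilities."""
    tools = capabilities.get("tools") or {}
    return bool(tools.get("listChanged"))


def should_refresh_tools(capabilities, notification):
    """True if a notification means the client should re-run tools/list."""
    return (
        supports_tool_list_changes(capabilities)
        and notification.get("method") == "notifications/tools/list_changed"
    )


# Capabilities as listed in the table above.
server_capabilities = {"tools": {"listChanged": True}}
```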
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| search_datasets | Find datasets on the Hugging Face Hub by name, tag, or author |
| validate_dataset | Check if a dataset is accessible and which viewer features are available |
| list_splits | Get all available configurations and splits for a dataset |
| get_dataset_info | Get the schema, metadata, and row counts for a dataset configuration |
| get_rows | Fetch a slice of rows from a dataset split |
| search_dataset | Full-text search within a dataset split using BM25 ranking |
| filter_rows | Filter dataset rows using SQL-like WHERE conditions |
| get_dataset_size | Get row counts and byte sizes for all configs and splits |
| list_parquet_files | Get URLs for the dataset's Parquet files for direct download or processing |
| get_statistics | Get descriptive statistics for each column in a dataset split |
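An MCP client invokes any of these tools with a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request for the row-fetching tool. The envelope follows the MCP specification, but the argument names (`dataset`, `config`, `split`, `offset`, `length`) and the dataset name are assumptions for illustration; the actual input schema should be read from the server's `tools/list` response.

```python
import json
from itertools import count

# Monotonic request ids, as JSON-RPC expects each request to carry its own id.
_request_ids = count(1)


def build_tool_call(name, arguments):
    """Build a JSON-RPC 2.0 `tools/call` request for an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Hypothetical arguments: check the tool's real schema before relying on them.
request = build_tool_call(
    "get_rows",
    {
        "dataset": "example-org/example-dataset",
        "config": "default",
        "split": "train",
        "offset": 0,
        "length": 10,
    },
)
payload = json.dumps(request)  # Serialized form sent over the MCP transport.
```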
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
| No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
| No resources | |
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/cfahlgren1/hf-dataset-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server.
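The same directory lookup as the curl command can be sketched in Python using only the standard library. The helper names are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page, so treat `fetch_server` as illustrative.

```python
import json
import urllib.request

# Base path taken from the curl command above.
API_ROOT = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner, name):
    """Build (but do not send) the GET request for one server entry."""
    url = f"{API_ROOT}/{owner}/{name}"
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )


def fetch_server(owner, name, timeout=10):
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_server_request(owner, name), timeout=timeout) as resp:
        return json.load(resp)


request = build_server_request("cfahlgren1", "hf-dataset-mcp")
```

Calling `fetch_server("cfahlgren1", "hf-dataset-mcp")` performs the network request; building the request separately keeps the URL construction easy to check offline.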