Skip to main content
Glama

Dataset Viewer MCP Server

by privetin

get_statistics

Retrieve detailed statistics for a Hugging Face dataset by specifying dataset identifier, configuration, and split. Analyze data distribution and insights for training or evaluation.

Instructions

Get statistics about a Hugging Face dataset

Input Schema

NameRequiredDescriptionDefault
auth_tokenNoHugging Face auth token for private/gated datasets
configYesDataset configuration/subset name. Use get_info to list available configs
datasetYesHugging Face dataset identifier in the format owner/dataset
splitYesDataset split name. Splits partition the data for training/evaluation

Input Schema (JSON Schema)

{ "properties": { "auth_token": { "description": "Hugging Face auth token for private/gated datasets", "optional": true, "type": "string" }, "config": { "description": "Dataset configuration/subset name. Use get_info to list available configs", "examples": [ "default", "en", "es" ], "type": "string" }, "dataset": { "description": "Hugging Face dataset identifier in the format owner/dataset", "examples": [ "ylecun/mnist", "stanfordnlp/imdb" ], "pattern": "^[^/]+/[^/]+$", "type": "string" }, "split": { "description": "Dataset split name. Splits partition the data for training/evaluation", "examples": [ "train", "validation", "test" ], "type": "string" } }, "required": [ "dataset", "config", "split" ], "type": "object" }

You must be authenticated.

Other Tools from Dataset Viewer MCP Server

Related Tools

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/privetin/dataset-viewer'

If you have feedback or need assistance with the MCP directory API, please join our Discord server