Dataset Viewer MCP Server

by privetin

search_dataset

Search for specific text within Hugging Face datasets by specifying dataset, config, split, and query. Ideal for locating relevant data segments in large datasets hosted on Hugging Face Hub.

Instructions

Search for text within a Hugging Face dataset

Input Schema

Name        Required  Description                                                                Default
auth_token  No        Hugging Face auth token for private/gated datasets                         -
config      Yes       Dataset configuration/subset name. Use get_info to list available configs  -
dataset     Yes       Hugging Face dataset identifier in the format owner/dataset                -
query       Yes       Text to search for in the dataset                                          -
split       Yes       Dataset split name. Splits partition the data for training/evaluation      -
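As a rough illustration of how these parameters fit together, the sketch below assembles a JSON-RPC 2.0 `tools/call` message of the kind an MCP client would send to invoke `search_dataset`. The helper name `build_search_request` is hypothetical, and the dataset, config, split, and query values are placeholders; the config name in particular is a guess, so confirm it with get_info before using it.

```python
import json


def build_search_request(dataset, config, split, query, auth_token=None, request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` message for the search_dataset tool.

    Hypothetical helper for illustration; not part of the server itself.
    """
    arguments = {
        "dataset": dataset,  # owner/dataset identifier
        "config": config,    # configuration/subset name
        "split": split,      # e.g. "train", "validation", "test"
        "query": query,      # text to search for
    }
    # auth_token is optional, so include it only when one was supplied.
    if auth_token is not None:
        arguments["auth_token"] = auth_token
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "search_dataset", "arguments": arguments},
    }


# Placeholder values; "plain_text" is an assumed config name.
request = build_search_request("stanfordnlp/imdb", "plain_text", "train", "great movie")
print(json.dumps(request, indent=2))
```

In practice an MCP client library would normally build and send this message for you; the sketch only shows which fields end up in the `arguments` object.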

Input Schema (JSON Schema)

{
  "properties": {
    "auth_token": {
      "description": "Hugging Face auth token for private/gated datasets",
      "optional": true,
      "type": "string"
    },
    "config": {
      "description": "Dataset configuration/subset name. Use get_info to list available configs",
      "examples": [ "default", "en", "es" ],
      "type": "string"
    },
    "dataset": {
      "description": "Hugging Face dataset identifier in the format owner/dataset",
      "examples": [ "ylecun/mnist", "stanfordnlp/imdb" ],
      "pattern": "^[^/]+/[^/]+$",
      "type": "string"
    },
    "query": {
      "description": "Text to search for in the dataset",
      "type": "string"
    },
    "split": {
      "description": "Dataset split name. Splits partition the data for training/evaluation",
      "examples": [ "train", "validation", "test" ],
      "type": "string"
    }
  },
  "required": [ "dataset", "config", "split", "query" ],
  "type": "object"
}
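The constraints in this schema (four required string fields and the owner/dataset pattern) can be checked on the client before a call is made. The following is a minimal sketch using only the Python standard library; `validate_search_args` is a hypothetical helper, and it covers only the `required`, `type`, and `pattern` rules shown above rather than full JSON Schema validation.

```python
import re

# Values copied from the schema's "required" list and "dataset" pattern.
REQUIRED_FIELDS = ("dataset", "config", "split", "query")
KNOWN_FIELDS = REQUIRED_FIELDS + ("auth_token",)
DATASET_PATTERN = re.compile(r"^[^/]+/[^/]+$")


def validate_search_args(args):
    """Return a list of problems found in a search_dataset arguments dict.

    Hypothetical helper: an empty list means the checks implemented here passed.
    """
    errors = []
    # Every field named in "required" must be present.
    for name in REQUIRED_FIELDS:
        if name not in args:
            errors.append(f"missing required field: {name}")
    # Every schema property is declared with type "string".
    for name in KNOWN_FIELDS:
        if name in args and not isinstance(args[name], str):
            errors.append(f"{name} must be a string")
    # The dataset identifier must contain exactly one slash: owner/dataset.
    dataset = args.get("dataset")
    if isinstance(dataset, str) and not DATASET_PATTERN.search(dataset):
        errors.append("dataset must look like owner/dataset")
    return errors


good = {"dataset": "ylecun/mnist", "config": "default", "split": "train", "query": "7"}
bad = {"dataset": "mnist", "config": "default", "split": "train"}
print(validate_search_args(good))
print(validate_search_args(bad))
```

A full validator such as a JSON Schema library would be the more robust choice; this hand-written check is only meant to make the schema's rules concrete.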

ID: b5mmrmnn6b