Dataset Viewer MCP Server

MIT License

Overview InspectNew Schema Related Servers Reviews Score

search_dataset

Search for specific text within Hugging Face datasets by specifying dataset, config, split, and query. Ideal for locating relevant data segments in large datasets hosted on Hugging Face Hub.

Instructions

Search for text within a Hugging Face dataset

Input Schema

Name	Required	Description
`auth_token`	No	Hugging Face auth token for private/gated datasets
`config`	Yes	Dataset configuration/subset name. Use get_info to list available configs
`dataset`	Yes	Hugging Face dataset identifier in the format owner/dataset
`query`	Yes	Text to search for in the dataset
`split`	Yes	Dataset split name. Splits partition the data for training/evaluation

Input Schema (JSON Schema)

{
  "properties": {
    "auth_token": {
      "description": "Hugging Face auth token for private/gated datasets",
      "optional": true,
      "type": "string"
    },
    "config": {
      "description": "Dataset configuration/subset name. Use get_info to list available configs",
      "examples": [
        "default",
        "en",
        "es"
      ],
      "type": "string"
    },
    "dataset": {
      "description": "Hugging Face dataset identifier in the format owner/dataset",
      "examples": [
        "ylecun/mnist",
        "stanfordnlp/imdb"
      ],
      "pattern": "^[^/]+/[^/]+$",
      "type": "string"
    },
    "query": {
      "description": "Text to search for in the dataset",
      "type": "string"
    },
    "split": {
      "description": "Dataset split name. Splits partition the data for training/evaluation",
      "examples": [
        "train",
        "validation",
        "test"
      ],
      "type": "string"
    }
  },
  "required": [
    "dataset",
    "config",
    "split",
    "query"
  ],
  "type": "object"
}

Install Server

HTTP connection URL

Other Tools from Dataset Viewer MCP Server

Related Tools

filter
@privetin/dataset-viewer
search-spaces
@xiyuefox/mcp-hfspace
search-spaces
@evalstate/mcp-hfspace
get_statistics
@privetin/dataset-viewer
get_rows
@privetin/dataset-viewer
semantic_search_papers_on_huggingface
@jerpint/paperpal

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/privetin/dataset-viewer'

If you have feedback or need assistance with the MCP directory API, please join our Discord server