get_rows
Retrieve paginated rows from a Hugging Face dataset by specifying the dataset identifier, configuration, split, and page number to access specific data segments for analysis or processing.
Instructions
Get paginated rows from a Hugging Face dataset
Input Schema
Name | Required | Description | Default |
---|---|---|---|
auth_token | No | Hugging Face auth token for private/gated datasets | |
config | Yes | Dataset configuration/subset name. Use get_info to list available configs | |
dataset | Yes | Hugging Face dataset identifier in the format owner/dataset | |
page | No | Page number (0-based), returns 100 rows per page | |
split | Yes | Dataset split name. Splits partition the data for training/evaluation |
Input Schema (JSON Schema)
{
"properties": {
"auth_token": {
"description": "Hugging Face auth token for private/gated datasets",
"optional": true,
"type": "string"
},
"config": {
"description": "Dataset configuration/subset name. Use get_info to list available configs",
"examples": [
"default",
"en",
"es"
],
"type": "string"
},
"dataset": {
"description": "Hugging Face dataset identifier in the format owner/dataset",
"examples": [
"ylecun/mnist",
"stanfordnlp/imdb"
],
"pattern": "^[^/]+/[^/]+$",
"type": "string"
},
"page": {
"default": 0,
"description": "Page number (0-based), returns 100 rows per page",
"type": "integer"
},
"split": {
"description": "Dataset split name. Splits partition the data for training/evaluation",
"examples": [
"train",
"validation",
"test"
],
"type": "string"
}
},
"required": [
"dataset",
"config",
"split"
],
"type": "object"
}