Skip to main content
Glama

lerobot_hf_search_datasets

Search robotics datasets by robot, format, task, and filters such as episode count, size, tags, simulation, and language conditioning.

Instructions

Search robotics datasets by robot, format, scale, size, task, and compatibility hints.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
queryNo
robotNo
formatNo
min_episodesNo
max_episodesNo
max_size_gbNo
tagsNo
taskNo
language_conditionedNo
simulationNo
demo_suitableNo
prefer_lerobotNo
include_forge_registryNo
include_hubNo
sortNodownloads
limitNo

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault
resultYes
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description bears full responsibility for behavioral disclosure. It fails to mention key traits such as the tool querying Hugging Face, the nature of results, rate limits, authentication requirements, or the impact of boolean flags like 'prefer_lerobot' or 'include_forge_registry'. The description is too minimal to provide adequate transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence with no redundant words. It is concise and front-loaded, but the brevity comes at the cost of completeness. Still, it earns a high score for efficiency.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has 16 parameters with 0% schema coverage, no output schema, and no annotations, the description is too short. It does not explain how filters combine, default values, result format, or how to use the output. The tool is complex but the description provides minimal context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It only mentions a subset of parameters (robot, format, task, scale/size, compatibility hints). Many parameters (e.g., query, language_conditioned, simulation, demo_suitable, sort, limit, boolean flags) are omitted. While it adds some meaning for the mentioned parameters, it is insufficient for the 16-parameter schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses a specific verb 'Search' and identifies the resource as 'robotics datasets'. It lists multiple filter criteria (robot, format, scale, size, task, compatibility hints), clearly distinguishing this tool from sibling tools like lerobot_hf_repo_info or lerobot_inspect_dataset_metadata.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for searching datasets by various criteria but does not explicitly state when to use this tool versus alternatives (e.g., lerobot_hf_repo_info for repo info, lerobot_inspect_dataset_metadata for metadata inspection). No exclusions or when-not-to-use guidance provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/noah-wardlow/lerobot-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server