get_leaderboard
Retrieve top-ranked models and methods for a specific dataset or benchmark, showing performance scores to compare research results across computer science and AI papers.
Instructions
Get the SOTA leaderboard for a dataset/benchmark (e.g. ImageNet, MMLU, GSM8K, SWE-bench). Returns top methods/models ranked by score. Only includes papers with absolute numeric results. Powered by 59k+ extracted benchmark results across 20k+ datasets.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| dataset | Yes | Dataset/benchmark name e.g. 'ImageNet', 'MMLU', 'GSM8K', 'CIFAR-10', 'SWE-bench verified' | |
| metric | No | Specific metric to filter by e.g. 'accuracy', 'F1', 'BLEU'. If omitted, returns all metrics for the dataset. | |
| limit | No | Max entries per metric (default 20) |