search_datasets
Fuzzy-search ABS dataflows using natural language. Curated indicators like unemployment and GDP rank first for common queries.
Instructions
Fuzzy-search ABS dataflow names, descriptions, and keywords.
Use this when you don't know the exact dataset ID. The 10 curated dataflows (LF, CPI, ANA_AGG, etc.) get a relevance boost so common queries like "unemployment" or "gdp" return the right dataset at rank #1 — not one of ABS's 800+ census tables that mention these keywords incidentally.
Examples: # Discover which dataflow answers "what's NSW unemployment?" results = await search_datasets("unemployment") # → [{id: 'LF', name: 'Labour Force', is_curated: True}, ...]
# Broader topic exploration
results = await search_datasets("housing", limit=5)
# → top 5 housing-related dataflows, curated firstWhen to use: - You have a natural-language question and need to identify the dataset - You want to discover what ABS publishes on a topic - You're not sure if a topic has a plain-English (curated) mapping yet
Returns: List of DatasetSummary (id, name, description, is_curated), ranked by relevance. Curated dataflows surface above raw SDMX dataflows.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| query | Yes | Free-text search query. Matches against dataflow IDs, names, descriptions, and each curated YAML's search_keywords. Case-insensitive. | |
| limit | No | Maximum number of results to return, ranked by relevance. Curated dataflows get a +25 score bonus so they surface above ABS's ~800 census tables for common queries. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |