Dataset Card Scan
dataset_card_scanScan directories for ML dataset card metadata, provenance, and detect PII/PHI in CSV, JSON, and JSONL files.
Instructions
Scan a directory for ML dataset card metadata, provenance, and optionally PII/PHI content.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| directory | Yes | Directory path to scan for dataset cards (dataset_info.json, README.md frontmatter, .dvc files). | |
| scan_pii | No | Also scan CSV/JSON/JSONL file contents for PII/PHI (emails, SSNs, credit cards, medical data). Default false. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |