cluster_documents_hdbscan
Group documents by content similarity with HDBSCAN. Automatically detect clusters and outliers in datasets of varying density.
Instructions
Cluster documents using HDBSCAN (hierarchical density-based) algorithm. Advanced clustering that handles varying densities. Automatically discovers clusters and outliers.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| min_cluster_size | No | Minimum samples per cluster (default: 5) | |
| min_samples | No | Minimum samples in neighborhood (optional) |