train_bertopic
Train a BERTopic model to automatically discover topics from semantic embeddings using UMAP and HDBSCAN clustering.
Instructions
Train BERTopic model using document embeddings. State-of-the-art topic modeling with UMAP + HDBSCAN clustering. Automatically discovers topics from semantic embeddings.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| num_topics | No | Target number of topics (default: 10) | |
| min_cluster_size | No | Minimum documents per topic (default: 5) |