generate_pca_visualization_report
Generate a PCA visualization HTML report that compares training and test data in 2D PCA space, highlighting model predictions, with an option to validate them against true labels.
Instructions
Generate a PCA visualization HTML report showing training and test data in 2D PCA space.
Creates an interactive scatter plot with:
- Training data (blue dots) - healthy baseline
- Test/prediction data (green = predicted healthy, red = predicted anomaly)
- PC1 vs PC2 axes with variance explained
- Hover information showing segment details and prediction status
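The projection behind such a plot can be sketched as follows. This is a minimal illustration, not the tool's actual implementation: it assumes NumPy is available, uses synthetic random features in place of real per-segment features, and the helper names `fit_pca_2d` and `project` are hypothetical. The PCA is fitted on the training data only, and the test data is projected onto the same two axes.

```python
import numpy as np

def fit_pca_2d(train_features):
    """Fit a 2-component PCA on training features via SVD."""
    X = np.asarray(train_features, dtype=float)
    mean = X.mean(axis=0)
    # SVD of the centered data; rows of vt are the principal directions.
    _, s, vt = np.linalg.svd(X - mean, full_matrices=False)
    # Variance along each component is proportional to the squared singular value.
    explained = (s ** 2) / np.sum(s ** 2)
    return mean, vt[:2], explained[:2]

def project(features, mean, components):
    """Project feature rows onto the fitted PC1/PC2 axes."""
    return (np.asarray(features, dtype=float) - mean) @ components.T

# Synthetic stand-in for per-segment feature vectors (not real bearing data).
rng = np.random.default_rng(0)
train = rng.normal(size=(50, 4))
test = rng.normal(loc=0.5, size=(10, 4))

mean, components, explained = fit_pca_2d(train)
train_2d = project(train, mean, components)  # would be drawn as blue dots
test_2d = project(test, mean, components)    # colored by predicted class
axis_labels = [f"PC{i + 1} ({ratio:.1%} variance)" for i, ratio in enumerate(explained)]
```

Fitting on the training set alone keeps the healthy baseline as the frame of reference, so test points that fall far from the training cluster stand out visually.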
**IMPORTANT**: Labels show MODEL PREDICTIONS, not ground truth. Use the `true_labels`
parameter to supply ground-truth labels so the report can show validation against them.
**Strategy**: Uses the same HTML report approach as the FFT/Envelope/ISO reports.
The report is saved to the reports/ directory so the LLM can reference it without consuming tokens.
Args:
model_name: Name of trained model (e.g., 'bearing_health_model')
test_signal_files: Optional list of signals to predict and visualize
true_labels: Optional dict mapping signal filenames to true labels.
Format: {"baseline_3.csv": "healthy", "InnerRaceFault_vload_6.csv": "faulty"}
When provided, legend shows both true and predicted labels for validation.
sampling_rate: Sampling rate (auto-detected from metadata if None)
segment_duration: Segment duration in seconds (default: 0.1s for ML)
overlap_ratio: Overlap ratio 0-1 (default: 0.5)
ctx: MCP context
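How `segment_duration` and `overlap_ratio` translate into segments can be illustrated with a small sketch. The function name `segment_bounds` is hypothetical, and the details are assumptions rather than the tool's documented behavior: segment length is taken as `sampling_rate * segment_duration` samples, the hop between segments as `segment_length * (1 - overlap_ratio)`, and an incomplete trailing segment is dropped.

```python
def segment_bounds(n_samples, sampling_rate, segment_duration=0.1, overlap_ratio=0.5):
    """Return (start, end) sample indices for overlapping fixed-length segments."""
    if not 0 <= overlap_ratio < 1:
        # An overlap of exactly 1 would give a zero hop, so this sketch rejects it.
        raise ValueError("overlap_ratio must be in [0, 1)")
    seg_len = int(round(sampling_rate * segment_duration))
    if seg_len < 1:
        raise ValueError("segment_duration is shorter than one sample")
    # Distance between consecutive segment starts, at least one sample.
    hop = max(1, int(round(seg_len * (1 - overlap_ratio))))
    # Assumption: trailing samples that do not fill a whole segment are dropped.
    return [(start, start + seg_len)
            for start in range(0, n_samples - seg_len + 1, hop)]

# One second of signal at 10 kHz with the documented defaults (0.1 s, 0.5 overlap).
bounds = segment_bounds(n_samples=10_000, sampling_rate=10_000)
```

With these numbers each segment spans 1000 samples and starts 500 samples after the previous one, so one second of signal yields 19 segments, each of which becomes one point in the scatter plot.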
Returns:
Dictionary with file path, metadata, and summary (includes validation metrics if true_labels provided)
Example (predictions only):
>>> generate_pca_visualization_report(
... model_name="bearing_health_model",
... test_signal_files=["real_test/baseline_3.csv", "real_test/InnerRaceFault_vload_6.csv"]
... )
Example (with validation):
>>> generate_pca_visualization_report(
... model_name="bearing_health_model",
... test_signal_files=["real_test/baseline_3.csv", "real_test/InnerRaceFault_vload_6.csv"],
... true_labels={"baseline_3.csv": "healthy", "InnerRaceFault_vload_6.csv": "faulty"}
... )
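Note that the validation example passes paths with a directory prefix in `test_signal_files` but bare filenames as `true_labels` keys. A small helper can keep the two in sync. This is only a sketch: `build_true_labels` is a hypothetical name, and whether the tool also accepts full paths as keys is not stated here.

```python
import os

def build_true_labels(labeled_paths):
    """Map each file's basename to its label, the key format shown for true_labels."""
    labels = {}
    for path, label in labeled_paths.items():
        name = os.path.basename(path)
        # Two files with the same basename but different labels would be ambiguous.
        if name in labels and labels[name] != label:
            raise ValueError(f"conflicting labels for {name}")
        labels[name] = label
    return labels

labeled_paths = {
    "real_test/baseline_3.csv": "healthy",
    "real_test/InnerRaceFault_vload_6.csv": "faulty",
}
test_signal_files = list(labeled_paths)          # full paths for the tool call
true_labels = build_true_labels(labeled_paths)   # basename keys for validation
```

The resulting `test_signal_files` and `true_labels` match the arguments used in the validation example above.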
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
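| model_name | Yes | Name of the trained model (e.g., 'bearing_health_model') | |
| test_signal_files | No | List of signals to predict and visualize | |
| true_labels | No | Dict mapping signal filenames to true labels, e.g. {"baseline_3.csv": "healthy"} | |
| sampling_rate | No | Sampling rate; auto-detected from metadata if None | |
| segment_duration | No | Segment duration in seconds | 0.1 |
| overlap_ratio | No | Overlap ratio, 0-1 | 0.5 |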
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |