tcai_meta_learning
Monitors meta-learning state by analyzing reward-prediction-error variance dynamics to determine if learning is converging or entering a novel regime.
Instructions
Meta-learning state (MetaLearningModule port): learning velocity from RPE-variance dynamics. velocity>0 ⇒ converging; noveltySpike ⇒ novel/confusing regime. Optionally inject an RPE sample.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| rpe | No | Inject a reward-prediction-error sample ∈ [−1,1] |