evaluate_pipeline
Run RAGAS-style evaluation on a RAG pipeline to measure answer relevance, faithfulness, and context recall across test queries.
Instructions
Run RAGAS-style evaluation on the RAG pipeline. Measures answer relevance, faithfulness, and context recall across a test query set.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| queries | No | Test queries to evaluate. Uses default set if not provided. | |
| report_format | No | | summary |
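
As a rough illustration of the schema above, the following sketch builds an arguments object for an `evaluate_pipeline` call. The helper name `build_arguments` is hypothetical. The sketch also assumes `queries` is a list of strings and `report_format` is a string, neither of which the schema states. Both fields are optional, so unset fields are left out and the tool can fall back to its own defaults.

```python
import json
from typing import List, Optional


def build_arguments(
    queries: Optional[List[str]] = None,
    report_format: Optional[str] = None,
) -> dict:
    """Build an arguments object for evaluate_pipeline (hypothetical helper).

    Fields left as None are omitted so the tool applies its defaults:
    the default query set, and the "summary" report format.
    """
    args = {}
    if queries is not None:
        # Assumption: queries is a list of plain-text test questions.
        args["queries"] = list(queries)
    if report_format is not None:
        # Assumption: report_format is a string such as "summary".
        args["report_format"] = report_format
    return args


if __name__ == "__main__":
    # Example payload with both optional fields set explicitly.
    payload = build_arguments(
        queries=["How does the retriever rank documents?"],
        report_format="summary",
    )
    print(json.dumps(payload, indent=2))
```

Calling `build_arguments()` with no arguments yields an empty object, which corresponds to running the evaluation on the default query set with the default report format.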