# Evals With Explanations

{% embed url="https://colab.research.google.com/github/Arize-ai/phoenix/blob/main/tutorials/evals/evaluate_relevance_classifications.ipynb?#scrollTo=zUtDrplhZZJu&uniqifier=1" %}
See "Classifications with Explanations Section"
{% endembed %}

In many cases it can be hard to understand why an LLM responds in a specific way. The explanation feature of Phoenix allows you to get an Eval output and an explanation from the LLM at the same time. We have found this incredibly useful for debugging LLM Evals.

```python
from phoenix.evals import (
    RAG_RELEVANCY_PROMPT_RAILS_MAP,
    RAG_RELEVANCY_PROMPT_TEMPLATE,
    OpenAIModel,
    download_benchmark_dataset,
    llm_classify,
)

# Download a benchmark dataset of query/document pairs to evaluate
# (any DataFrame whose columns match the template's variables works here).
df = download_benchmark_dataset(
    task="binary-relevance-classification", dataset_name="wiki_qa-train"
)

model = OpenAIModel(
    model_name="gpt-4",
    temperature=0.0,
)

# The rails are used to hold the output to specific values based on the template.
# They remove text such as ",,," or "..." and ensure the binary value expected
# from the template is returned.
rails = list(RAG_RELEVANCY_PROMPT_RAILS_MAP.values())

relevance_classifications = llm_classify(
    dataframe=df,
    template=RAG_RELEVANCY_PROMPT_TEMPLATE,
    model=model,
    rails=rails,
    provide_explanation=True,  # set to get an explanation out
)
# relevance_classifications is a DataFrame with columns 'label' and 'explanation'
```

The `provide_explanation` flag can be set with any of the built-in templates or with your own custom templates. The example above is from a relevance Eval.
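Once the classifications come back, each row's explanation can be read alongside its label. A minimal sketch of inspecting the output, assuming `relevance_classifications` and `rails` from the snippet above:

```python
# Print the first few labels together with the LLM's reasoning for each one.
for idx, row in relevance_classifications.head(3).iterrows():
    print(f"row {idx}: label={row['label']!r}")
    print(f"  explanation: {row['explanation']}")

# Labels are constrained to the rails, so you can slice on them directly,
# e.g. to review only the explanations for one class:
subset = relevance_classifications[relevance_classifications["label"] == rails[0]]
print(subset["explanation"].head())
```

Reading the explanations for a single class at a time is a quick way to spot a miscalibrated template: if the reasoning consistently contradicts the label, the prompt (not the data) is usually the problem.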