# Inferences Concepts

This section introduces _inferences_ and _schemas_, the starting concepts needed to use Phoenix with inferences.

{% hint style="info" %}
* For comprehensive descriptions of `phoenix.Inferences` and `phoenix.Schema`, see the [API reference](https://github.com/Arize-ai/phoenix/blob/main/docs/inferences/broken-reference/README.md).
* For tips on creating your own Phoenix inferences and schemas, see the how-to guide.
{% endhint %}

## Inferences

_Phoenix inferences_ are instances of `phoenix.Inferences` that contain three pieces of information:

* The data itself (a pandas dataframe)
* A [schema](https://github.com/Arize-ai/phoenix/blob/main/docs/inferences/broken-reference/README.md) (a `phoenix.Schema` instance) that describes the columns of your dataframe
* A name that appears in the UI

For example, if you have a dataframe `prod_df` that is described by a schema `prod_schema`, you can define inferences `prod_ds` with

```python
prod_ds = px.Inferences(prod_df, prod_schema, "production")
```

If you launch Phoenix with these inferences, you will see inferences named "production" in the UI.

### How many inferences do I need?

> You can launch Phoenix with zero, one, or two sets of inferences.

With no inferences, Phoenix runs in the background and collects trace data emitted by your instrumented LLM application. With a single inference set, Phoenix provides insights into model performance and data quality. With two inference sets, Phoenix compares your inferences and gives insights into drift in addition to model performance and data quality, or helps you debug your retrieval-augmented generation applications.

<table data-view="cards"><thead><tr><th align="center"></th><th></th></tr></thead><tbody><tr><td align="center"><strong>Use Zero Inference Sets When:</strong></td><td><ul><li>You want to run Phoenix in the background to collect trace data from your instrumented LLM application.</li></ul></td></tr><tr><td align="center"><strong>Use a Single Inference Set When:</strong></td><td><ul><li>You have only a single cohort of data, e.g., only training data.</li><li>You care about model performance and data quality, but not drift.</li></ul></td></tr><tr><td align="center"><strong>Use Two Inference Sets When:</strong></td><td><ul><li>You want to compare cohorts of data, e.g., training vs. production.</li><li>You care about drift in addition to model performance and data quality.</li><li>You have corpus data for information retrieval. See Corpus Data.</li></ul></td></tr></tbody></table>

### Which inference set is which?

> Your reference inferences provide a baseline against which to compare your primary inferences.

To compare two inference sets with Phoenix, you must select one inference set as _primary_ and one to serve as a _reference_. As the name suggests, your primary inference set contains the data you care about most, perhaps because your model's performance on this data directly affects your customers or users. Your reference inferences, in contrast, are usually of secondary importance and serve as a baseline against which to compare your primary inferences.

Very often, your primary inferences will contain production data and your reference inferences will contain training data. However, that's not always the case; you can imagine a scenario where you want to check your test set for drift relative to your training data, or use your test set as a baseline against which to compare your production data.
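To make the primary/reference distinction concrete, here is a minimal, self-contained sketch of launching Phoenix with two inference sets. The dataframes and column names below are hypothetical stand-ins for real training and production data:

```python
import pandas as pd
import phoenix as px

# Hypothetical dataframes sharing an identical format; in practice these
# would be your real training and production data.
train_df = pd.DataFrame({"feature": [1.0, 2.0], "prediction": ["a", "b"]})
prod_df = pd.DataFrame({"feature": [1.5, 2.5], "prediction": ["a", "a"]})

schema = px.Schema(
    feature_column_names=["feature"],
    prediction_label_column_name="prediction",
)

# The primary inferences hold the data you care about most; the reference
# inferences serve as the baseline for drift comparisons.
prod_ds = px.Inferences(prod_df, schema, "production")
train_ds = px.Inferences(train_df, schema, "training")

session = px.launch_app(primary=prod_ds, reference=train_ds)
```

Swapping which set is passed as `primary` changes which data Phoenix treats as the baseline, not how the data is loaded.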
When choosing primary and reference inference sets, it matters less _where_ your data comes from than _how important_ the data is and _what role_ the data serves relative to your other data.

### Corpus Inference Set (Information Retrieval)

The only difference for corpus inferences is that they need a separate schema, because they have a different set of columns compared to the model data. See the schema section for more details.

## Schemas

A _Phoenix schema_ is an instance of `phoenix.Schema` that maps the columns of your dataframe to fields that Phoenix expects and understands. Use your schema to tell Phoenix what the data in your dataframe means.

For example, if you have a dataframe containing Fisher's Iris data that looks like this:

| sepal\_length | sepal\_width | petal\_length | petal\_width | target     | prediction |
| ------------- | ------------ | ------------- | ------------ | ---------- | ---------- |
| 7.7           | 3.0          | 6.1           | 2.3          | virginica  | versicolor |
| 5.4           | 3.9          | 1.7           | 0.4          | setosa     | setosa     |
| 6.3           | 3.3          | 4.7           | 1.6          | versicolor | versicolor |
| 6.2           | 3.4          | 5.4           | 2.3          | virginica  | setosa     |
| 5.8           | 2.7          | 5.1           | 1.9          | virginica  | virginica  |

your schema might look like this:

```python
schema = px.Schema(
    feature_column_names=[
        "sepal_length",
        "sepal_width",
        "petal_length",
        "petal_width",
    ],
    actual_label_column_name="target",
    prediction_label_column_name="prediction",
)
```

### How many schemas do I need?

> Usually one, sometimes two.

Each inference set needs a schema. If your primary and reference inferences have the same format, then you only need one schema. For example, if you have dataframes `train_df` and `prod_df` that share an identical format described by a schema named `schema`, then you can define inference sets `train_ds` and `prod_ds` with

```python
train_ds = px.Inferences(train_df, schema, "training")
prod_ds = px.Inferences(prod_df, schema, "production")
```

Sometimes, you'll encounter scenarios where the formats of your primary and reference inference sets differ. For example, you'll need two schemas if:

* Your production data has timestamps indicating the time at which an inference was made, but your training data does not.
* Your training data has ground truth (what we call _actuals_ in Phoenix nomenclature), but your production data does not.
* A new version of your model has a differing set of features from a previous version.

In cases like these, you'll need to define two schemas, one for each inference set. For example, if you have dataframes `train_df` and `prod_df` that are described by schemas `train_schema` and `prod_schema`, respectively, then you can define inference sets `train_ds` and `prod_ds` with

```python
train_ds = px.Inferences(train_df, train_schema, "training")
prod_ds = px.Inferences(prod_df, prod_schema, "production")
```
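As a concrete sketch of the two-schema case, the production schema below declares a timestamp column that the training schema lacks, while the training schema declares an actuals column that production doesn't yet have. The column names, including `prediction_ts`, are hypothetical:

```python
import phoenix as px

# Training data has ground truth ("actuals") but no timestamps.
train_schema = px.Schema(
    feature_column_names=["sepal_length", "sepal_width"],
    prediction_label_column_name="prediction",
    actual_label_column_name="target",
)

# Production data has timestamps but no ground truth yet.
prod_schema = px.Schema(
    timestamp_column_name="prediction_ts",
    feature_column_names=["sepal_length", "sepal_width"],
    prediction_label_column_name="prediction",
)
```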
#### Schema for Corpus Inferences (Information Retrieval)

A corpus inference set, containing documents for information retrieval, typically has a different set of columns than those found in the model data from either production or training, and requires a separate schema. Below is an example schema for a corpus inference set with three columns: the `id`, `text`, and `embedding` for each document in the corpus.

{% code fullWidth="false" %}
```python
corpus_schema = px.Schema(
    id_column_name="id",
    document_column_names=px.EmbeddingColumnNames(
        vector_column_name="embedding",
        raw_data_column_name="text",
    ),
)
corpus_ds = px.Inferences(corpus_df, corpus_schema)
```
{% endcode %}
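With the corpus inferences defined, you can pass them to Phoenix alongside your primary inferences when debugging retrieval. A minimal sketch, assuming a hypothetical `query_ds` holding your application's queries and retrievals:

```python
import phoenix as px

# query_ds is a hypothetical phoenix.Inferences instance containing your
# application's queries; corpus_ds is the corpus inference set defined above.
session = px.launch_app(primary=query_ds, corpus=corpus_ds)
```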
