copy Copy chevron-down
User Guide chevron-right 🔢 Input dataData for Classification How to define the data schema for classification.
To evaluate classification model performance, you must correctly map the input data schema.
To evaluate the classification performance, you need both true labels and prediction. Depending on the classification type (e.g., binary, multi-class, probabilistic), you have different options of how to pass the predictions.
Multiclass classification
Target: encoded labels, Preds: encoded labels + Optional[target_names].
If you pass the target names, they will appear on the visualizations.
You can also pass the target names as a dictionary:
or
Target: labels, Preds: labels.
Multiclass probabilistic classification
Target: labels, Preds: columns named after labels.
target
‘Versicolour’
‘Setosa’
‘Virginica’
Naming the columns after the labels is a requirement. You cannot pass a custom list.
Binary classification
Target: encoded labels, Preds: encoded labels + pos_label + Optional[target_names]
By default, Evidently expects the positive class to be labeled as ‘1’. If you have a different label, specify it explicitly.
If you pass the target names, they will appear on the visualizations.
Target: labels, Preds: labels + pos_label
Passing the name of the positive class is a requirement in this case.
Binary probabilistic classification
Target: labels, Preds: columns named after labels + pos_label
Passing the name of the positive class is a requirement in this case.
Target: labels, Preds: a column named like one of the labels + pos_label
Both naming the column after one of the labels and passing the name of the positive class are requirements.
Target: encoded labels, Preds: one column with any name + pos_label
If you pass the target names, they will appear on the visualizations.
Last updated 10 months ago