sklearn roc_auc_score with multi_class==”ovr” should have None average available

Question

I&#8217;m trying to compute the AUC score for a multiclass problem using the sklearn&#8217;s roc_auc_score() function. I have prediction matrix of shape [n_samples,n_classes] and a ground truth vector of shape [n_samples], named np_pred and np_label respectively. What I&#8217;m trying to achieve is the set of…

Accepted Answer

As you already know, right now sklearn multiclass ROC AUC only handles the macro and weighted averages. But it can be implemented as it can then individually return the scores for each class.Theoretically speaking, you could implement OVR and calculate per-class roc_auc_score, as:roc = {label: [] for label in multi_class_series.unique()}for label in multi_class_series.unique():    selected_classifier.fit(train_set_dataframe, train_class == label)    predictions_proba = selected_classifier.predict_proba(test_set_dataframe)    roc[label] += roc_auc_score(test_class, predictions_proba[:,1])

Advertisement

Answer