Principle: DistrictDataLabs Yellowbrick Confusion Matrix Visualization
| Knowledge Sources | |
|---|---|
| Domains | Machine_Learning, Classification, Model_Evaluation |
| Last Updated | 2026-02-08 00:00 GMT |
Overview
Confusion matrix visualization is the practice of rendering the cross-tabulation of true versus predicted class labels as a color-coded heatmap to expose classification errors and their distribution across classes.
Description
A confusion matrix is a square matrix of size $|C| \times |C|$, where $C$ is the set of classes. Each cell $M_{ij}$ records the count of instances whose true class is $c_i$ and whose predicted class is $c_j$. The diagonal entries $M_{ii}$ represent correct predictions (true positives for each class), while off-diagonal entries represent misclassifications. Visualizing this matrix as a heatmap, with color intensity proportional to cell values, transforms a dense numeric table into an immediately interpretable diagnostic tool.
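The cell definition above can be sketched in a few lines of plain Python (the labels, class ordering, and function name here are illustrative, not taken from any particular library):

```python
from collections import Counter

def confusion_matrix(y_true, y_pred, classes):
    """Build a |C| x |C| matrix: rows are true classes, columns are predicted."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in classes] for t in classes]

# Illustrative labels: rows/columns ordered as ["bird", "cat", "dog"]
y_true = ["cat", "cat", "dog", "dog", "dog", "bird"]
y_pred = ["cat", "dog", "dog", "dog", "cat", "bird"]
M = confusion_matrix(y_true, y_pred, classes=["bird", "cat", "dog"])
# Diagonal cells hold correct predictions; off-diagonal cells are confusions,
# e.g. M[2][1] counts dogs misclassified as cats.
```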
Confusion matrix visualization solves the problem of quickly identifying systematic misclassification patterns. For example, if two classes are frequently confused with each other, the corresponding off-diagonal cells will be prominently colored. The visualization can display either raw counts or percentages of true class totals, where percentage mode normalizes each row by the total number of instances belonging to that true class, making it easier to compare error rates across classes of different sizes.
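The row normalization behind percentage mode can be sketched as follows (a plain-Python illustration with made-up counts; empty rows are guarded against division by zero):

```python
def to_percent(matrix):
    """Normalize each row by its sum, giving per-true-class percentages."""
    result = []
    for row in matrix:
        total = sum(row)
        result.append([100.0 * v / total if total else 0.0 for v in row])
    return result

M = [[8, 2], [3, 7]]   # raw counts; each true class has 10 instances here
P = to_percent(M)      # each row now sums to 100
```

Because each row sums to 100, the off-diagonal cells read directly as per-class error rates, regardless of how unbalanced the class sizes are.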
This technique is a fundamental part of the classification evaluation workflow, typically applied after model training and prediction on a test set. It provides complementary information to aggregate metrics like accuracy, precision, and recall by revealing the pairwise structure of errors between classes.
Usage
Use confusion matrix visualization whenever you evaluate a classifier and need to understand not just whether errors occur, but which specific classes are being confused. It is valuable for both binary and multiclass problems, and is especially informative when classes have similar characteristics that may cause systematic misclassification.
Theoretical Basis
Given a set of true labels $y = \{y_1, \ldots, y_n\}$ and predicted labels $\hat{y} = \{\hat{y}_1, \ldots, \hat{y}_n\}$, the confusion matrix $M$ is defined as:

$$M_{ij} = \left|\{k : y_k = c_i \text{ and } \hat{y}_k = c_j\}\right|$$

where $c_1, \ldots, c_{|C|}$ are the class labels. The diagonal elements $M_{ii}$ represent correct classifications for class $c_i$.
When displaying as percentages of the true class, each row is normalized by the row sum:

$$P_{ij} = \frac{M_{ij}}{\sum_{j'} M_{ij'}} \times 100\%$$
The global accuracy can be derived from the confusion matrix as:

$$\text{Accuracy} = \frac{\sum_i M_{ii}}{\sum_i \sum_j M_{ij}}$$
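As a concrete check of this formula (plain Python, illustrative numbers):

```python
def accuracy(matrix):
    """Sum of diagonal counts divided by the sum of all cells."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total

M = [[8, 2], [3, 7]]
acc = accuracy(M)   # (8 + 7) / 20 = 0.75
```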
Individual class metrics can also be extracted. For class $c_i$:
- True Positives: $TP_i = M_{ii}$
- False Positives: $FP_i = \sum_{k \neq i} M_{ki}$
- False Negatives: $FN_i = \sum_{k \neq i} M_{ik}$
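These per-class quantities follow directly from row and column sums. A small sketch, using the same convention as above (rows are true classes, columns are predicted):

```python
def class_metrics(matrix, i):
    """TP, FP, FN for class index i in a true-rows/predicted-columns matrix."""
    n = len(matrix)
    tp = matrix[i][i]
    fp = sum(matrix[k][i] for k in range(n) if k != i)  # column i minus diagonal
    fn = sum(matrix[i][k] for k in range(n) if k != i)  # row i minus diagonal
    return tp, fp, fn

M = [[8, 2], [3, 7]]
tp, fp, fn = class_metrics(M, 0)   # tp=8, fp=3, fn=2
```

From these, per-class precision ($TP / (TP + FP)$) and recall ($TP / (TP + FN)$) follow immediately, linking the matrix back to the aggregate metrics mentioned earlier.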