The Conformalization ("Calibration") Set¶

MAPIE is based on two types of techniques for measuring uncertainty in regression and classification:

In all cases, the training/conformalization process can be broken down as follows:

Train a model using the training set (or full dataset if cross-conformal).
Estimate conformity scores using the conformalization set (or full dataset if cross-conformal).
Predict target on test data to obtain prediction intervals/sets based on these conformity scores.

1. Split Conformal Predictions¶

Compute conformity scores ("conformalization") on a conformalization set not seen by the model during training.

Data Splitting

Use train_conformalize_test_split to obtain the different sets.

MAPIE then uses the conformity scores to estimate sets associated with the desired coverage on new data with strong theoretical guarantees.

Split conformal prefit

Split conformal split

MAPIE then combines all these elements in a way that provides prediction intervals on new data with strong theoretical guarantees.

Cross conformal