Statistical Metrics for Prediction Drift Detection
DISTRIBUTIONAL DISTANCE METRICS
PSI (Population Stability Index): Same metric used for data drift. Compare baseline prediction distribution to current distribution. PSI > 0.25 indicates significant prediction drift. Works well because it is interpretable and has established thresholds.
Jensen-Shannon Divergence: Symmetric measure of similarity between distributions. Bounded between 0 (identical) and 1 (completely different). More sensitive to tail changes than PSI. Useful when extreme predictions matter.
Wasserstein Distance: Measures the minimum cost to transform one distribution into another. Captures shape changes that PSI might miss. More computationally expensive.
SUMMARY STATISTICS
Simpler than full distributional metrics but less sensitive:
Mean shift: Track prediction mean over time. Alert on significant deviation. Simple and fast but misses distribution shape changes.
Variance change: Track prediction variance. Narrowing variance may indicate model is over-confident. Widening may indicate uncertainty.
Percentile monitoring: Track p10, p50, p90 separately. Captures both central tendency and tail behavior. Alert when any percentile shifts beyond threshold.
CHOOSING METRICS
Start with PSI for its interpretability. Add percentile monitoring for tail sensitivity. Use JS divergence or Wasserstein only if simpler metrics miss important drift in your domain.
Multi-metric monitoring provides defense in depth. Different metrics catch different types of drift. Use 2-3 complementary metrics rather than relying on one.