What is Business Metrics Correlation in ML Systems?
THE DISCONNECT PROBLEM
ML teams optimize for model metrics: accuracy, AUC, NDCG. Business teams care about revenue, conversion rate, customer retention. Without correlation, a model improvement (AUC +2%) might have no business impact—or even hurt business metrics due to unintended side effects.
This disconnect creates two problems. First, you cannot prioritize model work effectively. Should you improve click prediction accuracy or reduce latency? Without knowing which impacts revenue more, you are guessing. Second, you cannot justify ML investment. If you cannot show that a 5% AUC improvement generated $500K in additional revenue, ML becomes a cost center rather than a profit driver.
THE METRIC CHAIN
Model metrics → Proxy metrics → Business metrics form a causal chain. A recommendation model improves its NDCG score. This leads to better click-through rate (proxy metric). Higher CTR leads to more purchases (business metric). Each link in this chain has a correlation coefficient that determines how much improvement propagates.
The challenge: correlations are not constant. A 5% NDCG improvement might yield 3% CTR improvement in one context and 0.5% in another. Context includes user segment, product category, time of year, and competitive environment.
WHAT MAKES THIS HARD
Confounders: Multiple factors affect business metrics simultaneously. Did revenue increase because of the model or because of a marketing campaign? Isolating model impact requires careful experimental design.
Lag: Business metrics may lag model changes by days or weeks. A better recommendation model affects today clicks but next month purchases.