Confidence Intervals: Precision and Practical Interpretation
WHAT CONFIDENCE INTERVALS MEAN
A 95% confidence interval means the procedure captures the true value 95% of the time across repeated samples. It does NOT mean there is a 95% probability that this specific interval contains the true value. The interval is fixed; the true parameter is fixed; the 95% refers to the long run frequency of the procedure.
INTERVALS SHOW BOTH SIGNIFICANCE AND MAGNITUDE
Unlike p-values, confidence intervals convey both whether an effect exists and how large it might be. CTR lift of 0.10 percentage points with 95% CI [0.06, 0.14] tells you: (1) effect is significant (interval excludes zero), (2) effect is meaningful (lower bound 0.06 is still useful). If the CI were [0.01, 0.19], the effect is significant but might be negligibly small.
CALCULATING INTERVALS
For large samples, use z critical value: 1.96 for 95% CI. For small samples, use t critical value with n-1 degrees of freedom. Formula: CI = mean ± z × (std_dev / sqrt(n)). For proportions near 0% or 100%, Wilson or Agresti-Coull intervals have better coverage than normal approximation.
BOOTSTRAP FOR HEAVY TAILS
Heavy-tailed metrics like revenue or p95 latency violate normal assumptions. Bootstrap intervals resample the data thousands of times to estimate the sampling distribution empirically. This handles outliers gracefully but costs 10-100x more computation than parametric methods.