What is Power Analysis and Why Does Sample Size Matter?
The Core Relationship
The formula is roughly: N ∝ variance × (z_alpha + z_beta)² / MDE². Halving the MDE quadruples required sample size. Increasing power from 80% to 90% adds ~30% more sample. These are expensive trade-offs.
For a conversion experiment: baseline 5%, MDE 10% relative lift (detecting 5.5% vs 5.0%), alpha 5%, power 80% requires ~31,000 users per arm, or 62,000 total. At 10,000 daily users, thats 6-7 days minimum.
Underpowered Experiments
Running underpowered experiments wastes resources. If true effect is 3% but your MDE is 5%, you have only 30-40% chance of detecting it. Worse, any significant result from underpowered experiments is likely inflated (winners curse).
Pre-Experiment Planning
Before launching, calculate: required sample for your MDE, daily traffic to experiment surface, expected runtime. If runtime exceeds 4-6 weeks, reconsider: accept larger MDE? Reduce metric variance? Use higher-traffic surface?