Definition
Cold start is the inability to make quality recommendations when you lack interaction data. New users have no history for personalization. New items have no engagement signals. The recommendation system falls back to generic or random suggestions.
The Three Cold Start Types
User cold start: A new user signs up. You know nothing about their preferences. Collaborative filtering cannot find similar users. Content-based has no profile to match against. First-session experience is critical for retention, yet you have zero signal.
Item cold start: A new product is added to catalog. No users have interacted with it. Collaborative models ignore it. Even if the item would be highly relevant to many users, it gets zero exposure. This kills long-tail discovery.
System cold start: Launching a new recommendation system with no historical data. Everything is cold. This is the hardest case, requiring bootstrapping strategies like importing data from related systems.
Why Cold Start Hurts Business
New user retention correlates strongly with first-session relevance. If recommendations are random, users leave. Studies show 20-40% higher churn in the first week for users who experienced poor initial recommendations. New items that never get exposure become dead inventory.
💡 Key Insight: Cold start is not an edge case. For growing platforms, 10-30% of users and 5-15% of items may be cold at any time. Treating it as an afterthought means degraded experience for a significant portion of traffic.
✓User cold start affects new and anonymous users who lack interaction history, preventing personalized recommendations until sufficient behavioral data accumulates
✓Item cold start impacts new catalog entries with zero engagement, creating a catch 22 where items need exposure to get interactions but need interactions to get exposure
✓Context cold start occurs when launching in new geographic markets or product verticals where existing interaction distributions and priors don't transfer
✓Collaborative filtering methods, which typically perform best at scale, are unusable in cold start scenarios because they fundamentally require interaction matrices that don't yet exist
✓Production impact is measurable: reducing time to first good recommendation from days to minutes improves new user activation rates and prevents new inventory from being permanently suppressed