ML-Powered Search & Ranking • Feature Engineering for RankingMedium⏱️ ~3 min
Hierarchical Feature Backoff and Cold Start Handling
New users and new items lack historical data, causing features like user Click Through Rate (CTR) or item conversion rate to be undefined or based on tiny samples with high variance. Over relying on historical engagement features creates a cold start problem: entities with no history receive poor scores and never get exposure, preventing them from accumulating the data needed to rank well. Production systems solve this through hierarchical backoff, where features fall back through progressively broader aggregations when entity level data is sparse.
For user features, backoff from user to segment to global. If a user has fewer than 50 impressions, do not use their personal CTR. Instead, use the CTR for their demographic segment, for example users in the same age range and geographic region. If the segment also has insufficient data, fall back to the global average CTR. Smooth transitions using Bayesian shrinkage: blend the user level estimate toward the segment prior with weight proportional to the user sample size. A user with 10 impressions might get 20 percent weight on personal CTR and 80 percent on segment CTR. A user with 1,000 impressions gets 95 percent personal and 5 percent segment.
For item features, backoff from item to category to global. A new product with zero sales uses category conversion rate. A product with 5 sales blends its observed rate with the category prior. Additionally, rely more heavily on content based features and embeddings that do not require behavioral history. Compute item embeddings from titles, descriptions, images, and attributes. New items immediately have embeddings that capture semantic similarity to existing items, allowing the ranker to generalize from related products.
Exploration is essential to escape cold start. Allocate slots in results specifically for new or underexplored entities, even if their predicted scores are lower. Airbnb reserves 10 to 15 percent of search results for listings with fewer than 10 bookings, ensuring new listings collect initial engagement data within days of going live. YouTube similarly boosts videos from channels with low view counts in a fraction of recommendations, preventing the platform from ossifying around established creators. This exploration cost is 2 to 4 percent immediate engagement, but substantially increases long term diversity and supply growth.
💡 Key Takeaways
•Hierarchical backoff prevents cold start by falling back from user to demographic segment to global average, with Bayesian shrinkage blending estimates weighted by sample size
•New users with fewer than 50 impressions use segment CTR instead of unreliable personal CTR, smoothly transitioning as data accumulates from 20 percent personal weight at 10 impressions to 95 percent at 1,000 impressions
•Item backoff chains from item to category to global: new products use category conversion rate until they collect 10 to 20 sales, then blend observed and prior rates
•Content based features and embeddings computed from titles, descriptions, and images allow new items to immediately leverage semantic similarity without requiring behavioral history
•Exploration slots reserve 10 to 15 percent of results for entities with sparse data: Airbnb ensures new listings collect engagement within days, YouTube boosts low view count videos to prevent creator ossification
•Exploration costs 2 to 4 percent immediate engagement but increases long term diversity and supply growth by preventing the system from locking onto only established entities
📌 Examples
Airbnb search reserves 10 to 15 percent of result slots for listings with fewer than 10 bookings, using content embeddings and location based priors. New listings receive initial bookings within 3 to 5 days instead of weeks without exploration
Amazon product ranking falls back from item conversion rate to category rate for products with fewer than 20 sales, smoothed with Bayesian shrinkage. This prevents new products from ranking at the bottom purely due to lack of history
YouTube boosts videos from channels with under 1,000 subscribers in 5 percent of recommendations, using video title and thumbnail embeddings to estimate quality. This exploration increased new channel growth by 20 percent over 6 months