What is Query Understanding in Search Systems?
Why Raw Queries Fail
Users type "apple" but mean the company, the fruit, or the record label. They type "cheap flights NYC" when they want "inexpensive airfare to New York City." They misspell, use abbreviations, and omit context obvious to them but invisible to the system. Without query understanding, search returns literal keyword matches that miss 30-50% of relevant results and include 20-30% irrelevant ones.
The Three Core Tasks
Intent classification: Determine the query type (navigational, informational, transactional) to route to appropriate backends. Entity extraction: Identify and link mentions to known entities ("NYC" → New York City, population 8.3M, coordinates 40.7°N). Query rewriting: Expand, correct, or reformulate queries to improve recall ("ML" → "machine learning") or precision (adding implicit filters).
Impact on Search Quality
Good query understanding improves relevance metrics by 15-30%. It reduces zero-result queries by 40-60% through spell correction and synonym expansion. It enables personalization by connecting queries to user context. Without it, even perfect ranking algorithms fail because they rank the wrong candidate set.