What is Storage Tiering and How Does it Differ from Caching?
The Core Problem
Storage costs and access speeds are inversely related. Fast storage (SSD, NVMe) costs $0.08-0.15/GB/month. Slow storage (archive, tape) costs $0.004/GB/month. A 20x cost difference. Most data follows the 80/20 rule: 80% of accesses go to 20% of data. Paying premium storage prices for the rarely accessed 80% wastes significant budget. Tiering automatically moves data to appropriate storage classes, optimizing the cost and performance balance.
Tiering vs Caching
Caching and tiering seem similar but solve different problems. Caching is about read latency. Data exists in slow storage and is temporarily copied to fast storage for speed. Cache misses hit the slow backend. Data is never deleted from primary storage. Tiering is about storage cost. Data is moved (not copied) between storage classes. The original location becomes empty. Access latency varies based on current tier. The key difference: with caching, all data lives in one tier with a fast cache in front. With tiering, data lives in exactly one tier at a time, selected based on access patterns.
The Three Tier Model
Hot tier provides immediate access with sub-millisecond latency. Used for active data accessed multiple times per day. Highest storage cost but no retrieval fees. Warm tier provides access within seconds to minutes. Used for data accessed occasionally, perhaps once per week or month. Lower storage cost with small retrieval fees. Cold tier may require hours for retrieval. Used for compliance archives, disaster recovery, and legal hold data. Lowest storage cost with significant retrieval fees. Some systems add frozen tier with 12+ hour retrieval for long term retention.
When Tiering Matters
Tiering becomes economically significant when: total storage exceeds 10TB+, data has clear age based access patterns (older data accessed less), access patterns are predictable, and the organization can tolerate retrieval delays for cold data. For a 100TB dataset where 80% is cold, moving cold data from hot to archive tier saves roughly $8,000/month. The complexity of tiering automation pays for itself quickly at scale.