Accuracy vs Latency Trade-offs: Choosing Between SSD MobileNet and EfficientDet Lite
THE PARETO FRONTIER
Every edge model sits on an accuracy-latency trade-off curve. MobileNetV2 at 5ms achieves 72% ImageNet accuracy. EfficientNet-Lite4 at 30ms achieves 80%. There is no model that is simultaneously fastest and most accurate. Your job is to find the point on this curve that meets your application requirements.
DETECTION MODEL COMPARISON
SSD MobileNetV2: 15-25ms inference, 22% COCO mAP. Good for real-time when accuracy is less critical (presence detection, counting).
EfficientDet-Lite0: 30-40ms, 26% mAP. Better accuracy, still real-time capable.
EfficientDet-Lite2: 60-80ms, 32% mAP. Near-cloud accuracy but sacrifices real-time.
YOLOv5n: 20-30ms, 28% mAP. Good balance for many applications.
DECISION FRAMEWORK
Start with your latency budget, then find the best accuracy within it. For 30fps video, budget 33ms. Subtract preprocessing (3ms) and postprocessing (5ms). You have 25ms for inference. EfficientDet-Lite0 or SSD-MobileNet fits; EfficientDet-Lite2 does not.
WHEN TO SACRIFICE ACCURACY
Prefer lower accuracy when: (1) task tolerates errors (suggestions vs safety-critical). (2) downstream processing can correct mistakes. (3) real-time response is essential. Prefer higher accuracy when: (1) errors are costly. (2) batch processing is acceptable. (3) thermal headroom exists.