Why do we need NMS?

Object detection models often predict multiple bounding boxes for a single object in an image. However, the final output should ideally have only one bounding box per object. To achieve this, a technique called non-maximum suppression (NMS) is used.

In the image below, there are three bounding boxes predicted for one person with different confidence scores. On visual inspection, for our final output, we would prefer the green box because it fits better as compared to the other two boxes and has the highest confidence score.

Press + to interact

Introduction to Object Detection

Fundamentals for Understanding YOLO

Building a System for Safety Helmet Detection Based on YOLOv5

YOLOv7 Architecture

Improving Model Performance: Handling Overfitting/Underfitting

Dealing With Small Datasets In ML

Pre-Trained Models, Fine-Tuning, and Hyperparameters in OD

Sun Detection Using YOLOv8

Conclusion

Understanding NMS (Non-Maximum Suppression)

Why do we need NMS?

How does NMS work?