What is a bounding box?

A bounding box is simply a rectangle drawn around an object to identify the exact location of the object in an image. In OD tasks, it also helps us identify what kind of object is present in an image.

How are coordinates represented?

Mathematically, a bounding box is represented as a tensor consisting of information related to the location of the object and confidence scores. In OD tasks, two formats are widely followed to represent location:

( $x_{min}$ , $y_{min}$ , $x_{max}$ , $y_{max}$ ): They are also known as top-left and bottom-right coordinates.
( $x_{center}$ , $y_{center}$ , $w$ , $h$ ): They are the center coordinates of an image, along with the width and height of the image.

Introduction to Object Detection

Fundamentals for Understanding YOLO

Building a System for Safety Helmet Detection Based on YOLOv5

YOLOv7 Architecture

Improving Model Performance: Handling Overfitting/Underfitting

Dealing With Small Datasets In ML

Pre-Trained Models, Fine-Tuning, and Hyperparameters in OD

Sun Detection Using YOLOv8

Conclusion

Bounding Box Predictions

What is a bounding box?

How are coordinates represented?

Time to code!

Input