YOLO (2015), YOLOv2 (2016), and YOLOv3 (2018)

Understand the evolution of YOLO object detection architectures from the original YOLO to YOLOv3. Learn how improvements in backbone networks, anchor boxes, and multiscale training enhanced speed and accuracy. Compare YOLO with SSD and Faster R-CNN and grasp key architectural changes for effective one-stage detection.

We'll cover the following...

YOLO architecture
- Training strategy
YOLO vs. SSD
- Performance comparison
YOLOv2 architecture
YOLOv3 architecture

You Only Look Once (YOLO) is the most popular single-shot object detection model, primarily due to its never-ending improvement story. Even though the first version, called simply YOLO, was created before SSD architecture, it kept improving over the years. We will examine the long road the YOLO family achieved, along with the novelties and improvements each new version has.

Let’s start with the base architecture created in 2016.

YOLO architecture

As it is a single-shot detector, we already assume that the architecture achieves object detection at one step, likely as SSD. But when we check the architecture, we can see that it works in an even simpler way than SSD.

1.Before We Start

2.Basics of Convolutional Neural Networks

Project

3.Popular Neural Network Architectures for Image Classification

4.Using PyTorch for Image Classification

5.Model Deployment

Project

6.Basics of Object Detection

7.Two-Stage Object Detection Architectures

8.One-Stage Object Detection Architectures

9.YOLOv7 Model Train and Inference on Edge

10.Conclusion

11.Appendix

Project

YOLO (2015), YOLOv2 (2016), and YOLOv3 (2018)

YOLO architecture