- Published on
Just a few days ago, a new model of the Yolo family was presented. Its main trick is that, unlike its older brothers, it is able to recognize virtually any object in the image (which are of interest to a person) without prior training and it does it in real-time mode! Sounds pretty good, doesn't it? In this article we will try to understand what magic is hiding inside the new architecture. I would like to say that this article will be more of an introductory one, so I recommend those who like strict math to read the original article after reading it. But before we start the review, let's learn/remember the main types of object detection tasks in an image.