6533b7defe1ef96bd1276534

RESEARCH PRODUCT

DenseYOLO: Yet Faster, Lighter and More Accurate YOLO

El-bay BourennaneSolomon Negussie Tesema

subject

Class (computer programming)Computer sciencebusiness.industry05 social sciencesDetectorFunction (mathematics)010501 environmental sciencesObject (computer science)01 natural sciencesObject detectionImage (mathematics)Task (computing)Simple (abstract algebra)0502 economics and businessComputer visionArtificial intelligence050207 economicsbusiness0105 earth and related environmental sciences

description

As much as an object detector should be accurate, it should be light and fast as well. However, current object detectors tend to be either inaccurate when lightweight or very slow and heavy when accurate. Accordingly, determining tolerable tradeoff between speed and accuracy of an object detector is not a simple task. One of the object detectors that have commendable balance of speed and accuracy is YOLOv2. YOLOv2 performs detection by dividing an input image into grids and training each grid cell to predict certain number of objects. In this paper we propose a new approach to even make YOLOv2 more fast and accurate. We re-purpose YOLOv2 into a dense object detector by using fine-grained grids, where a cell predicts only one object and its corresponding class and objectness confidence score. Our approach also trains the system to learn to pick a best fitting anchor box instead of the fixed anchor assignment during ground-truth annotation as used by YOLOv2. We will also introduce a new loss function to balance the overwhelming imbalance between the number of grids responsible of detecting an object and those that should not.

https://doi.org/10.1109/iemcon51383.2020.9284923