Semi Supervised YOLOv2
Summary
This project was carried out during an internship at IRIT, Toulouse, with Axel Carlier as tutor.
YOLOv2
The fisrt part of this project was about obtaining a functional and trainable model for image detection based on YOLOv2 1. This was done by extracting and adapting code snippets from an existing implementation to a Jupyter Notebook.
Results
I obtained, on the WildLife dataset, the following quantitative results :
YOLOv2 (30 epochs) | ||
---|---|---|
validation mAP | 0.65 | |
buffalo mAP | 0.59 | |
elephant mAP | 0.59 | |
rhino mAP | 0.77 | |
zebra mAP | 0.64 | |
training mAP | 0.73 | |
buffalo mAP | 0.85 | |
elephant mAP | 0.59 | |
rhino mAP | 0.89 | |
zebra mAP | 0.61 |
And qualitative results :



Semi-supervised Learning for Image Detection

The second part of this project was the implementation of a semi-supervised training for object detection 2 on this functional model. This affected the loss of the model ; initially YOLOv2 has the following loss :
But to in order take into account the pseudo-labels generated by the teacher, I modified the loss as follows :
Results
I obtained the following results :
Teacher (60 epochs) | Student (30 epochs) | StudentΒ² (30 epochs) | |
---|---|---|---|
training mAP | 0.79 | 0.82 | 0.81 |
validation mAP | 0.67 | 0.70 | 0.68 |
Showing proof of concept that the student performs better than the teacher, but also that using the student to train a new student (StudentΒ²
here) does not seem to make any improvement.
J. Redmon and A. Farhadi, βYOLO9000: Better, Faster, Stronger,β 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 6517-6525, https://doi.org/10.1109/CVPR.2017.690. β©οΈ
Sohn, K., Zhang, Z., Li, C., Zhang, H., Lee, C., & Pfister, T. (2020). A Simple Semi-Supervised Learning Framework for Object Detection. https://arxiv.org/abs/2005.04757. β©οΈ