/human-detection

Primary LanguageJupyter Notebook

Human video detection

Given a youtube commercial video, use any existing algorithms to generate a video that shows the existence of humans within the video by drawing boxes around them for each frame.

Exemple video : https://youtu.be/h4s0llOpKrU

MISS DIOR – The new Eau de Parfum

Exemple result :

Low confidence is choose according low false positive detection for person class.

name confidence precision recall mAP F1
yolov3_result.json 0.25 0.90 0.80 0.79 0.84
yolov4-tiny_result.json 0.25 0.90 0.69 0.69 0.79
yolov5s_result.json 0.25 0.92 0.78 0.75 0.84
yolov4-csp_result.json 0.25 0.94 0.86 0.86 0.90
yolov5m_result.json 0.25 0.93 0.85 0.84 0.89
yolov4x-mish_result.json 0.25 0.95 0.87 0.87 0.91
yolov4_result.json 0.25 0.94 0.87 0.87 0.90
yolov5l_result.json 0.25 0.94 0.85 0.84 0.89
yolov5x_result.json 0.25 0.93 0.84 0.84 0.88

Other results :

How to reproduce result

Note : All the experimentation was done under colab research notebook.

Reproduce online with Colab Research Notebook (Open Acces)

To reproduce the result, you can follow the specified notebook :

There are also saved locally in the notebook directory.

Appendix : Methodology history

1. Methods exploration

Try rapidly several models to validate proof of concept

2. Reading ressources :

3. Automate video prediction generation from YOLOv3 and YOLOv4* models using Darknet (neural network framework)

Bounding box prediction on save on json format (darknet format) in predictions directory

4. Automate video prediction generation from YOLOv5* models

Bounding box prediction on save on json format (darknet format) in predictions directory

5. Ref video and pred models comparison

name confidence precision recall mAP F1
yolov3_result.json 0.25 0.90 0.80 0.79 0.84
yolov4-tiny_result.json 0.25 0.90 0.69 0.69 0.79
yolov5s_result.json 0.25 0.92 0.78 0.75 0.84
yolov4-csp_result.json 0.25 0.94 0.86 0.86 0.90
yolov5m_result.json 0.25 0.93 0.85 0.84 0.89
yolov4x-mish_result.json 0.25 0.95 0.87 0.87 0.91
yolov4_result.json 0.25 0.94 0.87 0.87 0.90
yolov5l_result.json 0.25 0.94 0.85 0.84 0.89
yolov5x_result.json 0.25 0.93 0.84 0.84 0.88

6. Difficult case

Case 1 : Under water view

Difficult case 1

Case 2 : Top view

Difficult case 2

Case 3 : Distant view

Difficult case 3

Prototype - Conceivable solution