/bincrowd

Official Implementation of ACMMM'21 paper "Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting"

Primary LanguagePythonMIT LicenseMIT

Wisdom of (Binned) Crowds

website | arxiv

The idea behind our work is to tackle the high variance of error that is ignored when considering de facto statistical performance measures (MSE,MAE) for performance evaluation in the crowd counting domain. Our recipe involves finding bins across the range of counts that are optimal in a Bayesian sense and later systematically modifying the standard crowd counting pipeline to incorporate decrease of variance at each step.

The website contains interactive plotly plots, which bring out the issue with using MAE/MSE as evaluation metrics. Additionally, it contains the plots that enable inspection of the performance at a bin-level.

TL;DR tweet thread providing an overview of our approach

Overview Diagram

here

Step 1 : Data Preparation & Splits (Stages(1,2)) / Binning

How to find bins for new dataset

In our paradigm the first step is to create strata over the count range to enable balanced data splits. We recommend to divide the entire range of counts into strata which are optimal in Bayesian sense, and later sample images that belong to these strata. The procedure for data split generation is provided in the folder named binning.


Step 2 : Minibatch Sampling (Stage 3)

How to sample the minibatch from bins

Next step in our paradigm is to sample from these bins. We exeprimented with two differnt ways of choosing the bin, they are :

    1. Round Robin sampling
    2. Random sampling

For a simple visualisation of the procedure you can refer to the folder named sampling. In the same folder, we present a demo on DM Count network about procedure to include the sampling into any network.


Step 3 : Optimization (Stage 4)

How to include strata aware optimization in your network

The next step of the procedure is to include strata aware optimization into any network. We add this loss as an additive loss along with a scaleable hyper parameter (λ1 in Sec 3.4) to the original network's loss function. The idea behind adoption of strata aware optimization is to reduce variance. A pyTorch implementation of that optimization is provided in this folder named optimization.













Step 4 : Evaluation (Stage-5)

here

How to evaluate your networks performance at a bin level

The final step in our paradigm is to have a better represented evaulation metric. The generally used MAE conceals standard deviation, to bring out this we evaluate the performance at a strata-level (mean and std) and a pooled mean and std. The folder evaluation consists of a notebook to do the same.

How to evaluate your networks performance using percentage error

Additionally this folder consists of code and procedure to evaluate the model using TPER curves.

Localization evaluation of crowd counting networks.

The code for evaluation using GAME metric is provided in the evaluation folder.


Pretrained models and test code

The pretrained models that were used to prepare results for different datasets and networks are given in the following zenodo links. The testing demo script for DMCount is provided in the folder testing.

Network name link
BL (NWPU,UCF,STA,STB) here
DM Count (NWPU,UCF,STA,STB) here
SCARNet (NWPU,UCF,STA,STB) here
SDCNet (NWPU,UCF,STA,STB) here
SFANet (NWPU,UCF,STA,STB) here

Pull Request

To include your network in our work please create a pull request with the name <your_network_name>. Fill up the details of your network in the following format, and add it into the description:

Network name:                       Ex(DM-Count,...)
Dataset name:                       Ex(NWPU,STA,STB,...)
Sampling type:                      Ex(Round Robin, Ramdom Sampling)
Conference or arxiv link:
Github repo:
Contact email:

Attach a file with the name <your_network_name_dataset_name_type_of_binning>.txt, replace your_network_name with your networks name, dataset_name with the dataset name and type_of_sampling ( rr for round robin sampling and rs for random sampling) on which results of the file correspond to. The file should contain row-wise information in the following format:

image_name_1 , target_1, predicted_1
image_name_2 , target_2, predicted_2
.
.
.
image_name_n , target_n, predicted_n

Here image_name_i can be a string or an int that refers to the input image name to the network and predicted_i refers to the count predicted by your network from the image_name_i, and target_i represents its ground truth.

To generate the above mentioned txt you can take help of the code here

Citation

If you are using our work, Please cite us at :

@inproceedings{10.1145/3474085.3475522,
author = {Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla},
title = {Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm
for Crowd Counting
},
booktitle = {Proceedings of the 2021 ACM Conference on Multimedia},
year = {2021},
location = {Virtual Event, China},
publisher = {ACM},
address = {China},
}