pytorch-coteaching

Co-teaching (from the paper "Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels") is a training framework that stays robust when labels are extremely noisy. The key idea is to maintain two networks simultaneously and to train each network on the instances that its peer selects using the "small-loss" criterion.
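
To make the small-loss exchange concrete, here is a minimal sketch in plain Python. It assumes each network has already produced one loss value per example in the mini-batch. The function name `small_loss_exchange` and the `forget_rate` argument are hypothetical and are not taken from this repository's code.

```python
def small_loss_exchange(losses_a, losses_b, forget_rate):
    """Pick small-loss examples for two peer networks (illustrative sketch).

    losses_a, losses_b: per-example losses from network A and network B
    on the same mini-batch. forget_rate: fraction of the batch to drop.
    Returns (indices_for_a, indices_for_b), the examples each network
    should be updated on.
    """
    if len(losses_a) != len(losses_b):
        raise ValueError("both networks must score the same mini-batch")

    num_keep = int((1.0 - forget_rate) * len(losses_a))

    # Rank example indices by ascending loss, separately for each network.
    order_a = sorted(range(len(losses_a)), key=losses_a.__getitem__)
    order_b = sorted(range(len(losses_b)), key=losses_b.__getitem__)

    picked_by_a = order_a[:num_keep]
    picked_by_b = order_b[:num_keep]

    # Cross update: A learns from B's picks and B learns from A's picks.
    return picked_by_b, picked_by_a


idx_for_a, idx_for_b = small_loss_exchange(
    [0.1, 2.0, 0.3, 5.0], [4.0, 0.2, 0.1, 0.3], forget_rate=0.5
)
print(idx_for_a, idx_for_b)
```

In a real training loop the selected indices would be used to recompute the loss on just those examples before the backward pass of each network.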

Paper: https://arxiv.org/abs/1804.06872

Usage

Install requirements.txt

pip install -r requirements.txt

Preprocessing (build noisy data)

python main.py \
    --run_mode preprocess \
    --noise_prob 0.5 \
    --noise_type sym \
    --dataset CIFAR10
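
As an illustration of what symmetric noise (`--noise_type sym`) usually means, the sketch below flips each label with probability `noise_prob` to a different class chosen uniformly at random. This is an assumption about the preprocessing step, not a copy of this repository's implementation. The function name `add_symmetric_noise` and the `seed` argument are hypothetical.

```python
import random


def add_symmetric_noise(labels, noise_prob, num_class, seed=0):
    """Return a noisy copy of `labels` (illustrative sketch).

    Each label is replaced, with probability `noise_prob`, by one of the
    other `num_class - 1` classes chosen uniformly at random.
    """
    rng = random.Random(seed)  # local generator keeps the result reproducible
    noisy = []
    for y in labels:
        if rng.random() < noise_prob:
            other_classes = [c for c in range(num_class) if c != y]
            noisy.append(rng.choice(other_classes))
        else:
            noisy.append(y)
    return noisy


clean = [i % 10 for i in range(1000)]
noisy = add_symmetric_noise(clean, noise_prob=0.5, num_class=10)
flipped = sum(c != n for c, n in zip(clean, noisy))
print(f"flipped {flipped} of {len(clean)} labels")
```

Because a flipped label always lands on a different class, the observed fraction of corrupted labels should be close to `noise_prob`.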

Training

python main.py \
    --run_mode train \
    --model coteach \
    --num_gradual 15 \
    --lr 0.001 \
    --batch_size 128 \
    --num_class 10
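
The `--num_gradual` flag most likely controls how many epochs it takes for the fraction of dropped examples to reach its maximum. The sketch below follows the schedule described in the Co-teaching paper, where the drop rate grows linearly and then stays constant, and assumes the maximum equals the noise rate. Whether this repository uses exactly the same schedule is an assumption. The function name `forget_rate_schedule` is hypothetical.

```python
def forget_rate_schedule(num_epochs, num_gradual, max_forget_rate):
    """Per-epoch fraction of each mini-batch to drop (illustrative sketch).

    The rate rises linearly from 0 to `max_forget_rate` over the first
    `num_gradual` epochs and is held constant afterwards.
    """
    if num_gradual <= 0:
        raise ValueError("num_gradual must be positive")
    return [
        max_forget_rate * min(epoch / num_gradual, 1.0)
        for epoch in range(num_epochs)
    ]


rates = forget_rate_schedule(num_epochs=20, num_gradual=15, max_forget_rate=0.5)
for epoch in (0, 5, 10, 15, 19):
    print(epoch, round(rates[epoch], 3))
```

Starting with a drop rate of zero lets both networks see every example early on, before they start to memorize noisy labels.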

Experiments on CIFAR-10 (Image)

Performance results

num_gradual = 15

Settings / Models	CNN (reproduce, standard)	CNN (paper, standard)	CNN (reproduce, coteaching)	CNN (paper, coteaching)
Sym (ε = 20%)	71.7%	76.25%	81.5%	82.3%
Sym (ε = 50%)	43.0%	48.87%	72.2%	74.02%

Experiments on TREC (Text)

Performance results

num_gradual = 5

Settings / Models	TextCNN (reproduce, standard)	TextCNN (reproduce, coteaching)
Sym (ε = 20%)	80.36%	83.56%
Sym (ε = 50%)	41.9%	73.19%

Original code repository

https://github.com/bhanML/Co-teaching

yeachan-kr/pytorch-coteaching
