Online Continual Learning with Contrastive Vision Transformer (CVT)

📋 Dependencies

torch==1.3.1

torchvision==0.4.2

numpy==1.16.4

absl-py==0.9.0

cachetools==4.0.0

certifi==2019.11.28

chardet==3.0.4

Cython==0.29.15

google-auth==1.11.2

google-auth-oauthlib==0.4.1

googledrivedownloader==0.4

grpcio==1.27.2

idna==2.8

Markdown==3.2.1

oauthlib==3.1.0

Pillow==6.1.0

protobuf==3.11.3

pyasn1==0.4.8

pyasn1-modules==0.2.8

quadprog==0.1.7

requests==2.22.0

requests-oauthlib==1.3.0

rsa==4.0

six==1.14.0

tensorboard==2.0.1

urllib3==1.25.8

Werkzeug==1.0.0
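Assuming these pins are collected into a requirements.txt file (an assumption; such a file is not shown here), they can be installed in one step:

    pip install -r requirements.txt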

📋 Running

  • Use ./utils/main.py to run experiments; an example invocation is sketched after this list.

  • New models can be added to the models/ folder.

  • New datasets can be added to the datasets/ folder.
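A minimal invocation sketch, assuming the common mammoth-style command line where --model, --dataset, and --buffer_size select the method, benchmark, and memory budget (the flag names and the cvt model name are assumptions; check utils/main.py --help for the actual arguments):

    python utils/main.py --model cvt --dataset seq-cifar100 --buffer_size 500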

📋 Results

We report the average incremental performance under the task-free protocol with a 500-sample memory buffer, i.e., after completing each task the model is evaluated on all tasks observed so far. The results are curves of accuracy and forgetting after each task. The performance of most methods degrades rapidly as new tasks arrive, while our method consistently outperforms the state-of-the-art methods in both accuracy and forgetting throughout learning.

Figures: incremental accuracy and forgetting curves (Class-IL, 500-sample buffer) on 20-split CIFAR-100, TinyImageNet, and ImageNet-100.
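For reference, both reported metrics are simple to compute from a matrix of per-task accuracies. A minimal sketch (the function names and the acc_matrix layout are illustrative, not taken from this repo):

    # acc_matrix[t][j] = accuracy on task j, measured after training task t (j <= t)

    def average_incremental_accuracy(acc_matrix):
        """Mean accuracy over all seen tasks, averaged over training steps."""
        per_step = [sum(row) / len(row) for row in acc_matrix]  # mean over seen tasks
        return sum(per_step) / len(per_step)                    # mean over steps

    def average_forgetting(acc_matrix):
        """Mean drop from each task's best accuracy to its final accuracy."""
        final, T = acc_matrix[-1], len(acc_matrix)
        drops = [max(acc_matrix[t][j] for t in range(j, T)) - final[j]
                 for j in range(T - 1)]                         # the last task cannot be forgotten
        return sum(drops) / len(drops)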

📋 Conclusion

In this paper, we propose a novel attention-based framework, Contrastive Vision Transformer (CVT), to effectively mitigate catastrophic forgetting in online continual learning (CL). To the best of our knowledge, this is the first work in the literature to design a Transformer for online CL. CVT contains external attention and learnable focuses to accumulate previous knowledge and maintain class-specific information. Trained with the proposed focal contrastive loss, CVT rebalances contrastive continual learning between new and past classes and improves inter-class distinction and intra-class aggregation. Moreover, CVT uses a dual-classifier structure to decouple learning the current classes from balancing all seen classes. Extensive experimental results show that our approach significantly outperforms current state-of-the-art methods with fewer parameters, and ablation analyses validate the effectiveness of the proposed components.
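To make the focal contrastive idea concrete, below is a generic sketch of a supervised contrastive loss with a focal-style (1 - p)^gamma modulation that up-weights anchors whose positives receive little probability mass. It illustrates the principle only; the exact focal contrastive loss and the learnable class focuses in CVT differ, see the paper.

    import torch
    import torch.nn.functional as F

    def focal_supcon_loss(features, labels, temperature=0.1, gamma=2.0):
        """Focal-weighted supervised contrastive loss (illustrative, not CVT's exact loss).

        features: (N, D) L2-normalized embeddings; labels: (N,) integer class ids.
        """
        n = len(labels)
        self_mask = torch.eye(n, dtype=torch.bool, device=features.device)
        sim = (features @ features.T / temperature).masked_fill(self_mask, float('-inf'))
        log_prob = F.log_softmax(sim, dim=1)                     # log p(j | anchor i)
        pos = (labels[:, None] == labels[None, :]) & ~self_mask  # positive pairs
        # probability mass each anchor assigns to its positives
        p_pos = (log_prob.exp() * pos).sum(dim=1)
        focal = (1.0 - p_pos).clamp_min(0).pow(gamma)            # up-weight hard anchors
        per_anchor = -log_prob.masked_fill(~pos, 0.0).sum(1) / pos.sum(1).clamp_min(1)
        return (focal * per_anchor).mean()

In CVT this kind of rebalancing acts between new classes and buffered past classes; the sketch above simply applies it to whatever batch is passed in.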

The source code will be made publicly available.

All rights reserved.