*Equal Contributions +Corresponding Author
Xidian University, McMaster University
This is the official repository for our recent paper, "Semantics-aware Contrastive Semi-supervised Learning for Low-light Drone Image Enhancement".
Despite the impressive progress of recent low-light image enhancement techniques, the scarcity of annotated data has become a significant obstacle to further improvement. To address this issue, we propose a mean-teacher-based semi-supervised low-light enhancement framework that uses unlabeled data for model optimization. However, the naive implementation of the mean-teacher method encounters two primary challenges. The use of pixel-wise distance in the mean-teacher method may lead to overfitting on incorrect labels, which results in confirmation bias. To mitigate this issue, we introduce semantics-aware contrastive regularization as a preventive measure against overfitting on incorrect labels.
Experimental results demonstrate that our method achieves remarkable quantitative and qualitative improvements over the existing methods.
Figure 1. An overview of our approach.
- Ubuntu==18.04
- Pytorch==1.8.1
- CUDA==11.1
Other dependencies are listed in requirements.txt.
Install Segment Anything:
pip install git+https://github.com/facebookresearch/segment-anything.git
or clone the repository locally and install with
git clone git@github.com:facebookresearch/segment-anything.git
cd segment-anything; pip install -e .
Install Mobile Segment Anything:
pip install git+https://github.com/ChaoningZhang/MobileSAM.git
or clone the repository locally and install with
git clone git@github.com:ChaoningZhang/MobileSAM.git
cd MobileSAM; pip install -e .
The following optional dependencies are necessary for mask post-processing, saving masks in COCO format, the example notebooks, and exporting the model in ONNX format. jupyter is also required to run the example notebooks.
pip install opencv-python timm transformers fairscale loralib pyiqa -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install adamp -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install -U openmim -i https://pypi.tuna.tsinghua.edu.cn/simple
mim install mmcv
pip uninstall timm
pip install timm -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install Pillow==9.2.0 -i https://pypi.tuna.tsinghua.edu.cn/simple
Run data_split.py to randomly split your paired datasets into training, validation, and test sets.
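For orientation, a random split of paired images could look like the sketch below. It is not the repository's data_split.py: the function name `split_pairs`, the 80/10/10 ratios, and the fixed seed are assumptions made for illustration.

```python
import random


def split_pairs(names, train_ratio=0.8, val_ratio=0.1, seed=0):
    """Shuffle file names and split them into train/val/test lists.

    `names` are the file names shared by each low-light input and its
    ground-truth image, so a pair always lands in the same subset.
    The ratios and the seed are illustrative assumptions.
    """
    names = sorted(names)          # deterministic starting order
    rng = random.Random(seed)      # local RNG, so the split is reproducible
    rng.shuffle(names)
    n_train = int(len(names) * train_ratio)
    n_val = int(len(names) * val_ratio)
    train = names[:n_train]
    val = names[n_train:n_train + n_val]
    test = names[n_train + n_val:]  # whatever remains goes to the test set
    return train, val, test


if __name__ == "__main__":
    files = [f"{i:04d}.png" for i in range(100)]
    train, val, test = split_pairs(files)
    print(len(train), len(val), len(test))
```

Splitting on the shared file name rather than on full paths keeps each input together with its ground truth.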
Run estimate_illumination.py to get the illumination map of each image.
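What an illumination map contains can be illustrated with a simple estimate that takes the maximum of the three color channels at each pixel. This is only an assumed stand-in for illustration; estimate_illumination.py may use a different estimator. The sketch uses plain lists so that it runs without dependencies; with NumPy, `img.max(axis=2)` on an H x W x 3 array gives the same result.

```python
def estimate_illumination(image):
    """Return a per-pixel illumination map for an RGB image.

    `image` is a list of rows, each row a list of (r, g, b) tuples.
    The illumination at a pixel is taken as the maximum of its three
    channels, which is an assumption of this sketch and not necessarily
    what the repository's script computes.
    """
    return [[max(pixel) for pixel in row] for row in image]


if __name__ == "__main__":
    tiny = [
        [(10, 20, 30), (0, 0, 0)],
        [(255, 128, 64), (5, 5, 5)],
    ]
    print(estimate_illumination(tiny))
```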
Finally, the data directory is organized as follows:
data
├── labeled
│   ├── input
│   └── GT
├── unlabeled
│   └── input
├── val
│   ├── input
│   └── GT
└── test
    └── benchmarkA
        └── input
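Before training, it can help to confirm that the folders from the tree above exist. The helper below is a hypothetical convenience script, not part of the repository; the name `missing_folders` and the default root `data` are assumptions.

```python
from pathlib import Path

# Sub-folders taken from the directory tree above. Test benchmarks are
# left out because their folder names vary.
EXPECTED = [
    "labeled/input",
    "labeled/GT",
    "unlabeled/input",
    "val/input",
    "val/GT",
]


def missing_folders(data_root):
    """Return the expected sub-folders that do not exist under data_root."""
    root = Path(data_root)
    return [rel for rel in EXPECTED if not (root / rel).is_dir()]


if __name__ == "__main__":
    missing = missing_folders("data")
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("Data layout looks complete.")
```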
You can download the training set and test sets from benchmarks UIEB, EUVP, UWCNN, Sea-thru, RUIE.
Put your test benchmark under the data/test folder and run estimate_illumination.py to get its illumination map.
Set up the following three paths in test.py:
model_root = 'model/lol_ckpt_begin_0404/model_e200.pth'
input_root = 'data/LOLv1/val'
save_path = 'result/lol_ckpt_begin_0404/'
Run test.py; the results are saved in the result folder.
python test_withgrad.py
To train the framework, first run create_candiate.py to initialize the reliable bank. Hyper-parameters can be modified in trainer.py.
Set up the following options in train.py:
parser.add_argument('--data_dir', default='./data/LOLv2_real', type=str, help='data root path')
parser.add_argument('--save_path', default='./model/ckpt_begin_04014_on_LOLv2_real/', type=str)
To resume training from a checkpoint, set:
parser.add_argument('--resume', default='False', type=str, help='if resume')
parser.add_argument('--resume_path', default='./model/ckpt_begin_0408_on_visdrone/model_e160.pth', type=str, help='if resume')
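Note that `--resume` is declared with `type=str`, so the parsed value is a string, and in Python any non-empty string (including 'False') is truthy. How train.py interprets the value is not shown here. If a real boolean is preferred, one option is a small converter such as the hypothetical `str2bool` below; the helper, `build_parser`, and the empty default for `--resume_path` are assumptions for this sketch.

```python
import argparse


def str2bool(value):
    """Convert common textual booleans to a real bool (hypothetical helper)."""
    text = str(value).strip().lower()
    if text in ("true", "1", "yes"):
        return True
    if text in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    parser = argparse.ArgumentParser()
    # Same flag names as above, but --resume becomes a bool when parsed.
    parser.add_argument('--resume', default=False, type=str2bool, help='if resume')
    parser.add_argument('--resume_path', default='', type=str,
                        help='checkpoint to resume from')
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(['--resume', 'True', '--resume_path', 'model.pth'])
    print(args.resume, args.resume_path)
```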
Run train.py to start training.
CUDA_VISIBLE_DEVICES=2 nohup python train_lolv1.py --gpus 1 --train_batchsize 6 > logs/train_on_lolv1_visdrone_0414.txt
If you use the code in this repo for your work, please cite the following bib entries:
@inproceedings{huang2023contrastive,
title={Contrastive Semi-supervised Learning for Underwater Image Restoration via Reliable Bank},
author={Huang, Shirui and Wang, Keyan and Liu, Huan and Chen, Jun and Li, Yunsong},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={18145--18155},
year={2023}
}
The training code architecture is based on PS-MT and DMT-Net; we thank the authors for their work. We also thank the authors of the following repositories: IQA-Pytorch, UWNR, MIRNetv2, 2022-CVPR-AirNet, SPSR, Non-Local-Sparse-Attention, AFF, AECR-Net, UIEB, EUVP, UWCNN, Sea-thru, RUIE, MMLE, PWRNet, Ucolor, CWR, FUnIE-GAN.