There are three steps to train the classification network.
Download the HPH dataset here
First, use the off-the-shelf VLM for zero-shot inference with our proposed method to filter noisy samples out of the training set.
python zero_shot_HPH_un.py --gpu 0
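
The filtering criterion used by zero_shot_HPH_un.py is not described in this README. As a rough illustration of the general idea only, the sketch below assumes the VLM yields one logit per class for each training image and keeps a sample when its top softmax probability clears a threshold. The names `softmax` and `filter_confident`, the sample IDs, and the 0.9 threshold are all hypothetical and not taken from the repository; the proposed method may filter differently.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def filter_confident(sample_ids, logits_per_sample, threshold=0.9):
    """Keep samples whose top zero-shot probability is >= threshold.

    Hypothetical helper: returns (sample_id, pseudo_label, confidence)
    tuples, where pseudo_label is the index of the most likely class.
    """
    kept = []
    for sid, logits in zip(sample_ids, logits_per_sample):
        probs = softmax(logits)
        conf = max(probs)
        label = probs.index(conf)
        if conf >= threshold:
            kept.append((sid, label, conf))
    return kept


if __name__ == "__main__":
    # Made-up logits for three images and two classes (illustration only).
    ids = ["img_0", "img_1", "img_2"]
    logits = [[4.0, 0.0], [0.2, 0.1], [0.0, 5.0]]
    for sid, label, conf in filter_confident(ids, logits, threshold=0.9):
        print(sid, label, round(conf, 3))
```

In this toy input the second image has nearly equal scores for both classes, so it is dropped as a likely noisy sample, while the other two are kept with their predicted class as the pseudo-label. A higher threshold trades training-set size for label quality.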
Second, once you have the high-quality pseudo-labels, train the classification network on them.
python train_pseudo.py --gpu 0