Run sh train.sh to run mask2former + dinov2, you can change the dataset, this codebase focus on LVIS.