Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
Primary Language: Python