/MUGEN_baseline

multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the training, evaluation and inference codes for these baselines.

Primary LanguagePythonOtherNOASSERTION

Stargazers

No one’s star this repository yet.