Learning multi-modal representations by watching hundreds of surgical video lectures
Primary LanguagePython