Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
Primary LanguagePythonMIT LicenseMIT