A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Primary LanguagePythonOtherNOASSERTION