Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
Primary LanguagePythonBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause