Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU
Primary LanguageJupyter NotebookMIT LicenseMIT
No issues in this repository yet.