/MERLIN_text_to_video_search

[EMNLP 2024 Industry Track] MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

MIT License
