akoepke's Stars
oawiles/X2Face
Pytorch code for ECCV 2018 paper
autonomousvision/plant
[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations
oawiles/FAb-Net
Pytorch code for BMVC 2018 paper
yanbeic/CCL
PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
ExplainableML/WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"
akoepke/audio-retrieval-benchmark
Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".
ExplainableML/AVCA-GZSL
This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language"
baumgach/tue-slurm-helloworld
Instructions and examples to deploy some PyTorch code on slurm using a Singularity Container
oncescuandreea/audio-retrieval
Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch
ExplainableML/CLEVR-X
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
ExplainableML/TCAF-GZSL
This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"
ExplainableML/ImageFreeZSL
ExplainableML/ZerAuCap
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
ExplainableML/AVDIFF-GFSL
This repository contains the code for our DAGM GCPR 2023 paper "Text-to-feature diffusion for audio-visual few-shot learning"
ExplainableML/Spurious_CM_Retrieval
Official PyTorch implementation of CVPR 2023 MULA Workshop paper "Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval"
ExplainableML/ReGaDa
BMVC 2023: Video-adverb retrieval with compositional adverb-action embeddings
ExplainableML/CCL
Code on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
ExplainableML/Deep-Graph-Persistence
Code for the paper "Addressing caveats of neural persistence with deep graph persistence".
oawiles/iccvw19_classemb
Code for ICCV Workshop paper: Self-supervised learning of class embeddings from video
ExplainableML/ZS-A2T
[GCPR 2023] Zero-shot Translation of Attention Patterns in VQA Models to Natural Language
oncescuandreea/audio_egovlp
This is the official codebase used for obtaining the results in the ICASSP 2024 paper: A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval
oncescuandreea/DTU_text_audio