xia-zhe's Stars
stoneMo/EZ-AVGZL
Official Codebase of "Audio-visual Generalized Zero-shot Learning the Easy Way" (ECCV 2024)
ExplainableML/AVCA-GZSL
This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language"
Graph-and-Geometric-Learning/hyperbolic-transformer
liwrui/SceneDreamer360
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
IBM/AdaMML
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
GeWu-Lab/TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
matthewdm0816/BridgeQA
[AAAI 24] Official Codebase for BridgeQA: Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
Chunmian-art/City-3DQA
liwrui/Awesome-3D-Generation
amazon-science/QA-ViT
Mr-Neko/JM3D
The offical implemention of JM3D.
UMass-Foundation-Model/3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
yccyenchicheng/SDFusion
yushuang-wu/IPoD
alibaba/ipod
graldij/transformer-fusion
Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.
LeapLabTHU/MLLA
Official repository of MLLA (NeurIPS 2024)
reml-group/MUSIC-AVQA-R
mosaf/Awesome-DL-based-CS-MRI
"Awesome-DL-based-CS-MRI" is a curated collection of resources, tools, and research papers related to deep learning-based Compressed Sensing in Magnetic Resonance Imaging (CS-MRI). It's a valuable resource for those interested in this cutting-edge field, promoting knowledge sharing and collaboration among researchers and practitioners.
xia-zhe/MSTR
XiudingCai/Awesome-Mamba-Collection
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
JHome1/GiO-GiT
marlin-codes/Awesome-Hyperbolic-Representation-and-Deep-Learning
Paper list about hyperbolic embedding, hyperbolic models,hyperbolic applications
AmeenAli/HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
rikeilong/MCD-forAVQA
Official Implementation for Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
camenduru/VideoMamba-jupyter
camenduru/VideoMamba-hf
GeWu-Lab/PSTP-Net
GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.