jasongief's Stars
babysor/MockingBird
🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment, and Generate Anything
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
jason718/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
AlexHex7/Non-local_pytorch
Implementation of Non-local Block.
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
zhenyuw16/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
linzhiqiu/cross_modal_adaptation
Cross-modal few-shot adaptation with CLIP
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
OpenNLPLab/TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
ziqipang/LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"
OpenNLPLab/FAVDBench
[CVPR 2023] Official implementation of the paper: Fine-grained Audible Video Description
OpenNLPLab/Tnn
[ICLR 2023] Official implementation of TNN in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling
MengyuanChen21/Awesome-Evidential-Deep-Learning
A curated publication list on evidential deep learning.
haoyi-duan/DG-SCT
[NeurIPS 2023] Official implementation code
OpenNLPLab/Transnormer
[EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer
MengyuanChen21/CVPR2023-CMPAE
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
MengyuanChen21/NeurIPS2024-CSP
[NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models
jasongief/CPSP
[TPAMI 2023] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
OpenNLPLab/FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
Georgelingzj/up-to-date-Vision-Language-Models
An up-to-date collection of Vision-Language Models, with a main focus on computer vision
VUT-HFUT/Micro-Action
[TCSVT 2024] Official implementation of the paper: Benchmarking Micro-action Recognition: Dataset, Methods, and Applications
GeWu-Lab/TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
GeWu-Lab/LFAV
Towards Long Form Audio-visual Video Understanding
VUT-HFUT/MiGA2023_Track1
[IJCAI 2023] Champion of the Micro-gesture Classification sub-challenge in MiGA@IJCAI 2023.
zhangbin-ai/APL
APL for the audio-visual question answering (AVQA) task
jinxiang-liu/SSL-TIE
Official code for the ACM MM 2022 paper "Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation"