XIAO1e's Stars
asfathermou/human-computer-interaction
国科大人机交互大作业:多模态情感识别
bowang-lab/Graph-Mamba
Graph-Mamba: Towards Long-Range Graph Sequence Modelling with Selective State Spaces
eeyhsong/NICE-EEG
[ICLR 2024] M/EEG-based image decoding with contrastive learning. i. Propose a contrastive learning framework to align image and eeg. ii. Resolving brain activity for biological plausibility.
XLearning-SCU/2021-NeurIPS-NCR
zjunet/Brant-X
CLIP-MUSED/CLIP-MUSED
CFM-MSG/Code_URMF
MarcLafon/gallop
Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
CharlesYang030/PolCLIP
PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations
BGU-CS-VIL/WTConv
Wavelet Convolutions for Large Receptive Fields. ECCV 2024.
KunpengLi1994/VSRN
PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"
LgQu/DIME
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
XLearning-SCU/2022-TPAMI-SURE
PyTorch implementation for Robust Multi-view Clustering with Incomplete Information (TPAMI 2022).
KMnP/vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
anosorae/IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
ai-dawang/PlugNPlay-Modules
layumi/Person_reID_baseline_pytorch
:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
layumi/Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://arxiv.org/abs/1711.05535
AlanChou/Super-Loss
PyTorch implementation of the paper "SuperLoss: A Generic Loss for Robust Curriculum Learning" in NIPS 2020.
jbdel/vilmedic
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
NVlabs/DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
mne-tools/mne-bids
MNE-BIDS is a Python package that allows you to read and write BIDS-compatible datasets with the help of MNE-Python.
ishine/DoRA-1
[ICML2024] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
kingjr/meg-masc
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
mml-book/mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"