bbbdbbb's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
jbhuang0604/awesome-tips
clovaai/stargan-v2
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
tcapelle/Diffusion-Models-pytorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
zeroQiaoba/MERTools
Toolkits for Multimodal Emotion Recognition
fabawi/ImageBind-LoRA
Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
zeroQiaoba/AffectGPT
Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT
sunlicai/MAE-DFER
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023)
llyx97/TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
FilippoMB/Diffusion_models_tutorial
AIGCDesignGroup/WordArt
This work introduces WordArt Designer, a user-driven framework for artistic typography synthesis, relying on Large Language Models (LLM).
PinJui/FDRL
Unofficial implementation of "Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition - CVPR'21"
zylMozart/Disentangle_GraphHom
[NeurIPS 2024] Disentangled Graph Homophily
MIPS-COLT/MER-MCE
This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.
zfkarl/scAgent
A PyTorch implementation of the paper "scAgent: A Versatile Single-cell Multi-omics Data Analysis Framework via Multi-agent Collaboration"
Lum1104/EIBench
EIBench: Assessing the Emotion Interpretation ability of Vision Large Language Models
bbbdbbb/MiniGPT-4-captions
Generating captions on image datasets using MiniGPT-v2
CASIA-Affective-Computing-Group/MER2023-Baseline