bbbdbbb

bbbdbbb's Stars

rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook 38k 406 1154.9k
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
27.8k 734 0 2.5k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python 27.1k 211 4.4k 5.6k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.5k 263 130861
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Jupyter Notebook 12.2k 98 3481.6k
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML 11.3k 268 49954
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10.2k 97 679989
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Language: Jupyter Notebook 3.8k 96 30413
jbhuang0604/awesome-tips
3.5k 99 4197
clovaai/stargan-v2
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
Language: Python 3.5k 78 163662
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language: Python 2.9k 33 158265
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language: Python 1.6k 13 141199
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
Language: Python 1.6k 22 68116
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Language: Python 849 19 2545
tcapelle/Diffusion-Models-pytorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Language: Jupyter Notebook 395 2 0 59
zeroQiaoba/MERTools
Toolkits for Multimodal Emotion Recognition
Language: Python 177 3 717
fabawi/ImageBind-LoRA
Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA
Language: Python 176 3 1316
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language: Python 176 7 2915
zeroQiaoba/AffectGPT
Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT
Language: Python 124 4 5 8
sunlicai/MAE-DFER
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023)
Language: Python 99 3 1815
llyx97/TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
Language: Python 96 4 4 2
FilippoMB/Diffusion_models_tutorial
Language: Jupyter Notebook 90 1 1 9
AIGCDesignGroup/WordArt
This work introduces WordArt Designer, a user-driven framework for artistic typography synthesis, relying on Large Language Models (LLM).
35 1 0 3
PinJui/FDRL
Unofficial implementation of "Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition - CVPR'21"
Language: Python 18 1 2 1
zylMozart/Disentangle_GraphHom
[NeurIPS 2024] Disentangled Graph Homophily
Language: Python 18 4 1 2
MIPS-COLT/MER-MCE
This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.
Language: Python 15 2 5 3
zfkarl/scAgent
A PyTorch implementation of the paper "scAgent: A Versatile Single-cell Multi-omics Data Analysis Framework via Multi-agent Collaboration"
Language: Python 9 2 0 1
Lum1104/EIBench
EIBench: Assessing the Emotion Interpretation ability of Vision Large Language Models
Language: Python 7 2 0 0
bbbdbbb/MiniGPT-4-captions
Generating captions on image datasets using MiniGPT-v2
Language: Python 6 2 1 0
CASIA-Affective-Computing-Group/MER2023-Baseline
Language: Python 4 1 0 0