junchen-fu's Stars
yuankaishen2001/AUFormer
[ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
thu-nics/MoA
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
shenweichen/DeepCTR-Torch
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
MendelXu/SAN
Open-vocabulary Semantic Segmentation
CrossmodalGroup/LAPS
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
IsaacRodgz/ConcatBERT
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
GistNoesis/FourierKAN
SAI990323/TALLRec
ghdtjr/A-LLMRec
BaohaoLiao/mefts
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Mangul-Lab-USC/Mangul-Lab-USC.github.io
Website for Mangul Lab.
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
rutgerswiselab/GenRec
Large Language Model for Generative Recommendation
hw-du/CBiT
Implementation of the paper "Contrastive Learning with Bidirectional Transformers for Sequential Recommendation".
enoche/MultimodalRecSys
A curated list of awesome resources about multimodal recommender systems.
GAIR-Lab/IISAN
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT
MMSR23/MMSR
An Empirical Study of Training Multi-modal Sequential Recommendation Models
perixtar/2024-Tech-OA
List of Tech Company OAs. Save your time from finding them all over the internet.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
pykt-team/pykt-toolkit
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models
TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
enoche/MMRec
A Toolbox for MultiModal Recommendation. Integrating 10+ Models...
Xiaohao-Liu/CLHE
The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction".
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
westlake-repl/Adapter4Rec
Multi-domain Recommendation with Adapter Tuning