Infinitywxh's Stars
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
pavanravva/Enhanced-MMASD
DNA-Rendering/DNA-Rendering
DNA-RENDERING: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering
caizhongang/humman_toolbox
Toolbox for HuMMan Dataset
movienet/movienet-tools
Tools for movie and video research
PantoMatrix/PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
LivXue/VCIN
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reasoning with Variational Causal Inference Network for Explanatory Visual Question Answering"
PKUTAN/SAWT
Official python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"
IDEA-Research/MotionLLM
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
jonbarron/website
feifeiobama/RectifID
[NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Becomebright/GroundVQA
Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
muditbhargava66/PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
RenShuhuai-Andy/TimeChat
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
Wang-ML-Lab/llm-continual-learning-survey
Continual Learning of Large Language Models: A Comprehensive Survey
TaeryungLee/MultiAct_RELEASE
Official PyTorch implementation of "MultiAct: Long-Term 3D Human Motion Generation from Multiple Action Labels", in AAAI 2023 (Oral presentation).
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
EricGuo5513/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
boheumd/MA-LMM
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
kyegomez/Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
state-spaces/s4
Structured state space sequence models
yawenzeng/Awesome-Cross-Modal-Video-Moment-Retrieval
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
md-mohaiminul/ViS4mer
state-spaces/mamba
Mamba SSM architecture
yifanzhang-pro/Matrix-SSL
Official implementation of ICML 2024 paper "Matrix Information Theory for Self-supervised Learning" (https://arxiv.org/abs/2305.17326)
IDEA-Research/Motion-X
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"