EricLina's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
MzeroMiko/VMamba
VMamba: Visual State Space Models,code is based on mamba
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
NVlabs/MambaVision
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
OpenGVLab/video-mamba-suite
The suite of modeling video with Mamba
LeapLabTHU/MLLA
Official repository of MLLA (NeurIPS 2024)
Westlake-AI/MogaNet
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
tyshiwo1/DiM-DiffusionMamba
The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
klauscc/VindLU
goombalab/hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
EasonXiao-888/GrootVL
[NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model
egoschema/EgoSchema
jinhyunj/EaTR
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
eamartin/parallelizing_linear_rnns
Cranial-XIX/longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
MCG-NJU/VFIMamba
VFIMamba: Video Frame Interpolation with State Space Models
OpenGVLab/De-focus-Attention-Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
facebookresearch/VidOSC
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
chenziwenhaoshuai/Vision-Mamba2
Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality