jianlong-yuan
Interested in Dense Prediction, such as Depth Estimation and Semantic Segmentation
Alibaba-DAMObeijing
jianlong-yuan's Stars
Stability-AI/generative-models
Generative Models by Stability AI
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
RayVentura/ShortGPT
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
williamyang1991/VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
DjangoPeng/openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
xiaobai1217/Awesome-Video-Datasets
Video datasets
ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
omerbt/Text2LIVE
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
YBYBZhang/ControlVideo
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
iejMac/video2dataset
Easily create large video dataset from video urls
RaymondWang987/NVDS
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
microsoft/XPretrain
Multi-modality pre-training
yzhang2016/video-generation-survey
A reading list of video generation
sstzal/DiffTalk
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
forence/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
showlab/all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
showlab/EgoVLP
[NeurIPS2022] Egocentric Video-Language Pretraining
tumurzakov/AnimateDiff
AnimationDiff with train
tgc1997/Awesome-Video-Captioning
A curated list of research papers in Video Captioning
simon3dv/SLR-SFS
Code release for the paper "Simulating Fluids in Real-World Still Images"
TalalWasim/Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
kyegomez/StarlightVision
A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.
haha-lisa/Style-A-Video
liveseongho/Awesome-Video-Language-Understanding
A Survey on video and language understanding.