jianlong-yuan

Interested in Dense Prediction, such as Depth Estimation and Semantic Segmentation

Alibaba-DAMObeijing

jianlong-yuan's Stars

Stability-AI/generative-models
Generative Models by Stability AI
Language:Python25.6k 264 3192.8k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python11.4k 83 5371.1k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python11.3k 104 376916
RayVentura/ShortGPT
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
Language:Python6.3k 70 117828
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python3k 47 0188
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python3k 33 160270
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language:Python1.6k 74 45139
DjangoPeng/openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
Language:Jupyter Notebook1.5k 44 151k
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Language:Python1.4k 12 3463
xiaobai1217/Awesome-Video-Datasets
Video datasets
1.4k 29 12101
showlab/Show-1
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Language:Python1.1k 36 2056
MC-E/DragonDiffusion
ICLR 2024 (Spotlight)
Language:Python750 40 3020
rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Language:Python606 12 8542
segmind/distill-sd
Segmind Distilled diffusion
Language:Python595 17 1738
iejMac/video2dataset
Easily create large video dataset from video urls
Language:Python592 9 15670
RaymondWang987/NVDS
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
Language:Python508 22 3623
ziqihuangg/ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
Language:Python501 19 919
microsoft/XPretrain
Multi-modality pre-training
Language:Python489 12 4037
forence/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
413 19 450
VQAssessment/DOVER
[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.
Language:Jupyter Notebook351 5 3734
cure-lab/PnPInversion
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
Language:Jupyter Notebook298 7 1813
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Language:Python294 7 1213
showlab/all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
Language:Python281 6 2117
showlab/EgoVLP
[NeurIPS 2022] Egocentric Video-Language Pretraining
Language:Python235 3 2820
facebookresearch/ActivityNet-Entities
A Dataset for Grounded Video Description
Language:Python160 16 924
jiaxilv/GPT4Motion
Language:Python140 16 55
tgc1997/Awesome-Video-Captioning
A curated list of research papers in Video Captioning
120 2 014
simon3dv/SLR-SFS
Code release for the paper "Simulating Fluids in Real-World Still Images"
Language:Python112 3 69
liveseongho/Awesome-Video-Language-Understanding
A Survey on video and language understanding.
48 1 02
kyegomez/Gen1
My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML
Language:Python26 3 13

jianlong-yuan

jianlong-yuan's Stars

Stability-AI/generative-models

mlfoundations/open_clip

guoyww/AnimateDiff

RayVentura/ShortGPT

PixArt-alpha/PixArt-alpha

DAMO-NLP-SG/Video-LLaMA

omerbt/TokenFlow

DjangoPeng/openai-quickstart

facebookresearch/MetaCLIP

xiaobai1217/Awesome-Video-Datasets

showlab/Show-1

MC-E/DragonDiffusion

rese1f/MovieChat

segmind/distill-sd

iejMac/video2dataset

RaymondWang987/NVDS

ziqihuangg/ReVersion

microsoft/XPretrain

forence/Awesome-Visual-Captioning

VQAssessment/DOVER

cure-lab/PnPInversion

OPPO-Mente-Lab/Subject-Diffusion

showlab/all-in-one

showlab/EgoVLP

facebookresearch/ActivityNet-Entities

jiaxilv/GPT4Motion

tgc1997/Awesome-Video-Captioning

simon3dv/SLR-SFS

liveseongho/Awesome-Video-Language-Understanding

kyegomez/Gen1