xiaoqian-shen's Stars
timothybrooks/instruct-pix2pix
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Doubiiu/DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
omerbt/TokenFlow
Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)
Fantasy-Studio/Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Vchitect/LaVie
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
ExponentialML/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
allenai/unified-io-2
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
dvlab-research/Video-P2P
Video-P2P: Video Editing with Cross-attention Control
AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
AILab-CVC/TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters
jamespark3922/visual-comet
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
genforce/StyleSV
[ICLR 2023] Towards Smooth Video Composition
bytedance/Shot2Story
Shot2Story: a new multi-shot video understanding benchmark with comprehensive video summaries and detailed shot-level captions.
google/storybench
ubc-vision/Make-A-Story
Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023
xiaoqian-shen/StoryGPT-V
adymaharana/StoryViz
adymaharana/VLCStoryGan
Official code repository for the EMNLP 2021 paper
princetonvisualai/pointingqa
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
yonseivnl/cmota
ali-vilab/i2vgen-xl