jiajiaxiaoskx's Stars
DefaultRui/BEV-Scene-Graph
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
guyyariv/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
lingorX/HieraSeg
CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.