jiajiaxiaoskx

Zhejiang UniversityHangzhou

jiajiaxiaoskx's Stars

DefaultRui/BEV-Scene-Graph
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
Language:Python12020
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.4k70
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language:Python1.7k122
guyyariv/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
Language:Python11113
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language:Python50224
lingorX/HieraSeg
CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.
Language:Python26926
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language:Jupyter Notebook2.9k343