Hai-chao-Zhang's Stars
LLaVA-VL/LLaVA-NeXT
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
jongwoopark7978/LVNet
rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks
kijai/ComfyUI-SVD
Experimental use of stable-video-diffusion in ComfyUI
GigaAI-research/General-World-Models-Survey
Hai-chao-Zhang/OOSTraj
[CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
ma-xu/Rewrite-the-Stars
[CVPR 2024] Rewrite the Stars
Stability-AI/generative-models
Generative Models by Stability AI
bryanbocao/vifit
Repository of the paper ViFiT in MobiCom 2023 ISACom Workshop.
Hai-chao-Zhang/Vi-FiDatasetProcessing
Preprocess ViFi Multimodal Dataset with GPS provided
Jianglin954/LGI-LS
[NeurIPS 2023] Latent Graph Inference with Limited Supervision
ma-xu/Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
wyzjack/AdaM3
[ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization
J094/kitti_4_orbslam3_vio
Extract KITTI IMU and GNSS data from raw data for ORB_SLAM3 evaluation. The IMU and GNSS data are stored in EuRoC format.
wyzjack/Awesome-3D-AnomalyDetection
Awesome papers on 3D anomaly detection.
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
GuyTevet/motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
NVlabs/stylegan2
StyleGAN2 - Official TensorFlow Implementation
justinpinkney/awesome-pretrained-stylegan3
A collection of pretrained models for StyleGAN3
sidward14/Style-AttnGAN
Improves on AttnGAN's text-to-image synthesis by integrating scale-specific control from StyleGAN; can optionally use GPT-2 as the text encoder
vifi2021/Vi-Fi
IPSN22 submission
bryanbocao/vitag
Repository of the paper ViTag in SECON 2022 and demo (Best Demo Award).
jackyjsy/SCGAN
Spatially Constrained GAN (SCGAN) for Face and Fashion Synthesis
wyzjack/SLA2P
[CIKM 2022] Self-supervision Meets Adversarial Perturbation: A Novel Framework for Anomaly Detection (PyTorch)
PITI-Synthesis/PITI
PITI: Pretraining is All You Need for Image-to-Image Translation
ykasten/layered-neural-atlases
zju3dv/object_nerf
Code for "Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering", ICCV 2021