king159
CSE Ph.D. at The Chinese University of Hong Kong (CUHK)
The Chinese University of Hong Kong, Hong Kong, China
Pinned Repositories
Pair-Net
[IEEE TPAMI-2024] Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
svd-mv
Unofficial Implementation of "Stable Video Diffusion Multi-View"
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
attention-interpolation-diffusion
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
MambaOut
MambaOut: Do We Really Need Mamba for Vision?
king159's Repositories
king159/Pair-Net
[IEEE TPAMI-2024] Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
king159/svd-mv
Unofficial Implementation of "Stable Video Diffusion Multi-View"
king159/king159.github.io