Jia-Wei-Liu
PhD student at ShowLab, National University of Singapore.
National University of SingaporeSingapore
Jia-Wei-Liu's Stars
NUS-HPC-AI-Lab/DD-Ranking
Data distillation benchmark
weijiawu/ParaDiffusion
Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'
NUS-HPC-AI-Lab/R-MeeTo
Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training".
showlab/FQGAN
FQGAN: Factorized Visual Tokenization and Generation
showlab/ROICtrl
Code for ROICtrl: Boosting Instance Control for Visual Generation
showlab/ShowUI
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
showlab/VideoLISA
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
abcyzj/MeshRet
Official implementation for the NeurIPS 2024 spotlight paper "Skinned Motion Retargeting with Dense Geometric Interaction Perception".
showlab/Exo2Ego-V
showlab/EvolveDirector
[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
showlab/RingID
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
showlab/Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
zhaohengyuan1/Genixer
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Yanqing0327/MLLMs-Augmented
The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》
yihua7/SC-GS
[CVPR 2024] Code for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
VAST-AI-Research/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
showlab/T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
showlab/cosmo
opendatalab/CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
showlab/ShowRoom3D
This is the project page of ShowRoom3D
HelenMao/MAG-Edit
MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)
TencentARC/ViT-Lens
[CVPR 2024] ViT-Lens: Towards Omni-modal Representations
TencentARC/HOSNeRF
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence