thucz
PhD Student at Tsinghua University, intern @ Baidu Vis, previously intern @ Tencent ARC Lab
CSCG, Tsinghua University, Beijing
Pinned Repositories
LGM
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
VMamba
VMamba: Visual State Space Models, code is based on mamba
mamba
Mamba SSM architecture
GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
PanoGRF
[NeurIPS2023] PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas (or 360-degree images)
thucz.github.io
thucz's Repositories
thucz/PanoGRF
[NeurIPS2023] PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas (or 360-degree images)
thucz/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
thucz/thucz.github.io