icemansina
I am an assistant professor at CUHKSZ. My research focuses on 3D computer vision and AI-guided interdisciplinary research.
The University of Hong Kong, Hong Kong
icemansina's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
google-deepmind/alphafold
Open source code for AlphaFold 2.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domain adaptation
dk-liang/Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
facebookresearch/esm
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
aqlaboratory/openfold
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
FLHonker/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. A categorized collection of knowledge distillation papers (2014-2021).
ytongbai/LVM
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
Thinklab-SJTU/Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
yanx27/2DPASS
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds (ECCV 2022) :fire:
Haiyang-W/DSVT
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
wzzheng/OccWorld
[ECCV 2024] 3D World Model for Autonomous Driving
westlake-repl/SaProt
Saprot: Protein Language Model with Structural Alphabet (AA+3Di)
Ghostish/Open3DSOT
Open source library for Single Object Tracking in point clouds.
zhanghm1995/Forge_VFM4AD
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
yanx27/JS3C-Net
Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion (AAAI 2021)
JMoonr/LATR
[ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer
Nightmare-n/UniPAD
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)
woodfrog/maptracker
Code for paper "MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping", ECCV 2024 (Oral)
med-air/Endo-FM
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
CurryYuan/InstanceRefer
[ICCV 2021] InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
ZhanHeshen/PointCMT
[NeurIPS2022] Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis