songrise
Currently RA @ PolyU. Prev CS BSc @ PolyU, Hong Kong. Interested in CG / CV / Multimodal AI.
The Hong Kong Polytechnic University
songrise's Stars
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
amusi/ICCV2023-Papers-with-Code
A collection of ICCV 2023 papers and open-source projects
SwinTransformer/Swin-Transformer-Semantic-Segmentation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
xiaobai1217/Awesome-Video-Datasets
Video datasets
buaacyw/GaussianEditor
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
fudan-zvg/SETR
[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
KMnP/vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
ildoonet/pytorch-gradual-warmup-lr
Gradually-Warmup Learning Rate Scheduler for PyTorch
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
graphdeco-inria/diff-gaussian-rasterization
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
facebookresearch/omnivore
Omnivore: A Single Model for Many Visual Modalities
icoz69/StyleAvatar3D
Official repo for StyleAvatar3D
lichengunc/refer
Referring Expression Datasets API
JD-P/simulacra-aesthetic-captions
Dataset of prompts, synthetic AI generated images, and aesthetic ratings.
OutofAi/2D-Gaussian-Splatting
A 2D Gaussian Splatting paper for no obvious reasons. Enjoy!
ashawkey/diff-gaussian-rasterization
ThibaultGROUEIX/ChamferDistancePytorch
Chamfer Distance in Pytorch with f-score
facebookresearch/mmbt
Supervised Multimodal Bitransformers for Classifying Images and Text
JonathonLuiten/diff-gaussian-rasterization-w-depth
yikaiw/TokenFusion
[CVPR 2022] Code release for "Multimodal Token Fusion for Vision Transformers"
billywzh717/N24News
a-nagrani/CVPR2020_Poster
Speech2Action CVPR Poster Source Code
RuipingL/TransKD
yaoweilee/PMF
Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023
liangsheng02/Modular-and-Parameter-Efficient-Multimodal-Fusionwith-Prompting
songrise/ConditionalPrompt
[arXiv2023] Conditional Prompt Tuning for Multimodal Fusion