kamwoh
- Deep Learning Beginner in 2015 - PhD student at University of Surrey 🇬🇧 in 2021 - Research Intern at Meta in 2024
University of Surrey
kamwoh's Stars
danielgatis/rembg
Rembg is a tool to remove image backgrounds
Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
kohya-ss/sd-scripts
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
XPixelGroup/DiffBIR
[ECCV 2024] Code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
pytorch/tnt
A lightweight library for PyTorch training tools and utilities
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Stability-AI/stable-fast-3d
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
gnobitab/InstaFlow
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM at 1024×768 resolution).
zhmiao/OpenLongTailRecognition-OLTR
PyTorch implementation of "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
clu0/unet.cu
UNet diffusion model in pure CUDA
PrimeIntellect-ai/OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
lunarring/lunar_tools
toolkit for interactive exhibitions
instantX-research/CSGO
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
naver-ai/rope-vit
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
YanzuoLu/CFLD
[CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
cvlab-epfl/MeshUDF
Fast and Differentiable Meshing of Unsigned Distance Field Networks
jiuntian/interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
yangxiaofeng/rectified_flow_prior
Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors
EternalEvan/FlowIE
This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
Surrey-UP-Lab/GS-LPM
Localized Gaussian Point Management
tim-speed/flexdiffuse
Adaptation of Stable Diffusion with extra prompt guidance from images... An attempt at making the most flexible pipeline that will allow users to fully explore the capabilities of stable-diffusion.
lyuPang/CrossInitialization
zju-vipa/ProtoPFormer
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
cardinalblue/ArtAdapter
Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
RajaeeKh/TriNerfLet
TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Code