kamwoh

- Deep Learning Beginner at 2015 - PhD student at University of Surrey 🇬🇧 at 2021 - Research Intern at Meta at 2024

University of Surrey

kamwoh's Stars

danielgatis/rembg
Rembg is a tool to remove images background
Language:Python17.7k 150 5151.9k
Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
Language:Python17.3k 148 1.5k2k
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
Language:Python5.7k 127 219945
kohya-ss/sd-scripts
Language:Python5.6k 56 1.2k910
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Jupyter Notebook3.8k 43 188322
XPixelGroup/DiffBIR
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Language:Python3.5k 35 142295
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Language:Python2.3k 30 98147
pytorch/tnt
A lightweight library for PyTorch training tools and utilities
Language:Python1.7k 43 71278
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Language:Python1.5k 21 40148
Stability-AI/stable-fast-3d
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Language:Python1.3k 20 57153
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
Language:Python1.2k 45 2741
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language:Python1.2k 18 7366
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
Language:Python1.1k 12 88131
zhmiao/OpenLongTailRecognition-OLTR
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
Language:Python851 29 71129
clu0/unet.cu
UNet diffusion model in pure CUDA
Language:Cuda592 3 027
PrimeIntellect-ai/OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
Language:Python416 6 637
lunarring/lunar_tools
toolkit for interactive exhibitions
Language:Python284 5 125
instantX-research/CSGO
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
Language:Jupyter Notebook282 18 1710
naver-ai/rope-vit
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Language:Python265 9 157
YanzuoLu/CFLD
[CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Language:Jupyter Notebook204 6 4313
cvlab-epfl/MeshUDF
Fast and Differentiable Meshing of Unsigned Distance Field Networks
Language:Cython144 6 67
jiuntian/interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
Language:Python112 3 1610
yangxiaofeng/rectified_flow_prior
Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors
Language:Python92 4 83
EternalEvan/FlowIE
This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
Language:Python90 3 162
Surrey-UP-Lab/GS-LPM
Localized Gaussian Point Management
Language:Python60 3 72
tim-speed/flexdiffuse
Adaptation of Stable Diffusion with extra prompt guidance from images... An attempt at making the most flexible pipeline that will allow users to fully explore the capabilities of stable-diffusion.
Language:Python44 3 15
lyuPang/CrossInitialization
Language:Python36 2 24
zju-vipa/ProtoPFormer
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
Language:Python34 3 511
cardinalblue/ArtAdapter
Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
30 15 40
RajaeeKh/TriNerfLet
TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Code
Language:HTML22 5 20

kamwoh

kamwoh's Stars

danielgatis/rembg

Mikubill/sd-webui-controlnet

rtqichen/torchdiffeq

kohya-ss/sd-scripts

Tencent/HunyuanDiT

XPixelGroup/DiffBIR

IDEA-Research/DWPose

pytorch/tnt

facebookresearch/multimodal

Stability-AI/stable-fast-3d

gnobitab/InstaFlow

LTH14/mar

Zheng-Chong/CatVTON

zhmiao/OpenLongTailRecognition-OLTR

clu0/unet.cu

PrimeIntellect-ai/OpenDiloco

lunarring/lunar_tools

instantX-research/CSGO

naver-ai/rope-vit

YanzuoLu/CFLD

cvlab-epfl/MeshUDF

jiuntian/interactdiffusion

yangxiaofeng/rectified_flow_prior

EternalEvan/FlowIE

Surrey-UP-Lab/GS-LPM

tim-speed/flexdiffuse

lyuPang/CrossInitialization

zju-vipa/ProtoPFormer

cardinalblue/ArtAdapter

RajaeeKh/TriNerfLet