hkchengrex

Ph.D. student at the University of Illinois Urbana-Champaign. Oxygen consuming.

Champaign, IL

Pinned Repositories

CascadePSP
[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Language:Python869 16 7296
Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
Language:Python974 6 13588
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook41 1 011
Mask-Propagation
[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.
Language:Python130 7 4621
MiVOS
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
Language:Python484 15 5563
MMAudio
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Language:Python1.9k 22 79227
Scribble-to-Mask
[CVPR 2021] MiVOS - Scribble to Mask module
Language:Python88 4 715
STCN
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Language:Python561 8 15971
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Language:Python1.4k 15 115137
XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language:Python1.9k 20 147204

hkchengrex's Repositories

hkchengrex/MMAudio
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Language:Python1.9k 22 79227
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language:Python1.9k 20 147204
hkchengrex/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Language:Python1.4k 15 115137
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
Language:Python974 6 13588
hkchengrex/CascadePSP
[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Language:Python869 16 7296
hkchengrex/STCN
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Language:Python561 8 15971
hkchengrex/MiVOS
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
Language:Python484 15 5563
hkchengrex/Mask-Propagation
[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.
Language:Python130 7 4621
hkchengrex/Scribble-to-Mask
[CVPR 2021] MiVOS - Scribble to Mask module
Language:Python88 4 715
hkchengrex/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook41 1 011
hkchengrex/av-benchmark
Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs, LAION-CLAP, MS-CLAP, DeSync
Language:Python36 2 62
hkchengrex/vos-benchmark
Fast and general video object segmentation evaluation.
Language:Python33 2 45
hkchengrex/C2OT
[ICCV 2025] The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation
Language:Jupyter Notebook16 1 11
hkchengrex/nitrous-ema
Fast and simple post-hoc EMA (Karras et al., 2023) for PyTorch with minimal `.item()` calls. ~78% lower overhead than ema_pytorch.
Language:Python13 1 0
hkchengrex/davis2016-evaluation
Language:Python8 3 10
hkchengrex/shared-memory-tensor-dataset
This repository provides an example of reading from a single shared memory tensor from multiple processes (e.g., with DDP).
Language:Python5 1 01
hkchengrex/BlenderVOSRenderer
Language:Python2 2 0
hkchengrex/STM
Video Object Segmentation using Space-Time Memory Networks
Language:Python2 2 0
hkchengrex/ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model
Language:Python1 0 0
hkchengrex/kinetics_to_frames
Convert kinetics datasets (or other video datasets) to frames. Support resizing and temporal sampling for space efficiency.
Language:Python1 2 01
hkchengrex/Single-View-Metrology-Step-By-Step
An implementation of Single View Metrology (Criminisi99) with step-by-step guidance in a Jupyter Notebook.
Language:Jupyter Notebook1 2 0
hkchengrex/CLAP
Contrastive Language-Audio Pretraining
Language:Python0 0
hkchengrex/fbrs_interactive_segmentation
[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331
Language:Python2 0
hkchengrex/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python0 01
hkchengrex/MS-CLAP
Learning audio concepts from natural language supervision
Language:Python0 01
hkchengrex/passt_hear21
Inference code for PaSST, using the HEAR API.
Language:Python0 0
hkchengrex/pythia
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python2 0
hkchengrex/RAFT
Language:Python1 0
hkchengrex/RGMP
Fast Video Object Segmentation by Reference-Guided Mask Propagation
Language:Python2 0
hkchengrex/so
Stackoverflow answers
Language:Python2 0