SherlockSunset
Research Areas: Computer Vsion, Object Detection, 3D Defect Inspection.
Nanyang Technological UniversitySingapore
SherlockSunset's Stars
stuffmatic/fSpy
A cross platform app for quick and easy still image camera matching
facebookresearch/pippo
Pippo: High-Resolution Multi-View Humans from a Single Image
bcmi/Light-A-Video
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
LizhenWangT/StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
FireRedTeam/FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
TimeMarker-LLM/TimeMarker
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
Saiyan-World/goku
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
pq-yang/MatAnyone
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
RupertLuo/Valley
The official repository of "Video assistant towards large language model makes everything easy"
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
deepseek-ai/DeepSeek-R1
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
cangcz/AnchorCrafter
conallwang/MeGA
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
TencentARC/StereoCrafter
A framework to convert any 2D videos to immersive stereoscopic 3D
jixiaozhong/Sonic
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Pscgylotti/RAIN
mks0601/ExAvatar_RELEASE
Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
JollyToday/AI_Image_Translator_Translate_Images
AI Image Translation Tool-An excellent translator for photos, pictures, posters, covers, banners and product images.AI图片翻译-很棒的批量跨境电商|海报|商品图片翻译,擦除干净,排版整齐。
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Anonym0u3/AttentiveEraser
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
fudan-generative-vision/hallo3
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
sicxu/Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
Brian417-cup/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"