Pinned Repositories
ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
blivechat
A YouTube-style bilibili live-stream comment bar for OBS
chat-langchain
chatgpt-on-wechat
A WeChat chatbot built with ChatGPT, based on the OpenAI API and the itchat library.
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion
DenseMutualAttention
[WACV2023] Interacting Hand-Object Pose Estimation via Dense Mutual Attention
dgrasp
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
Ditto
Code for Ditto: Building Digital Twins of Articulated Objects from Interaction
douyin_crawl
Batch crawler for Douyin videos
EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
josh-zhu's Repositories
josh-zhu/blivechat
A YouTube-style bilibili live-stream comment bar for OBS
josh-zhu/Ditto
Code for Ditto: Building Digital Twins of Articulated Objects from Interaction
josh-zhu/EgoHOS
Fine-Grained Egocentric Hand-Object Segmentation, ECCV 2022
josh-zhu/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
josh-zhu/google-research
Google Research
josh-zhu/GRAM
GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation (CVPR 2022 Oral)
josh-zhu/hififace
Unofficial PyTorch Implementation for HifiFace (https://arxiv.org/abs/2106.09965)
josh-zhu/HOIG
[NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation
josh-zhu/IMS-Toucan
IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models. Everything is pure Python and PyTorch based to keep it as simple and beginner-friendly, yet as powerful, as possible.
josh-zhu/josh-zhu
Config files for my GitHub profile.
josh-zhu/LIVE-Layerwise-Image-Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
josh-zhu/NeRF-Art
NeRF-Art: Text-Driven Neural Radiance Fields Stylization
josh-zhu/NeuRay
[CVPR2022] Neural Rays for Occlusion-aware Image-based Rendering
josh-zhu/One-Shot-Voice-Cloning
One-Shot Voice Cloning based on Unet-TTS
josh-zhu/OTVM
One-Trimap Video Matting (ECCV 2022)
josh-zhu/Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
josh-zhu/pixano-app
Pixano App is a web-based smart-annotation tool for computer vision applications.
josh-zhu/plla-tisvs
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
josh-zhu/ppg-vc
PPG-Based Voice Conversion
josh-zhu/RPCMVOS
[AAAI22_Oral] Reliable Propagation-Correction Modulation for Video Object Segmentation
josh-zhu/S2CRNet
josh-zhu/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
josh-zhu/SemanticGuidedHumanMatting
Robust Human Matting via Semantic Guidance, ACCV 2022.
josh-zhu/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
josh-zhu/SSP-NeRF
Code for "Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation"
josh-zhu/stylegan3-editing
Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" https://arxiv.org/abs/2201.13433
josh-zhu/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
josh-zhu/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone