Pinned Repositories
ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
blivechat
用于OBS的仿YouTube风格的bilibili直播评论栏
chat-langchain
chatgpt-on-wechat
使用ChatGPT搭建微信聊天机器人,基于OpenAI API和itchat实现。Wechat robot based on ChatGPT, which using OpenAI api and itchat library.
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion
DenseMutualAttention
[WACV2023] Interacting Hand-Object Pose Estimation via Dense Mutual Attention
dgrasp
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
Ditto
Code for Ditto: Building Digital Twins of Articulated Objects from Interaction
douyin_crawl
抖音视频批量爬取
EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
josh-zhu's Repositories
josh-zhu/ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
josh-zhu/chat-langchain
josh-zhu/chatgpt-on-wechat
使用ChatGPT搭建微信聊天机器人,基于OpenAI API和itchat实现。Wechat robot based on ChatGPT, which using OpenAI api and itchat library.
josh-zhu/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion
josh-zhu/DenseMutualAttention
[WACV2023] Interacting Hand-Object Pose Estimation via Dense Mutual Attention
josh-zhu/dgrasp
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
josh-zhu/douyin_crawl
抖音视频批量爬取
josh-zhu/EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
josh-zhu/EasyVC
变声技术综合评比
josh-zhu/EDTalk
[ECCV 2024] EDTalk - Official PyTorch Implementation
josh-zhu/fish-speech
Brand new TTS solution
josh-zhu/GaussianAvatars
[CVPR 2024 (Highlight)] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
josh-zhu/HandAvatar
josh-zhu/hyperreel
Code release for HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling
josh-zhu/langchain
⚡ Building applications with LLMs through composability ⚡
josh-zhu/langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
josh-zhu/LivePortrait
Make one portrait alive!
josh-zhu/MidJourney-Wrapper
MidJourney wrapper in Discord.
josh-zhu/react-nice-avatar
react library for generating avatar
josh-zhu/ScalingNeuralFaceSynthesis
josh-zhu/so-vits-svc
SoftVC VITS Singing Voice Conversion
josh-zhu/talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
josh-zhu/TextBox
TextBox 2.0 is a text generation library with pre-trained language models
josh-zhu/torch-ngp
A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.
josh-zhu/TTS-arxiv-daily
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
josh-zhu/v4l2loopback
v4l2-loopback device
josh-zhu/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
josh-zhu/vall-e-1
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Can be trained on a single GPU!
josh-zhu/visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
josh-zhu/whisperX
WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.