Pinned Repositories
amazon-freertos
Cloud-native IoT operating system for microcontrollers.
AnimateAnyone-unofficial
Unofficial Implementation of Animate Anyone
audiowmark
Audio Watermarking
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
autotuner
base
basic library for cross platform,implement by c++,some code from open source chromium.
chrome-music-lab
A collection of experiments for exploring how music works, all built with the Web Audio API.
chromium-base
Trimmed Chromium base library, based on zcbenz/base-minimal and mini_chromium, but with more useful stuff. Currently works under linux and windows only.
reproducingcodes
Project Repo for Reproducible Codes Class
vid2vid
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
linzai1992's Repositories
linzai1992/AnimateAnyone-unofficial
Unofficial Implementation of Animate Anyone
linzai1992/audiowmark
Audio Watermarking
linzai1992/auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
linzai1992/autotuner
linzai1992/chrome-music-lab
A collection of experiments for exploring how music works, all built with the Web Audio API.
linzai1992/crank
Non-parallel Voice Conversion
linzai1992/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
linzai1992/DurIAN
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
linzai1992/face-nn
游戏捏脸,基于神经风格迁移框架生成逼真人脸
linzai1992/google-research
Google Research
linzai1992/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
linzai1992/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
linzai1992/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
linzai1992/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
linzai1992/multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
linzai1992/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
linzai1992/NeuralVoicePuppetryMMD
This github contains the network architectures of NeuralVoicePuppetry.
linzai1992/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
linzai1992/pitch-net
Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).
linzai1992/PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
linzai1992/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
linzai1992/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
linzai1992/speech-driven-animation
linzai1992/spleeter
Deezer source separation library including pretrained models.
linzai1992/torch_npss
pytorch implementation of Neural Parametric Singing Synthesizer 歌声合成
linzai1992/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
linzai1992/VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation
linzai1992/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
linzai1992/WaveGrad
Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
linzai1992/WGANSing
Multi-voice singing voice synthesis