renxiangnan's Stars
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
dragonflydb/dragonfly
A modern replacement for Redis and Memcached
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
fishaudio/fish-speech
Brand new TTS solution
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
shenweichen/AlgoNotes
【浅梦学习笔记】文章汇总:包含 排序&CXR预估,召回匹配,用户画像&特征工程,推荐搜索综合 计算广告,大数据,图算法,NLP&CV,求职面试 等内容
harlanhong/awesome-talking-head-generation
microsoft/NeuralSpeech
google/tensorstore
Library for reading and writing large multi-dimensional arrays.
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
PlayVoice/vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
apachecn/apachecn-dl-zh
ApacheCN 深度学习译文集
gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
huangjunheng/recommendation_model
练习下用pytorch来复现下经典的推荐系统模型, 如MF, FM, DeepConn, MMOE, PLE, DeepFM, NFM, DCN, AFM, AutoInt, ONN, FiBiNET, DCN-v2, AFN, DCAP等
lucidrains/local-attention
An implementation of local windowed attention for language modeling
saifhassan/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
lifeiteng/SoundStorm
oceancx/Realtime-Voice-Clone-Chinese
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
gzn00417/Commercial-Video-Recognition
基于数据挖掘的tik tok商用广告视频识别
uclwe/rtb
Bid optimisation methods for real-time bidding in online display advertising.
databricks-industry-solutions/real-time-bidding
From display to video, the value of an impression can only be realized if an ad is viewed by a user. Therefore, when using programmatic advertising to buy inventory, it’s important to take viewability into account. In this Solution Accelerator, learn how to predict ad viewability to optimize your real-time bidding strategy.