renxiangnan

renxiangnan's Stars

iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language:Python46.7k 1.1k 1.3k10.4k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python36.1k 366 3145.6k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python33.3k 279 1.1k4k
dragonflydb/dragonfly
A modern replacement for Redis and Memcached
Language:C++25.2k 161 1.1k908
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.6k 203 3722.1k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook12.8k 170 5111.8k
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
Language:C++8.6k 201 2.6k1.7k
fishaudio/fish-speech
Brand new TTS solution
Language:Python7.4k 61 329588
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Language:Jupyter Notebook3.1k 48 79406
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language:Python2.9k 88 97417
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
2.9k 53 8247
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language:Python2.4k 30 116196
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python2.4k 60 170255
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2k 49 126320
shenweichen/AlgoNotes
【浅梦学习笔记】文章汇总:包含排序&CXR预估，召回匹配，用户画像&特征工程，推荐搜索综合计算广告，大数据，图算法，NLP&CV，求职面试等内容
1.6k 31 0220
harlanhong/awesome-talking-head-generation
1.4k 77 3109
microsoft/NeuralSpeech
Language:Python1.4k 34 124183
google/tensorstore
Library for reading and writing large multi-dimensional arrays.
Language:C++1.3k 29 149120
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language:Python1.3k 53 31100
PlayVoice/vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
Language:Python1.1k 24 162168
apachecn/apachecn-dl-zh
ApacheCN 深度学习译文集
Language:JavaScript779 21 7198
gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
Language:Python637 17 43108
huangjunheng/recommendation_model
练习下用pytorch来复现下经典的推荐系统模型, 如MF, FM, DeepConn, MMOE, PLE, DeepFM, NFM, DCN, AFM, AutoInt, ONN, FiBiNET, DCN-v2, AFN, DCAP等
Language:Python515 4 7117
lucidrains/local-attention
An implementation of local windowed attention for language modeling
Language:Python367 4 1939
saifhassan/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
Language:Python343 13 4576
lifeiteng/SoundStorm
70 16 04
oceancx/Realtime-Voice-Clone-Chinese
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python49 0 014
gzn00417/Commercial-Video-Recognition
基于数据挖掘的tik tok商用广告视频识别
Language:Jupyter Notebook13 1 01
uclwe/rtb
Bid optimisation methods for real-time bidding in online display advertising.
Language:Jupyter Notebook13 1 17
databricks-industry-solutions/real-time-bidding
From display to video, the value of an impression can only be realized if an ad is viewed by a user. Therefore, when using programmatic advertising to buy inventory, it’s important to take viewability into account. In this Solution Accelerator, learn how to predict ad viewability to optimize your real-time bidding strategy.
Language:Python4 4 02