robotnc

NationalChip

robotnc's Stars

Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python 40.2k 452 3195.2k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 38k 303 1.2k 4.7k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 37.1k 328 4554.4k
DayBreak-u/chineseocr_lite
Ultra-lightweight Chinese OCR with vertical text recognition; supports ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size is only 4.7M
Language: C++ 12k 241 3702.3k
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python 7.2k 55 2071.3k
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language: Python 4.4k 42 104729
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python 3.6k 58 71316
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python 3k 87 98417
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python 2.5k 60 178274
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM
Language: Python 2.3k 31 126193
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language: Python 2.1k 48 127323
qiuqiangkong/audioset_tagging_cnn
Language: Python 1.4k 14 69261
Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Language: Jupyter Notebook 943 25 5182
dulaiduwang003/TIME-SEA-chatgpt
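AI platform built on SpringBoot3, with dual-client web pages and a mini program; includes various AI models and image generation, payment, data sync across both clients, custom preset prompts, and configurable feature modules; the web UI is mobile-compatible
Language: Vue 445 7 25120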
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python 325 10 2745
KevinMIN95/StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Language: Python 242 6 2138
openasic-org/xk265
xk265: HEVC/H.265 Video Encoder IP Core (RTL)
Language: Verilog 241 19 479
hyperconnect/TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
Language: Python 221 18 1256
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Language: Python 192 6 1723
Enny1991/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
Language: Python 191 4 449
WelkinYang/GradTTS
Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"
Language: Python 190 5 319
harvard-edge/multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Language: Jupyter Notebook 172 16 3338
wolverinn/HEVC-CU-depths-prediction-CNN
Using convolutional neural networks to predict the Coding Units (CUs) depths in HEVC intra-prediction mode, in order to reduce the time of the encoding process in HEVC.
Language: Python 81 2 827
TeamPyOgg/PyOgg
Simple OGG Vorbis, Opus and FLAC bindings for Python
Language: Python 68 5 6629
katsugeneration/tensor-fsmn
Feedforward Sequential Memory Networks (FSMN) implemented in TensorFlow
Language: Python 50 6 324
YiwenShaoStephen/pychain_example
Language: Python 48 5 1120
Edresson/Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Jupyter Notebook 33 2 022
yinruiqing/fsmn
Feedforward Sequential Memory Networks
Language: Python 15 4 03
xiaoli1996/SSBPR
5 3 00
d5555/FSMN
pytorch FSMN
Language: Python 3 1 01
