Pinned Repositories
additional_openjtalk_dic
DeepLearningExamples
Deep Learning Examples
espnet
End-to-End Speech Processing Toolkit
fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).
FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
groonga-command-token-count
groonga-tokenizer-yangram
huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
merumeru-rururu's Repositories
merumeru-rururu/espnet
End-to-End Speech Processing Toolkit
merumeru-rururu/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
merumeru-rururu/additional_openjtalk_dic
merumeru-rururu/DeepLearningExamples
Deep Learning Examples
merumeru-rururu/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).
merumeru-rururu/FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
merumeru-rururu/groonga-command-token-count
merumeru-rururu/groonga-tokenizer-yangram
merumeru-rururu/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
merumeru-rururu/jtubespeech
merumeru-rururu/lhotse
Tools for handling speech data in machine learning projects.
merumeru-rururu/mammoth.js
Convert Word documents (.docx files) to HTML
merumeru-rururu/myFM
A Python/C++ implementation of Bayesian Factorization Machines
merumeru-rururu/phrase_break_prediction
Scripts for training a phrase break prediction system
merumeru-rururu/pyJuliusAlign
One-button-press forced aligner for Japanese, using Julius.
merumeru-rururu/pyopenjtalk
Python wrapper for OpenJTalk
merumeru-rururu/pyvcroid2
Python Library to Access to Core DLL of VOICEROID2
merumeru-rururu/Recommender-System-LightFM
scalable Recommeder System for e-commerece using LightFM package in python
merumeru-rururu/rvc-webui
This project is a fork of liujing04/Retrieval-based-Voice-Conversion-WebUI
merumeru-rururu/soxan
Wav2Vec for speech recognition, classification, and audio classification
merumeru-rururu/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
merumeru-rururu/Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
merumeru-rururu/StyleTTS
Official Implementation of StyleTTS
merumeru-rururu/TTSController
各種 Text-to-Speech エンジンを統一的に操作するライブラリです
merumeru-rururu/ttsQuestV3Voicevox
merumeru-rururu/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
merumeru-rururu/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
merumeru-rururu/voiceroid_daemon
VOICEROID2のHTTPサーバーデーモン
merumeru-rururu/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
merumeru-rururu/voicevox_cli_client
VOICEVOX ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した並列処理もできます