oryosu

tokyo

oryosu's Stars

ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Language:Python1.5k187
yxlllc/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Language:Python1.8k233
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
Language:Python43364
mmorise/World
A high-quality speech analysis, manipulation and synthesis system
Language:C++1.2k249
google/oboe
Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.
Language:C++3.6k554
RustAudio/cpal
Cross-platform audio I/O library in pure Rust
Language:Rust2.6k344
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language:Python71783
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
Language:Python15214
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.3k2k
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Language:C++1.3k453
shiguredo/sora-unity-sdk
WebRTC SFU Sora Unity SDK
Language:C++7312
astral-sh/rye
a Hassle-Free Python Experience
Language:Rust12.9k447
xiph/opus
Modern audio compression for the internet.
Language:C2.2k587
PlayVoice/lora-svc
singing voice change based on whisper, and lora for singing voice clone
Language:Python60979
WangHelin1997/MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Language:Python389
MWM-io/SpecTNT-pytorch
Unofficial implementation of SpecTNT in pytorch
Language:Python423
rkmt/summarize_arxv
Language:Python17320
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.1k738
Vaibhavs10/fast-whisper-finetuning
Language:Jupyter Notebook41334
solidiquis/erdtree
A modern, cross-platform, multi-threaded, and general purpose filesystem and disk-usage utility that is aware of .gitignore and hidden file rules.
Language:Rust2.3k60
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
Language:Python72990
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook34k4k
google/clasp
🔗 Command Line Apps Script Projects
Language:TypeScript4.5k422
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k67
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language:Python21.6k3.3k
kamepong/ConvS2S-VC
Language:Python297
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C++33.4k3.4k
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language:Python1.9k163
nnsvs/nnsvs
Neural network-based singing voice synthesis library for research
Language:Python67381
markowanga/stweet
Advanced python library to scrap Twitter (tweets, users) from unofficial API
Language:Python57767

oryosu

oryosu's Stars

ufal/whisper_streaming

yxlllc/DDSP-SVC

bshall/knn-vc

mmorise/World

google/oboe

RustAudio/cpal

gemelo-ai/vocos

sarulab-speech/UTMOS22

facebookresearch/audiocraft

llvm/torch-mlir

shiguredo/sora-unity-sdk

astral-sh/rye

xiph/opus

PlayVoice/lora-svc

WangHelin1997/MaskSpec

MWM-io/SpecTNT-pytorch

rkmt/summarize_arxv

facebookresearch/ImageBind

Vaibhavs10/fast-whisper-finetuning

solidiquis/erdtree

AndreyGuzhov/AudioCLIP

suno-ai/bark

google/clasp

archinetai/audio-ai-timeline

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

kamepong/ConvS2S-VC

ggerganov/whisper.cpp

archinetai/audio-diffusion-pytorch

nnsvs/nnsvs

markowanga/stweet