kouohhashi's Stars
jaycode/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper (added support for other languages)
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
quillforms/quillforms
Open Source TypeForm Alternative Based on React JS and TypeScript | Best Typeform Clone | Conversational Multi Step Form
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
joas8211/payload-tenancy
Multi-tenancy plugin for Payload CMS
payloadcms/payload
Payload is the open-source, fullstack Next.js framework, giving you instant backend superpowers. Get a full TypeScript backend and admin panel instantly. Use Payload as a headless CMS or for building powerful applications.
payloadcms/enterprise-website
An enterprise website frontend that can show how to build large websites on a design system, at scale
payloadcms/enterprise-website-cms
An enterprise website CMS that can show how to build large websites on a design system, at scale
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
afialapis/reactstrap-date-picker
A Reactstrap-based, zero-dependency date picker
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
saharmor/realtime-transcription-playground
A real-time transcription project using React and Socket.IO
eduardtomasek/lz-string-python
lz-string for Python 3
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Alexander-H-Liu/End-to-end-ASR-Pytorch
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
majianjia/nnom
A higher-level Neural Network library for microcontrollers.
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
facebook/docusaurus
Easy to maintain open source documentation websites.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
BorisChumichev/kissfft-js
JavaScript port of KissFFT via Emscripten.
OpenMined/SyferText
A privacy-preserving NLP framework
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
OpenMined/PySyft
Perform data science on data that remains in someone else's server
SamLynnEvans/Transformer
Transformer seq2seq model: a program that can build a language translator from a parallel corpus
keitakurita/Practical_NLP_in_PyTorch
A repository containing tutorials for practical NLP using PyTorch
fastai/course-nlp
A Code-First Introduction to NLP course
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.