vincentwi's Stars
lbzhao970/PsFuture
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
huggingface/parler-tts
Inference and training library for high-quality TTS models.
just-an-experiment/viva-translate
Real-time translation copilot for your browser
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
vincentwi/Anomaly-Detection
lukasz-madon/awesome-remote-job
A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python
iv-org/invidious
Invidious is an alternative front-end to YouTube
JulienRineau/wav2vec
PyTorch implementation of wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
bytedance/neurst
Neural end-to-end Speech Translation Toolkit
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
zhangshaolei1998/Awesome-Simultaneous-Translation
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Snowad14/IAStreameur
AIStreameur: Make your favorite people stream!
pjsip/pjproject
PJSIP project
frgnca/AudioDeviceCmdlets
AudioDeviceCmdlets is a suite of PowerShell Cmdlets to control audio devices on Windows
audiorouterdev/audio-router
Routes audio from programs to different audio devices.
xenolightning/AudioSwitcher
.NET Library which facilitates interacting with Audio Devices on Windows
briankendall/proxy-audio-device
A virtual audio driver for macOS that sends all audio to another output
Sharrnah/whispering
Whispering Tiger - OpenAI's Whisper (and other models) with OSC and WebSocket support, allowing live transcription and translation in VRChat and overlays in most streaming applications
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
Tally-Health/BuccalComparison
iScience paper 2022
ahkarami/Great-Deep-Learning-Tutorials
A Great Collection of Deep Learning Tutorials and Repositories
maziarraissi/Applied-Deep-Learning
Applied Deep Learning Course
MITDeepLearning/introtodeeplearning
Lab Materials for MIT 6.S191: Introduction to Deep Learning
hendrycks/ethics
Aligning AI With Shared Human Values (ICLR 2021)
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.