Varshul's Stars
Open-Speech-EkStep/indic-punct
romner-set/btop-gpu
A monitor of resources, forked for GPU support – merged into btop!
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
152334H/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
dubverse-ai/MahaVed
Collection of open source speech datasets
LAION-AI/natural_voice_assistant
Vaibhavs10/open-tts-tracker
soumik-kanad/diff2lip
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
dubverse-ai/MahaTTS
dubverse-ai/rvc-data-prep
Extract and isolate vocals from media files. Supports multispeaker media as well.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
EleutherAI/DALLE-mtf
OpenAI's DALL-E for large-scale training in mesh-tensorflow.
sigsep/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
nd7141/icml2020
Notebook for comprehensive analysis of authors, organizations, and countries of ICML 2020 papers.
monicahq/monica
Personal CRM. Remember everything about your friends, family and business relationships.
oiusu/Tacotron-2
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
slanglabs-projects/asr-wer-bench
Workbench for benchmarking Word Error Rate (WER) of Automatic Speech Recognition (ASR) systems on a given data set.
narVidhai/tamil-nlp-catalog
Awesome List of Tamil NLP & AI Resources
Verssae/flask-tacotron2-tts-web-app
Flask + Tornado based NVIDIA Tacotron2 + WaveGlow TTS web app
CUNY-CL/wikipron-modeling
Proposed splits for the LREC Wikipron paper
CUNY-CL/wikipron
Massively multilingual pronunciation mining
libindic/indic-trans
The project aims to add a state-of-the-art transliteration module for cross-transliteration among all Indian languages, including English.
Jeevesh8/GCP-PyTorch-Setup
PyTorch, CUDA and Anaconda Setup for Ubuntu 18.04 LTS VM on GCP