ajd12342

PhD CS, UT Austin. Prev. B.Tech CS, IIT Bombay. Working on ASR and NLP. I love writing clean, documented code.

UT AustinAustin, Texas

ajd12342's Stars

xai-org/grok-1
Grok open release
Language:Python49.6k 575 2108.3k
koalaman/shellcheck
ShellCheck, a static analysis tool for shell scripts
Language:Haskell36.5k 413 2.7k1.8k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python32.6k 187 5623.5k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.4k 120 1.1k1.4k
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python12.7k 125 7541.1k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python12.6k 140 7251.3k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook11k 142 3601.1k
dottxt-ai/outlines
Structured Text Generation
Language:Python9.8k 47 631499
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.8k 75 216589
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.7k 87 130748
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.7k 54 117476
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.9k 79 128661
THUNLP-MT/MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
Language:TeX2.4k 166 23449
rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Language:Python2.2k 55 87464
mut-ex/gligen-gui
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
Language:JavaScript2k 14 35191
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Language:Python1.9k 17 84171
haoheliu/voicefixer
General Speech Restoration
Language:Python1k 17 59133
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
Language:Shell649 18 6262
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
460 46 228
JarodMica/audiobook_maker
Language:Python313 9 5651
huggingface/dataspeech
Language:Python308 13 1647
jishengpeng/TextrolSpeech
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)
Language:Python147 7 15
gmltmd789/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
Language:Jupyter Notebook133 11 913
IDRnD/VoxTube
The VoxTube dataset official repository
Language:HTML61 5 41
vectominist/spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"
Language:Python44 2 55
raj-sutariya/indic-num2words
Python library for converting numbers to words for all Indian Languages.
Language:Python33 5 410
skit-ai/slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 2023.
Language:Jupyter Notebook23 4 13
amazon-science/synthesizrr
Synthesizing realistic and diverse text-datasets from augmented LLMs
Language:Python7 0 03
kaistmm/voxsim_trainer
[INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset
Language:Python7 2 00
GussailRaat/Devanagari-Hindi-Language-in-pdfLatex
Easy Steps to write Devanagari (Hindi) Language in pdfLatex
Language:Python5 1 03

ajd12342

ajd12342's Stars

xai-org/grok-1

koalaman/shellcheck

2noise/ChatTTS

Dao-AILab/flash-attention

SYSTRAN/faster-whisper

m-bain/whisperX

facebookresearch/seamless_communication

dottxt-ai/outlines

open-mmlab/Amphion

jasonppy/VoiceCraft

huggingface/parler-tts

metavoiceio/metavoice-src

THUNLP-MT/MT-Reading-List

rsennrich/subword-nmt

mut-ex/gligen-gui

AkariAsai/self-rag

haoheliu/voicefixer

SpeechColab/GigaSpeech

liusongxiang/Large-Audio-Models

JarodMica/audiobook_maker

huggingface/dataspeech

jishengpeng/TextrolSpeech

gmltmd789/UnitSpeech

IDRnD/VoxTube

vectominist/spin

raj-sutariya/indic-num2words

skit-ai/slu-prosody

amazon-science/synthesizrr

kaistmm/voxsim_trainer

GussailRaat/Devanagari-Hindi-Language-in-pdfLatex