aliceebaird

@HumeAI NY

aliceebaird's Stars

ml-explore/mlx
MLX: An array framework for Apple silicon
Language:C++18.3k 150 5931.1k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python13.3k 141 7531.4k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.8k 212 2.4k2.6k
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.5k 99 93783
SkalskiP/courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Language:Python5.5k 92 7501
wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
Language:Python4.1k 44 122283
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Language:Python2.2k 33 158163
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python2k 31 165514
DmitryUlyanov/Multicore-TSNE
Parallel t-SNE implementation with Python and Torch wrappers.
Language:C++1.9k 43 63229
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
1.6k 69 5140
Kyubyong/g2p
g2p: English Grapheme To Phoneme Conversion
Language:Python829 18 26129
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Language:Python671 15 3658
shahules786/mayavoz
Pytorch based speech enhancement toolkit.
Language:Python333 14 1624
Jiaxin-Ye/TIM-Net_SER
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
Language:Python167 10 2226
neonbjb/tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Language:Python147 5 1415
afourast/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Language:Python111 11 926
HumeAI/hume-python-sdk
Python client for Hume AI
Language:Python97 11 1523
iariav/End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
Language:Python48 1 511
HumeAI/competitions
Hume AI ML Competitions
Language:Python23 14 25
EIHW/MuSe-2023
Language:Python18 4 07
facebookresearch/emphassess
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses paper (de Seyssel et al., 2023).
Language:Python14 4 21
lstappen/MuSe-Toolbox
A Phyton toolbox to fuse multiple continuous emotion annotations from several raters and diarization them to classes!
Language:MATLAB14 5 12
dusty-phillips/similar-sounding-words
A list of similar sounding words to help disambiguate voice coding
Language:HTML12 1 01
zaocan666/DyViSE
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
Language:Python11 1 12
felixbur/syntAct
Scripts to generate a database of simulated emotional expression.
Language:Python8 3 01
idiap/ExVo-2022
Extracting pre-trained self-supervised embeddings for ICML ExVO 2022 challenge
Language:Python5 3 0
EIHW/ComParE2023
Language:Nix4 3 10
EIHW/prototypical-network-audio-evaluation
Language:Python4 3 02
nfb-onf/sound-of-laughter
Language:Python3 11 00
aliceebaird/temp_blanket
Language:Python1 1 00

aliceebaird

aliceebaird's Stars

ml-explore/mlx

m-bain/whisperX

NVIDIA/NeMo

facebookresearch/ImageBind

SkalskiP/courses

wookayin/gpustat

linto-ai/whisper-timestamped

jik876/hifi-gan

DmitryUlyanov/Multicore-TSNE

csteinmetz1/ai-audio-startups

Kyubyong/g2p

csteinmetz1/pyloudnorm

shahules786/mayavoz

Jiaxin-Ye/TIM-Net_SER

neonbjb/tts-scores

afourast/avobjects

HumeAI/hume-python-sdk

iariav/End-to-End-VAD

HumeAI/competitions

EIHW/MuSe-2023

facebookresearch/emphassess

lstappen/MuSe-Toolbox

dusty-phillips/similar-sounding-words

zaocan666/DyViSE

felixbur/syntAct

idiap/ExVo-2022

EIHW/ComParE2023

EIHW/prototypical-network-audio-evaluation

nfb-onf/sound-of-laughter

aliceebaird/temp_blanket