jhauret

PhD student in machine learning applied to acoustics at Cnam, Paris. Bringing deep learning from research to production.

Conservatoire National des Arts et Métiers, Paris

jhauret's Stars

bshall/hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Language: Python 33754
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python 37.9k 6.1k
Aria-K-Alethia/BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
Language: Python 1026
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Language: Python 50143
kyutai-labs/moshi
Language: Python 7k 546
MarcLafon/heatood
This repo contains the official implementation of Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection (ICML'23).
Language: Python 74
NathanGodey/headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)
Language: Python 244
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
77944
jhauret/vibravox
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
Language: Python 271
haoheliu/SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
Language: Python 16310
urgent-challenge/urgent2024_challenge
Official data preparation scripts for the URGENT 2024 Challenge
Language: Python 725
perladoubinsky/SemAug
[WACV 2024] Official implementation of the paper: Semantic Generative Augmentations for Few-shot Counting
Language: Python 9
muqiaoy/PAAP
Language: Python 282
huggingface/competitions
Language: Python 11712
jhauret/eben
Repo for source code of EBEN: Extreme Bandwidth Extension Network
Language: Python 709
SamsungLabs/hifi_plusplus
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)
Language: Python 767
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Language: Python 2.1k 162
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language: Python 2.8k 430
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language: Python 1.2k 117
QxLabIreland/listening-test
An open source platform for browser based speech and audio subjective quality tests.
Language: TypeScript 337
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language: Python 2.3k 424
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language: Python 21.1k 2.2k
samsad35/source-filter-vae
Learning and controlling the source-filter representation of speech with a variational autoencoder
Language: Python 455
google/visqol
Perceptual Quality Estimator for speech and audio
Language: C++ 712126
facebookresearch/Noresqa
This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.
Language: Python 9213
elevoctech/ESMB-corpus
151
RookieJunChen/Inter-SubNet
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
Language: Python 9512
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language: Jupyter Notebook 5.9k 1.2k
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
Language: Python 52851
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python 2.5k 266
