Labbeti
PhD in computer science, focused on automated audio captioning. Always looking to learn new things about deep learning. I also like hot chocolate and my cats.
IRITToulouse, France
Labbeti's Stars
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
bloomberg/memray
Memray is a memory profiler for Python
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
hsutter/cppfront
A personal experimental C++ Syntax 2 -> Syntax 1 compiler
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
crdoconnor/strictyaml
Type-safe YAML parser and validator.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
qiuqiangkong/audioset_tagging_cnn
toshas/torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
HazyResearch/H3
Language Modeling with the H3 State Space Model
samuela/git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
soundata/soundata
Python library for downloading, loading & working with sound datasets
mlco2/impact
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated latex template
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
coteries/cedille-ai
✒️ Cedille is a large French language model (6B), released under an open-source license
audio-captioning/audio-captioning-papers
A list of papers about audio captioning
XinhaoMei/DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
MorenoLaQuatra/audiocaps-download
This package aims at simplifying the download of the AudioCaps dataset.
Vaibhavs10/dcase-2023-workshop
ConstanceDws/DCASE_2023
RonFrancesca/SED-carbon-footprint
Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems
felixgontier/dcase2021aac
FlorentMeyer/fsd50k_speech_model_finetuning