Labbeti

PhD in computer science, focused on automated audio captioning. Always looking to learn new things about deep learning. I also like hot chocolate and my cats.

IRITToulouse, France

Labbeti's Stars

facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python21.2k 212 3942.2k
bloomberg/memray
Memray is a memory profiler for Python
Language:Python13.5k 59 202397
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python12.9k 132 224875
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
Language:Python10.2k 113 714895
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.6k 36 2981.1k
hsutter/cppfront
A personal experimental C++ Syntax 2 -> Syntax 1 compiler
Language:C++5.6k 113 743252
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python4.9k 57 236426
crdoconnor/strictyaml
Type-safe YAML parser and validator.
Language:Python1.5k 28 16361
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.5k 29 93148
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Language:Python1.4k 21 166182
qiuqiangkong/audioset_tagging_cnn
Language:Python1.4k 14 69258
toshas/torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
Language:Python1k 7 3770
HazyResearch/H3
Language Modeling with the H3 State Space Model
Language:Assembly516 32 2654
samuela/git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
Language:Python476 8 1441
soundata/soundata
Python library for downloading, loading & working with sound datasets
Language:Python329 11 7923
mlco2/impact
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated latex template
Language:HTML212 6 1740
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
Language:Python209 5 2712
coteries/cedille-ai
✒️ Cedille is a large French language model (6B), released under an open-source license
203 13 312
audio-captioning/audio-captioning-papers
A list of papers about audio captioning
78 7 28
XinhaoMei/DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
Language:Jupyter Notebook35 2 76
MorenoLaQuatra/audiocaps-download
This package aims at simplifying the download of the AudioCaps dataset.
Language:Python31 2 44
Vaibhavs10/dcase-2023-workshop
Language:Jupyter Notebook15 3 01
ConstanceDws/DCASE_2023
Language:Jupyter Notebook61
RonFrancesca/SED-carbon-footprint
Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems
Language:Jupyter Notebook6 2 00
felixgontier/dcase2021aac
Language:Python5 2 10
FlorentMeyer/fsd50k_speech_model_finetuning
Language:Python2 1 00

Labbeti

Labbeti's Stars

facebookresearch/audiocraft

bloomberg/memray

BlinkDL/RWKV-LM

kkroening/ffmpeg-python

lucidrains/denoising-diffusion-pytorch

hsutter/cppfront

lucidrains/x-transformers

crdoconnor/strictyaml

LAION-AI/CLAP

microsoft/Semi-supervised-learning

qiuqiangkong/audioset_tagging_cnn

toshas/torch-fidelity

HazyResearch/H3

samuela/git-re-basin

soundata/soundata

mlco2/impact

XinhaoMei/WavCaps

coteries/cedille-ai

audio-captioning/audio-captioning-papers

XinhaoMei/DCASE2021_task6_v2

MorenoLaQuatra/audiocaps-download

Vaibhavs10/dcase-2023-workshop

ConstanceDws/DCASE_2023

RonFrancesca/SED-carbon-footprint

felixgontier/dcase2021aac

FlorentMeyer/fsd50k_speech_model_finetuning