audio-captioning

There are 27 repositories under audio-captioning topic.

soham97/awesome-sound_event_detection
Reading list for research topics in Sound AI
176 9 19
Labbeti/aac-datasets
Audio Captioning datasets for PyTorch.
Language:Python113 2 36
TheoCoombes/ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
Language:Python94 6 513
audio-captioning/clotho-dataset
Python code for handling the Clotho dataset.
Language:Python80 5 315
ilaria-manco/muscaps
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
Language:Jupyter Notebook78 5 37
ilaria-manco/song-describer
Song Describer is a data collection platform for annotating music with textual descriptions.
Language:Python57 5 15
an-tran528/wavetransformer
Code base for WaveTransformer: A novel architecture for automated audio captioning
Language:Python43 1 19
Labbeti/aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
Language:Python40 3 113
audio-captioning/dcase-2020-baseline
Audio captioning baseline system for DCASE 2020 challenge.
Language:Python38 3 1211
slSeanWU/beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Language:Jupyter Notebook36 2 31
soham97/sound_ai_progress
Tracking states of the arts and recent results (bibliography) on sound tasks.
32 5 01
minguinho26/Prefix_AAC_ICASSP2023
Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"
Language:Jupyter Notebook29 2 52
lukewys/dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
Language:Python22 2 65
blmoistawinde/fense
Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.
Language:Python20 2 01
ExplainableML/ZerAuCap
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
Language:Python17 7 01
audio-captioning/caption-evaluation-tools
Tools for the evaluation of audio captioning.
Language:Jupyter Notebook16 4 12
Labbeti/conette-audio-captioning
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
Language:Python14 2 40
iOPENCap/awesome-unimodal-training
text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
11 0 00
Sreyan88/RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
Language:Python11 2 10
abikaki/DCASE-Workshop-Papers
Workshop on Detection and Classification of Acoustic Scenes and Events
10 2 00
satvik-dixit/mace
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
Language:Python10 2 01
Labbeti/dcase2024-task6-baseline
DCASE2024 Challenge Task 6 baseline system (Automated Audio Captioning)
Language:Python5 2 20
audio-captioning/clotho-dataloader
PyTorch dataloader for Clotho dataset.
Language:Python4 2 10
paniquex/Automated_Audio_Captioning_DCASE2020
6-th task solution of DCASE2020
Language:Python4 1 00
dr-costas/clotho-baseline-dataset
Code for using with the Clotho dataset
Language:Python3 1 11
zelaki/wsac
This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training
Language:Python2 1 00
Labbeti/dcase2021task6
IRIT-UPS DCASE 2021 AUDIO CAPTIONING SYSTEM
Language:Python0 2 00

audio-captioning

soham97/awesome-sound_event_detection

Labbeti/aac-datasets

TheoCoombes/ClipCap

audio-captioning/clotho-dataset

ilaria-manco/muscaps

ilaria-manco/song-describer

an-tran528/wavetransformer

Labbeti/aac-metrics

audio-captioning/dcase-2020-baseline

slSeanWU/beats-conformer-bart-audio-captioner

soham97/sound_ai_progress

minguinho26/Prefix_AAC_ICASSP2023

lukewys/dcase_2020_T6

blmoistawinde/fense

ExplainableML/ZerAuCap

audio-captioning/caption-evaluation-tools

Labbeti/conette-audio-captioning

iOPENCap/awesome-unimodal-training

Sreyan88/RECAP

abikaki/DCASE-Workshop-Papers

satvik-dixit/mace

Labbeti/dcase2024-task6-baseline

audio-captioning/clotho-dataloader

paniquex/Automated_Audio_Captioning_DCASE2020

dr-costas/clotho-baseline-dataset

zelaki/wsac

Labbeti/dcase2021task6