This repository provides the procedure to conduct experiments with Lighthouse for the paper "Language-based Audio Moment Retrieval" (under review). In addition, it supports the following functionalities (coming soon):
- Generation of Clotho-Moments from Clotho and UnAv-100
- Extraction of CLAP Features
- Evaluation of Zero-shot Sound Event Detection
-
Install Lighthouse
-
Download extracted CLAP features of Clotho-Moment/UnAV100-subset/TUT Sound Events 2017 from here
- You can also download wav files from here
-
Set the path to the downloaded features in "(LIGHTHOUSE_PATH)/features".
- For example, if you downloaded Clotho-Moment features, set the path to "(LIGHTHOUSE_PATH)/features/clotho-moment".
-
Run the following command to train the AMR model:
python training/train.py --model qd_detr --dataset clotho-moment --feature clap
-
Run the following command to evaluate the AMR model:
model=qd_detr dataset=unav100-subset feature=clap model_path={lighthouse_dir}/results/qd_detr/clotho-moment/clap/best.ckpt eval_split_name=val eval_path=data/unav100-subset/unav100-subset_test_release.jsonl python training/evaluate.py \ --model $model \ --dataset $dataset \ --feature $feature \ --model_path $model_path \ --eval_split_name $eval_split_name \ --eval_path $eval_path
In ./clotho-moment_generetor
, you can find the procedure to generate Clotho-Moments from Clotho and Walking Tours.
Please read the README.md in the directory for more details.
In ./feature_extractor
, you can find the procedure to extract CLAP features for lighthouse.
Please read the README.md in the directory for more details.
In ./zero-shot_sed_eval
, you can find the procedure to evaluate the zero-shot SED system.
Please read the README.md in the directory for more details.