andrewowens/multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Python, Apache-2.0
Issues
Download sample-data.zip NOT FOUND
#46 opened by elinaoikonomaki - 1
model architecture
#45 opened by riyaj8888 - 3
Download pretrain models
#44 opened by WikiChao - 0
How to calculate SDR?
#41 opened by ruizewang - 0
Questions about sourcesep.py
#39 opened by ruizewang - 6
About the input format
#11 opened by ASHA-KOTERU - 1
Questions about VoxCeleb2 dataset
#36 opened by YiyuLuo - 0
What GPU used?
#38 opened by ruizewang - 1
Issue with sound source localization
#33 opened by jacobsharp10 - 0
duration_mult flag
#37 opened by kzhang3256 - 1
Issue on datasets
#35 opened by LindaCY - 2
Issue on Large Videos
#32 opened by ChaitanyaBoggavarapu - 0
Question about the test in Table 3
#19 opened by THU-cui - 2
question about using 'sep_example.tf'
#23 opened by wl3b10s - 0
Could you provide the dataset?
#28 opened by ruizewang - 0
About the input file for shift model training
#29 opened by ruizewang - 2
Question about the 'shift_net.py' training
#16 opened by xiaoyiming - 10
In which way the video frames combine
#25 opened by tuffr5 - 0
What is the format of the tensor in the code?
#26 opened by tuffr5 - 4
RuntimeError: Command failed! ffmpeg -i "/tmp/ao_M0QAze.wav" -r 29.970000 -loglevel warning -safe 0 -f concat -i "/tmp/ao_cnpblR.txt" -pix_fmt yuv420p -vcodec h264 -strict -2 -y -acodec aac "../results/fg_cam_translator.mp4"
#5 opened by xsingit - 3
Question about training
#9 opened by Lugangz - 0
In the source separation model it seems like you are using *.tf files as input (rec_files_from_path in sep_dset.py). Can you please provide the format to create those TFRecord files?
#22 opened by xuanhanyu - 4
TypeError: convolution() got multiple values for argument 'weights_regularizer'
#7 opened by chouqin3 - 0
Test set used in paper
#13 opened by medhini - 0
error compiling
#10 opened by Askdeep - 1
make_video_helper() missing 3 required positional arguments: 'x', 'in_dir', and 'tmp_ext'
#8 opened by Lugangz - 3
Questions about the models
#6 opened by orthosiphon - 3
Supported on Linux
#4 opened by rsmithgi - 2