andrewowens/multisensory

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

PythonApache-2.0

Issues

difference between "large" and "full" sep models
#47 opened 2 years ago by sanjeelparekh
0
Download sample-data.zip NOT FOUND
#46 opened 3 years ago by elinaoikonomaki
1
model architecture
#45 opened 3 years ago by riyaj8888
1
Download pretrain models
#44 opened 3 years ago by WikiChao
3
Question about the original audio waveform input
#43 opened 3 years ago by luhuijun666
0
How to calculate SDR?
#41 opened 4 years ago by ruizewang
1
Question about the test in Table 2 GRID transfer
#42 opened 4 years ago by ruizewang
0
question about sourcesep training result on new dataset
#21 opened 5 years ago by xiaoyiming
7
Why acc doesn't change when shift_model training?
#31 opened 5 years ago by ruizewang
10
Questions about sourcesep.py
#39 opened 4 years ago by ruizewang
2
About the input format
#11 opened 6 years ago by ASHA-KOTERU
6
file input for blind audio source separation
#20 opened 5 years ago by prashantmaheshwari94
1
Improvement on using pretrained model
#40 opened 4 years ago by ChaitanyaBoggavarapu
0
Questions about VoxCeleb2 dataset
#36 opened 5 years ago by YiyuLuo
1
What GPU used?
#38 opened 4 years ago by ruizewang
0
Issue with sound source localization
#33 opened 5 years ago by jacobsharp10
1
duration_mult flag
#37 opened 5 years ago by kzhang3256
0
Issue on datasets
#35 opened 5 years ago by LindaCY
1
Issue on Large Videos
#32 opened 5 years ago by ChaitanyaBoggavarapu
2
Some questions about training and testing shift model
#34 opened 5 years ago by ruizewang
0
Question about the test in Table 3
#19 opened 5 years ago by THU-cui
1
What are feats['im_0'] and feats['im_1'] of example for shift model?
#30 opened 5 years ago by ruizewang
2
question about using 'sep_example.tf'
#23 opened 5 years ago by wl3b10s
2
Could you provide the dataset?
#28 opened 5 years ago by ruizewang
0
About the input file for shift model training
#29 opened 5 years ago by ruizewang
0
Questions about the files in ".txt" format used to train the "shift" model
#18 opened 5 years ago by yxixi
2
Question about the shift_net.py' training
#16 opened 6 years ago by xiaoyiming
3
In which way the video frames combine
#25 opened 5 years ago by tuffr5
10
What is the format of the tensor in the code?
#26 opened 5 years ago by tuffr5
0
I RuntimeError: Command failed! ffmpeg -i "/tmp/ao_wmjz0ezg.wav" -r 29.970000 -loglevel warning -safe 0 -f concat -i "/tmp/ao_i2pwi0b8.txt" -pix_fmt yuv420p -vcodec h264 -strict -2 -y -acodec aac "results/fg_translator.mp4"
#24 opened 5 years ago
4
RuntimeError: Command failed! ffmpeg -i "/tmp/ao_M0QAze.wav" -r 29.970000 -loglevel warning -safe 0 -f concat -i "/tmp/ao_cnpblR.txt" -pix_fmt yuv420p -vcodec h264 -strict -2 -y -acodec aac "../results/fg_cam_translator.mp4"
#5 opened 6 years ago by xsingit
4
whre is the sep_module (calss or funtion）in sourcesep.py
#15 opened 6 years ago by xiaoyiming
3
Question about training
#9 opened 6 years ago by Lugangz
10
> > In the source separation model it seems like you are using *.tf files as input (rec_files_from_path in sep_dset.py).Can you please provide the format to create those TFRecord files
#22 opened 5 years ago by xuanhanyu
0
How to train the "shift" and "cam" model for sound source location?
#17 opened 6 years ago by yxixi
4
Questions about the entrance of the training function
#14 opened 6 years ago by yxixi
1
TypeError: convolution() got multiple values for argument 'weights_regularizer'
#7 opened 6 years ago by chouqin3
5
Test set used in paper
#13 opened 6 years ago by medhini
0
error compiling
#10 opened 6 years ago by Askdeep
0
make_video_helper() missing 3 required positional arguments: 'x', 'in_dir', and 'tmp_ext'
#8 opened 6 years ago by Lugangz
1
Questions about the models
#6 opened 6 years ago by orthosiphon
3
Question about fine-tune for full sep model
#3 opened 6 years ago by LionnelBall
3
Supported on Linux
#4 opened 6 years ago by rsmithgi
0
How do I run source separation on a different video?
#2 opened 6 years ago by jayavanth
2
Getting /bin/sh: 1: ffmpeg-length: not found
#1 opened 6 years ago by jayavanth
2