dr-pato/audio_visual_speech_enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

PythonApache-2.0

Issues

On the issue of model training
#32 opened 2 years ago by lzyhub
0
Parameter setting
#24 opened 4 years ago by truewangxiaolong
2
Problem with files inside testing results
#23 opened 4 years ago by jungin-jin-choi
6
Input format
#20 opened 4 years ago by truewangxiaolong
2
Error from training script
#22 opened 4 years ago by jungin-jin-choi
1
Facial landmark extractor not working
#21 opened 4 years ago by jungin-jin-choi
1
Question about training vl2m with fixed TFRecord type
#18 opened 4 years ago by hmy410
3
Training: Values size X by output shape Y
#11 opened 5 years ago by nanometer34688
9
Training.py
#17 opened 5 years ago by pradnyapagar05
1
mean and standard deviation file
#14 opened 5 years ago by pradnyapagar05
1
little more detail about how to train AV c-ref model
#16 opened 5 years ago by khs8727
2
memory leak on gpu
#15 opened 5 years ago by khs8727
2
Training Model
#12 opened 5 years ago by pradnyapagar05
4
Parameter passing confusion
#13 opened 5 years ago by malineha
4
Training has "None values not supported"
#9 opened 5 years ago by nanometer34688
2
Value error when trying to cut audio
#10 opened 5 years ago by nanometer34688
4
Training function arguments problem
#7 opened 5 years ago by malineha
5
Training/Testing/Validation Set Split
#8 opened 5 years ago by malineha
2
Can't Understand the directory structure
#5 opened 5 years ago by Priyanka-narode
1
base training configuration
#2 opened 5 years ago by crankyz
3
can you tell me how to get GRID dataset
#3 opened 5 years ago by lvyilan23
4
TFRecords generation
#1 opened 6 years ago by crankyz
6