dr-pato/audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
PythonApache-2.0
Issues
- 0
On the issue of model training
#32 opened by lzyhub - 2
Parameter setting
#24 opened by truewangxiaolong - 6
Problem with files inside testing results
#23 opened by jungin-jin-choi - 2
Input format
#20 opened by truewangxiaolong - 1
Error from training script
#22 opened by jungin-jin-choi - 1
Facial landmark extractor not working
#21 opened by jungin-jin-choi - 3
- 9
Training: Values size X by output shape Y
#11 opened by nanometer34688 - 1
Training.py
#17 opened by pradnyapagar05 - 1
mean and standard deviation file
#14 opened by pradnyapagar05 - 2
- 2
memory leak on gpu
#15 opened by khs8727 - 4
Training Model
#12 opened by pradnyapagar05 - 4
Parameter passing confusion
#13 opened by malineha - 2
- 4
Value error when trying to cut audio
#10 opened by nanometer34688 - 5
Training function arguments problem
#7 opened by malineha - 2
Training/Testing/Validation Set Split
#8 opened by malineha - 1
- 3
base training configuration
#2 opened by crankyz - 4
can you tell me how to get GRID dataset
#3 opened by lvyilan23 - 6
TFRecords generation
#1 opened by crankyz