matthijsvk/multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
Jupyter NotebookMIT
Issues
- 0
Error during training
#2 opened by QDZ123
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
Jupyter NotebookMIT