Pinned Repositories
pml2-book
Probabilistic Machine Learning: Advanced Topics
A-Simple-Baseline-For-Knowledge-Based-VQA
Repo for the EMNLP 2023 paper "A Simple Baseline for Knowledge-Based Visual Question Answering"
ConvLab-3
deep-clustering
A TensorFlow implementation of Deep clustering: Discriminative embeddings for segmentation and separation
end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
GHR
Capturing Conversational Interaction for Question Answering via Global History Reasoning (Qian et al., NAACL Findings 2022)
KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
Lipreading-ResNet
Torch code for using Residual Networks with LSTMs for Lipreading
s3prl_correlation
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Speaker-Embeddings-Correlation-Pooling
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
tstafylakis's Repositories
tstafylakis/Lipreading-ResNet
Torch code for using Residual Networks with LSTMs for Lipreading
tstafylakis/Speaker-Embeddings-Correlation-Pooling
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
tstafylakis/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
tstafylakis/s3prl_correlation
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
tstafylakis/A-Simple-Baseline-For-Knowledge-Based-VQA
Repo for the EMNLP 2023 paper "A Simple Baseline for Knowledge-Based Visual Question Answering"
tstafylakis/ConvLab-3
tstafylakis/deep-clustering
A TensorFlow implementation of Deep clustering: Discriminative embeddings for segmentation and separation
tstafylakis/end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
tstafylakis/GHR
Capturing Conversational Interaction for Question Answering via Global History Reasoning (Qian et al., NAACL Findings 2022)