Pinned Repositories
Activity-Recognition-using-Keras
Recognises activities based on the given video. Model is trained to identify sports activities.
DOA
DOA
five-video-classification-methods
Code that accompanies my blog post outlining five video classification methods in Keras and TensorFlow
rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
tdoa
TDOA based on GCC-PHAT
Text-Classification
Implementation of papers for text classification task on DBpedia
adventures-in-ml-code
This repository holds all the code for the site http://www.adventuresinmachinelearning.com
arXivTimes
repository to research & share the machine learning articles
audio-captioning-papers
A list of papers about audio captioning
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
tamzeed-unc's Repositories
tamzeed-unc/sync-audio-rec-mqtt
tamzeed-unc/csi-processing
tamzeed-unc/speechbrain
A PyTorch-based Speech Toolkit
tamzeed-unc/audio-captioning-papers
A list of papers about audio captioning
tamzeed-unc/rulstm
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention. International Conference on Computer Vision, 2019.
tamzeed-unc/temporal-binding-network
Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
tamzeed-unc/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
tamzeed-unc/five-video-classification-methods
Code that accompanies my blog post outlining five video classification methods in Keras and TensorFlow
tamzeed-unc/speech-driven-animation
tamzeed-unc/EmotionalConversionStarGAN
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
tamzeed-unc/BAGAN
Keras implementation of Balancing GAN (BAGAN) applied to the MNIST example.
tamzeed-unc/rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
tamzeed-unc/ganhacks
starter from "How to Train a GAN?" at NIPS2016
tamzeed-unc/adventures-in-ml-code
This repository holds all the code for the site http://www.adventuresinmachinelearning.com
tamzeed-unc/docs
TensorFlow documentation
tamzeed-unc/keras-GTN
An example implementation of Uber's Generative Teaching Network (GTN) with Keras
tamzeed-unc/arXivTimes
repository to research & share the machine learning articles
tamzeed-unc/seld-net
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
tamzeed-unc/DOA
DOA
tamzeed-unc/cnn-lstm-network
Tensorflow implementation of embed CNN-LSTM network for sentiment analysis task.
tamzeed-unc/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
tamzeed-unc/spleeter
Deezer source separation library including pretrained models.
tamzeed-unc/Activity-Recognition-using-Keras
Recognises activities based on the given video. Model is trained to identify sports activities.
tamzeed-unc/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
tamzeed-unc/fake-voice-detection
Using temporal convolution to detect Audio Deepfakes
tamzeed-unc/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
tamzeed-unc/WiAR
WiFi-based activity recognition dataset
tamzeed-unc/CycleGAN
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
tamzeed-unc/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
tamzeed-unc/multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features