Pinned Repositories
catt
The official implementation of CATT Arabic diacritization models.
adversarial-machine-learning-example
Train a CNN model on MNIST dataset and use it to develop an adversarial example to fool the model
deepspeech2-online-decoder
Online (real-time) decoder to be used with DeepSpeech2 model
kaldi-ruby-api
An API wrapper written in Ruby for Kaldi ASRs
linear-regression-numpy
Implementation of Linear Regression Model using the Normal Equation (Closed-form solution) and the Gradient Descent Algorithm (Open-form solution))
psu-language-modeling-session
The code of the "Language Models and Their Applications" session
psu-sentiment-analysis-session
PSU Sentiment Analysis Session Code
sbvqa2.0
The official implementation of the paper: SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions
shieldrnn
The implementation of ShieldRNN
wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
farisalasmary's Repositories
farisalasmary/wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
farisalasmary/psu-language-modeling-session
The code of the "Language Models and Their Applications" session
farisalasmary/sbvqa2.0
The official implementation of the paper: SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions
farisalasmary/psu-sentiment-analysis-session
PSU Sentiment Analysis Session Code
farisalasmary/shieldrnn
The implementation of ShieldRNN
farisalasmary/adversarial-machine-learning-example
Train a CNN model on MNIST dataset and use it to develop an adversarial example to fool the model
farisalasmary/arastance
farisalasmary/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
farisalasmary/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
farisalasmary/ctcdecode
PyTorch CTC Decoder bindings
farisalasmary/CTDNN
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
farisalasmary/DeepFilterNet
Noise supression using deep filtering
farisalasmary/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
farisalasmary/elgen
farisalasmary/Face-Transformer
Face Transformer for Recognition
farisalasmary/facenet-pytorch-glint360k
A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
farisalasmary/kaldi-serve
Server framework for Kaldi ASR Toolkit
farisalasmary/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
farisalasmary/NeMo
NeMo: a toolkit for conversational AI
farisalasmary/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
farisalasmary/pydub
Manipulate audio with a simple and easy high level interface
farisalasmary/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
farisalasmary/sequitur-g2p
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
farisalasmary/Speech-Transformer
PyTorch re-implementation of Speech-Transformer
farisalasmary/train-transformer-xl-huggingface
This repo contains a notebook that illustrates how to train Transformer-XL on 🤗 Transformers library
farisalasmary/transformer
PyTorch Implementation of "Attention Is All You Need"
farisalasmary/transformer-xl
farisalasmary/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
farisalasmary/VQVAE-Pytorch
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
farisalasmary/warp-ctc
Pytorch Bindings for warp-ctc FIX ERRORS related to CUDA 10.1