Pinned Repositories
AI_Bot_for_playing_PacMan
A deep convolutional neural net (CNN) for playing the well-known arcade game ‘Ms. Pac-Man’ trained using Q-learning (reinforcement learning). Game data is obtained using the ‘OpenAI Gym’. The neural net significantly outperforms all tried uniform-random strategies. We do not inject any prior knowledge in the Q-nets. For instance, we do not explicitly use the distance from the ghosts - the system only has access to the game frames and has to learn everything by itself.
Creative_Text_Generator
A deep recurrent neural net (RNN) for both character-level and word-level language modeling that, given any starting context, generates creative text depending on the training data. Training was done using OANC corpus. Even with probabilistic sampling of character sequences, the RNN almost always outputs valid dictionary words making meaningful sentences.
dlr
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonathan Berant).
dss
Sequence Modeling with Structured State Spaces
English_Vietnamese_Translator
A recurrent neural net (RNN) based Neural Machine Translator for translating English to Vietnamese built using an Encoder-Decoder model. Training was done using a small-scale parallel corpus of TED talks.
gmat
The accompanying code for the paper "GMAT: Global Memory Augmentation for Transformers" (Ankit Gupta and Jonathan Berant).
injecting_numeracy
The accompanying code for "Injecting Numerical Reasoning Skills into Language Models" (Mor Geva*, Ankit Gupta* and Jonathan Berant, ACL 2020).
mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George Saon, Brian Kingsbury. Interspeech 2024).
Sentiment_Analysis
A deep recurrent neural network (RNN) based classifier trained using the IMDb reviews dataset for detecting +ve/-ve sentiment in text. Text is encoded using trainable word embeddings and we get 90% test accuracy. Classifier can then be used to determine the sentiment score on Twitter data collected using the Twitter API.
top_k_attention
The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonathan Berant. SustaiNLP 2021).
ag1988's Repositories
ag1988/injecting_numeracy
The accompanying code for "Injecting Numerical Reasoning Skills into Language Models" (Mor Geva*, Ankit Gupta* and Jonathan Berant, ACL 2020).
ag1988/dss
Sequence Modeling with Structured State Spaces
ag1988/top_k_attention
The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonathan Berant. SustaiNLP 2021).
ag1988/dlr
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonathan Berant).
ag1988/mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George Saon, Brian Kingsbury. Interspeech 2024).
ag1988/gmat
The accompanying code for the paper "GMAT: Global Memory Augmentation for Transformers" (Ankit Gupta and Jonathan Berant).
ag1988/AI_Bot_for_playing_PacMan
A deep convolutional neural net (CNN) for playing the well-known arcade game ‘Ms. Pac-Man’ trained using Q-learning (reinforcement learning). Game data is obtained using the ‘OpenAI Gym’. The neural net significantly outperforms all tried uniform-random strategies. We do not inject any prior knowledge in the Q-nets. For instance, we do not explicitly use the distance from the ghosts - the system only has access to the game frames and has to learn everything by itself.
ag1988/English_Vietnamese_Translator
A recurrent neural net (RNN) based Neural Machine Translator for translating English to Vietnamese built using an Encoder-Decoder model. Training was done using a small-scale parallel corpus of TED talks.
ag1988/Sentiment_Analysis
A deep recurrent neural network (RNN) based classifier trained using the IMDb reviews dataset for detecting +ve/-ve sentiment in text. Text is encoded using trainable word embeddings and we get 90% test accuracy. Classifier can then be used to determine the sentiment score on Twitter data collected using the Twitter API.
ag1988/Creative_Text_Generator
A deep recurrent neural net (RNN) for both character-level and word-level language modeling that, given any starting context, generates creative text depending on the training data. Training was done using OANC corpus. Even with probabilistic sampling of character sequences, the RNN almost always outputs valid dictionary words making meaningful sentences.
ag1988/Snapchat_Filter
A Snapchat-like application based on the standard pipeline of 1) Face Detection using Histogram of Oriented Gradients (HOG) descriptor, 2) Facial Landmark Estimation using Kazemi-Sullivan and finally, 3) adding overlays/lenses (like funny glasses, etc) to the frames using the previously estimated landmarks.
ag1988/value_aware_attn
The accompanying code for "Value-aware Approximate Attention" (Ankit Gupta and Jonathan Berant, EMNLP 2021).