POS-tagging-with-RNNs 📑 ➡️ 🆔

In this assignment we address the task of POS tagging on the well known Penn Treebank Dataset. The aim is to implement and compare the performances of classic recurrent models which make use of GloVe embeddings to later choose the best two among them and analyze their errors. The obtained results are quite satisfactory, even considering the reduced size of the model (in terms of number of parameters).

Implemented models

We've implemented, trained, validated and tested the 4 following models:

1. BiLSTM (baseline)

This model has just a bi-directional layer of LSTM cells, in addition to the dense input layer (followed by the fixed embedding layer) and the dense time-distributed output layer.

2. Double BiLSTM

Similar to the baseline, but with an additional bidirectional LSTM layer.

3. BiLSTM + Dense

Similar to the baseline, but with an additional Dense distributed layer before the output layer.

4. BiGRU

Similar to the baseline, but with a bi-directional GRU layer instead of a bi-directional LSTM layer.

Instructions

Run Assignment_1.ipynb to visualize and/or reproduce our train-validation-test-error analysis pipeline 📗
Read assignment_1.pdf to visualize the report 📜