Jeremy Chow
7/15/2019
Goal: Generate text in the style of Edgar Allen Poe, specifically emulating his writing style in the short story dataset from Kaggle "Spooky Author Identification" competition: https://www.kaggle.com/c/spooky-author-identification
- Embedding layer
- Helps model understand 'meaning' of words by mapping them to representative vector space instead of semantic integers
- Stacked LSTM layers
- Stacked LSTMs add more depth than additional cells in a single LSTM layer (see paper: https://arxiv.org/abs/1303.5778)
- The first LSTM layer must have
return sequences
flag set to True in order to pass sequence information to the second LSTM layer instead of just its end states
- Dense (regression) layer with ReLU activation
- Dense layer with Softmax activation
- Outputs word probability across entire vocab
Input: First of all I dismembered the corpse
Model:
"First of all i dismembered the corpse which is profound since in the first place he appeared at first so suddenly as any matter no answer was impossible to find my sake he now happened to be sure it was he suspected or caution or gods and some voice held forth upon"