- Understanding LSTM Networks by Chris Olah
- The Unreasonable Effectiveness of Recurrent Neural Networks by Andrej Karpathy
- Exploring LSTMs by Edwin Chen
- A really good conceptual overview of Word2Vec from Chris McCormick
- First Word2Vec paper from Mikolov et al.
- Neural Information Processing Systems, paper with improvements for Word2Vec also from Mikolov et al.
- Awesome blog with amazing illustrations explaning RNN Seq2Seq models and Attention networks.