The Gritty Details of Deep Learning Common Derivatives Activations Softmax Numerical Stability Levenshtein Distance and WER LSTM: Forward and Backward CTC: Connectionist Temporal Classification LSA: Listen-Attend-Spell HMM: Hidden Markov Model