A curated list of awesome embedding models tutorials, projects and communities. Please feel free to pull requests to add links.
Word2vec
- Efficient Estimation of Word Representations in Vector Space
- Distributed Representations of Words and Phrases and their Compositionality
- word2vec Parameter Learning Explained
- word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method
GloVe
- GloVe: Global Vectors for Word Representation
- Improving Word Representations via Global Context and Multiple Word Prototypes
FastText
Embedding Enhancement
- Retrofitting Word Vectors to Semantic Lexicons
- Better Word Representations with Recursive Neural Networks for Morphology
- Dependency-Based Word Embeddings
- Not All Neural Embeddings are Born Equal
- Two/Too Simple Adaptations of Word2Vec for Syntax Problems
Comparing count-based vs predict-based method
- Linguistic Regularities in Sparse and Explicit Word Representations
- Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors
- Improving Distributional Similarity with Lessons Learned from Word Embeddings
Evaluation Method
Sentence
Document
- SENSEMBED: Learning Sense Embeddings for Word and Relational Similarity
- Multi-Prototype Vector-Space Models of Word Meaning
- Recurrent neural network based language model
- A Neural Probabilistic Language Model
- Linguistic Regularities in Continuous Space Word Representations
- SemEval-2012 Task 2
- WordSimilarity-353
- Stanford's Contextual Word Similarities (SCWS)
- Stanford Rare Word (RW) Similarity Dataset