- Import and observe dataset
- Combine Wikipedia and IMDb plot summaries
- Tokenization
- Stemming
- Club together Tokenize & Stem
- Create TfidfVectorizer
- Fit transform TfidfVectorizer
- Import KMeans and create clusters
- Calculate similarity distance
- Import Matplotlib, Linkage, and Dendrograms
- Create merging and plot dendrogram
- Which movies are most similar?