Implementation of some algorithms for text clustering

Primary language: Python

QA-Clustering

  • Answers are clustered using the following methods:

    1. LDA (latent Dirichlet allocation)
      • gensim module
      • scikit-learn module
    2. Hierarchical
    3. Incremental
  • Words can be weighted in two ways:

    1. Bag of Words
    2. TF-IDF
  • Similarity between words can be measured in two ways:

    1. Euclidean Distance
    2. Edit Distance
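
As a rough illustration of the weighting and distance options listed above, here is a minimal standard-library sketch. It is not taken from this repository's code: the function names (bag_of_words, tf_idf, euclidean_distance, edit_distance, incremental_clusters), the tf * log(N / df) weighting variant, and the single-pass threshold rule used for the incremental method are all assumptions made for illustration. The LDA and hierarchical methods are not covered here.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Return (vocabulary, count vectors) for a list of tokenized documents."""
    vocab = sorted({tok for doc in docs for tok in doc})
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([counts[t] for t in vocab])
    return vocab, vectors


def tf_idf(docs):
    """Return (vocabulary, TF-IDF vectors) using tf * log(N / df).

    Assumption: this is one common TF-IDF variant; libraries often add
    smoothing or normalization on top of it.
    """
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = [sum(1 for vec in counts if vec[i] > 0) for i in range(len(vocab))]
    vectors = [
        [tf * math.log(n_docs / df[i]) for i, tf in enumerate(vec)]
        for vec in counts
    ]
    return vocab, vectors


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def edit_distance(s, t):
    """Levenshtein edit distance, computed row by row."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, start=1):
        curr = [i]
        for j, ct in enumerate(t, start=1):
            cost = 0 if cs == ct else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def incremental_clusters(vectors, threshold):
    """Single-pass clustering (assumed interpretation of "incremental").

    Each vector joins the nearest existing cluster if its centroid is within
    `threshold`; otherwise it starts a new cluster.
    """
    centroids, members = [], []
    for idx, vec in enumerate(vectors):
        dists = [euclidean_distance(vec, c) for c in centroids]
        if dists and min(dists) <= threshold:
            k = dists.index(min(dists))
            members[k].append(idx)
            n = len(members[k])
            # Update the running mean of the chosen cluster.
            centroids[k] = [c + (x - c) / n for c, x in zip(centroids[k], vec)]
        else:
            centroids.append(list(vec))
            members.append([idx])
    return members
```

For example, calling bag_of_words or tf_idf on a list of tokenized answers yields vectors that can be passed to incremental_clusters with a distance threshold chosen for the data, while edit_distance compares two strings directly (edit_distance("kitten", "sitting") returns 3).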