quora-question-pairs

TODO (ALEX):

TODO(Abhishek):

  • Review code.
  • Check the hyperparameters for the experiments.

TODO(Cipta):

  • Finish autoencoder experiments
  • Integrate with Alex's models

TODO (ALL):

  • Ensemble Best Models
    • Linear Interpolation
    • Hand Crafted Features concatenated with model predictions and run through model

Hand Crafted Features:

  • n words q1, n words q2
  • difference in word count
  • pct word similarity
  • number of common words
  • cosine similarity of avrage of word2vex embeddings
  • pos tags
  • Start with same quesiton word

Evaluation:

  • Accuracy
  • Precision
  • Recall
  • F1