/Duplicate-Questions-Classifier

This is my first project on NLP algorithms and techniques to identify duplicate questions.

Primary LanguageJupyter Notebook

Mini Project V

The repo with instructions for Mini Project V

Welcome to your final mini-project of this bootcamp. We hope you will enjoy it.

Description

We will combine the skills we developed in the previous modules to identify duplicate questions in a dataset provided by Quora. This dataset was labeled by human experts which is an expensive process. The model you will build will need to automatically identify and label duplicate questions.

We are going to need to build a classifier model to achieve this result.

Data

The labeled dataset can be downloaded from here.