
The purpose of this project is to become familiar with the basic stages of the data mining process: collection, preprocessing, cleaning, conversion, application of data mining techniques, and evaluation. The implementation is in Python, using scikit-learn and the gensim library.
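As a rough illustration of how these stages can fit together, here is a minimal sketch using scikit-learn on a toy text corpus. It is not the project's run.py: the corpus, the `clean` and `run_pipeline` helpers, and the choice of TF-IDF with a naive Bayes classifier are all assumptions made for the example.

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB

# Collection: a tiny in-memory corpus stands in for real data (toy, hypothetical).
documents = [
    ("The team won the football match", "sport"),
    ("The striker scored two goals", "sport"),
    ("The coach praised the players", "sport"),
    ("Fans cheered in the stadium", "sport"),
    ("The company reported higher profits", "business"),
    ("Shares fell after the market opened", "business"),
    ("The bank raised interest rates", "business"),
    ("Investors bought more stock", "business"),
]


def clean(text):
    """Cleaning: lowercase and collapse every run of non-letters to one space."""
    return re.sub(r"[^a-z]+", " ", text.lower()).strip()


def run_pipeline(docs, seed=0):
    """Run preprocessing, conversion, classification and evaluation."""
    texts = [clean(text) for text, _ in docs]
    labels = [label for _, label in docs]

    # Preprocessing: hold out a quarter of the documents for evaluation.
    x_train, x_test, y_train, y_test = train_test_split(
        texts, labels, test_size=0.25, random_state=seed, stratify=labels
    )

    # Conversion: turn the text into TF-IDF feature vectors.
    vectorizer = TfidfVectorizer(stop_words="english")
    train_vectors = vectorizer.fit_transform(x_train)
    test_vectors = vectorizer.transform(x_test)

    # Data mining technique: a multinomial naive Bayes classifier.
    model = MultinomialNB().fit(train_vectors, y_train)

    # Evaluation: accuracy on the held-out documents.
    return accuracy_score(y_test, model.predict(test_vectors))


if __name__ == "__main__":
    print("Accuracy on the toy hold-out set:", run_pipeline(documents))
```

With so few documents the accuracy figure is not meaningful; the sketch only shows the order of the stages.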

#Big Data Mining Techniques

##Prerequisites

To run the project you need to have installed:

  • Python 3
  • NumPy
  • pandas
  • gensim
  • nltk
  • scikit-learn (imported as sklearn)
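
One way to confirm that the packages above are available before running the project is a small check script. This is a sketch, not part of the repository: the `REQUIRED_MODULES` list and the `missing_modules` helper are hypothetical names, and the check only tests whether each module can be located, not which version is installed.

```python
from importlib.util import find_spec

# Import names of the packages listed above (assumed from the list).
REQUIRED_MODULES = ["numpy", "pandas", "gensim", "nltk", "sklearn"]


def missing_modules(modules=REQUIRED_MODULES):
    """Return the names of modules that cannot be found on this interpreter."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```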

##To run the project

  1. Update the paths defined in the run.py file.
  2. Run: python3 run.py