Data-Mining-Script-Rating-Prediction

Uses ensemble learning (a stacked Gradient Boosting and AdaBoost model) and text analytics techniques (NER, sentiment analysis, etc.) to predict TV series ratings from their scripts.

Primary language: Jupyter Notebook

Data-Mining-Script-Generation

  • Get Metadata
  • Get Script Data
  • Clean Metadata
  • Clean Script Data
  • Identify data mining attributes
  • Mine attributes
  • Consolidate Data
  • Build Classifier
  • Test and evaluate classifier
  • Detailed data analysis with the script to generate graphs
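
The "Build Classifier" and "Test and evaluate classifier" steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook code: it assumes scikit-learn is installed, uses synthetic features from `make_classification` as a stand-in for the mined script attributes, treats the rating as a binary class label, and picks logistic regression as the meta-learner. All of those choices are assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    GradientBoostingClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Assumption: synthetic data stands in for the consolidated attribute table
# (e.g. sentiment scores and entity counts per script).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)

# Hold out a quarter of the samples for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Stack the two boosted models; their cross-validated predictions
# become the inputs of the final estimator.
stacked = StackingClassifier(
    estimators=[
        ("gb", GradientBoostingClassifier(n_estimators=50, random_state=0)),
        ("ada", AdaBoostClassifier(n_estimators=50, random_state=0)),
    ],
    final_estimator=LogisticRegression(),  # assumed meta-learner
    cv=5,
)
stacked.fit(X_train, y_train)

# Evaluate on the held-out split.
predictions = stacked.predict(X_test)
accuracy = accuracy_score(y_test, predictions)
print(f"held-out accuracy: {accuracy:.3f}")
```

In a real run, `X` would be replaced by the consolidated attribute table and `y` by the rating labels; if ratings are kept as continuous values, `StackingRegressor` with the corresponding regressors would be the analogous choice.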