Text Classification Analysis

Full Documentation can be found here

The task is to classify a video into one of several classes based on its title and description, using four techniques (Naive Bayes, Support Vector Machines, AdaBoost, and LSTM), and to analyze their performance. The classes include, but are not limited to:

  • Travel Blogs
  • Science and Technology
  • Food
  • Manufacturing
  • History
  • Art and Music
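
To make the classification task concrete, here is a minimal sketch of one of the listed techniques, Naive Bayes, written from scratch with only the Python standard library. It is an illustration under stated assumptions, not the project's actual implementation: the class name `NaiveBayesTextClassifier`, the `tokenize` helper, and the toy training sentences are all made up for this example, and the title and description are assumed to be joined into a single string.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing.

    Hypothetical helper class for illustration only.
    """

    def fit(self, texts, labels):
        # Number of training documents per class (used for the class prior).
        self.class_counts = Counter(labels)
        # Per-class word frequencies.
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        # Vocabulary: every word seen in any class.
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # log P(class) + sum of log P(word | class), smoothed.
            score = math.log(n_docs / total_docs)
            score += sum(math.log((counts[t] + 1) / denom) for t in tokens)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy, made-up examples (title and description joined into one string).
train_texts = [
    "Backpacking through Iceland: a week of hiking, hostels and waterfalls",
    "How quantum computers work: qubits and algorithms explained",
    "Easy homemade pasta recipe with fresh tomato sauce",
    "The fall of the Roman Empire: a documentary on ancient history",
]
train_labels = ["Travel Blogs", "Science and Technology", "Food", "History"]

model = NaiveBayesTextClassifier().fit(train_texts, train_labels)
prediction = model.predict("Quick recipe: creamy tomato pasta for dinner")
print(prediction)
```

With real data, the same `fit` and `predict` interface would be fed the scraped titles and descriptions instead of the four toy sentences, and a library implementation would normally replace the hand-written class.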

Data Gathering

This problem requires metadata about videos belonging to different categories, so I used the YouTube Data API v3. The scraped raw data is saved in the 'Collected_data_raw.csv' file.
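
As a rough sketch of what such a gathering step could look like, the snippet below builds a request URL for the API's `search` endpoint and flattens a response into CSV rows. It makes no network call. The helper names (`build_search_url`, `rows_from_response`, `write_rows`), the CSV column names, the query string, and the sample response are assumptions made for illustration; the actual script and the real layout of 'Collected_data_raw.csv' may differ.

```python
import csv
import io
from urllib.parse import urlencode

# Search endpoint of the YouTube Data API v3.
SEARCH_ENDPOINT = "https://www.googleapis.com/youtube/v3/search"

# Hypothetical column layout; the real CSV may use different columns.
FIELDNAMES = ["video_id", "title", "description", "category"]


def build_search_url(query, api_key, max_results=50):
    """Build a search request URL for videos matching `query`."""
    params = {
        "part": "snippet",          # return title and description
        "q": query,
        "type": "video",            # exclude channels and playlists
        "maxResults": max_results,  # the API allows at most 50 per page
        "key": api_key,
    }
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


def rows_from_response(response, category):
    """Flatten a decoded search response into one dict per video."""
    rows = []
    for item in response.get("items", []):
        snippet = item["snippet"]
        rows.append({
            "video_id": item["id"]["videoId"],
            "title": snippet["title"],
            "description": snippet["description"],
            "category": category,
        })
    return rows


def write_rows(rows, fileobj):
    """Write the rows as CSV with a header line."""
    writer = csv.DictWriter(fileobj, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(rows)


# Hand-written sample shaped like a search response (not real data).
sample_response = {
    "items": [
        {
            "id": {"videoId": "abc123"},
            "snippet": {
                "title": "A weekend in Lisbon",
                "description": "Travel vlog covering food and sights.",
            },
        }
    ]
}

url = build_search_url("travel vlog", api_key="YOUR_API_KEY")
rows = rows_from_response(sample_response, category="Travel Blogs")
buffer = io.StringIO()
write_rows(rows, buffer)
print(buffer.getvalue())
```

In a real run, the URL would be fetched once per category query, the JSON body decoded, and the rows appended to the CSV file rather than to an in-memory buffer.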

Performance Analysis

Naive Bayes

Support Vector Machine
