/TedTalk-Popularity-Prediction

applied advanced machine learning techniques to predict the popularity of TED Talk

Primary LanguageJupyter Notebook

Link to our GitHub Pages

TED Talks Popularity Prediction Project

Introduction

Welcome to our TED Talks Popularity Prediction project, where we apply advanced machine learning techniques to predict the popularity of TED Talks. Our multidisciplinary team has leveraged sentiment analysis, association rule mining, and support vector regression to delve deep into what makes a TED Talk resonate with its audience. This repository contains all the code, reports, and resources used in our project

Conclusion

Our project demonstrates the power of combining multiple machine learning techniques to predict the popularity of digital content. Through our analyses, we've uncovered the significant impact of emotional tone, speaker background, and content type on viewer engagement. This work not only advances our understanding of content popularity but also lays the groundwork for future explorations into automated content analysis.

- proposal.pdf: project proposal

- midterm_report.pdf: midterm report

- final.pdf: final report

Main Directory

  • index.html: the html code for the GitHub Page's home page
  • midterm.html: the html code for the GitHub Page with the midterm report
  • proposal.html: the html code for the GitHub Page with the proposal
  • gantt.html: the html code for the GitHub Page with our Gantt chart
  • requirements.txt: list of required Python packages for the repository
  • video.html: the html code for the GitHub Page with the youtube video

Models' Folders

association_mining

Folder containing Association Rule Mining code

data_preprocessing

Folder containing Data Preprocessing code

eda

Folder containing EDA code

  • data_eda.ipynb: performs exploratory data analysis on the dataset
  • eda_part2.ipynb: performs more exploratory data analysis on the dataset, and focuses on visualizing the distribution of the features

sentiment_analysis

Folder containing Sentiment Analysis code

svr

Folder containing SVR code

  • SVR_topics.ipynb: this is our code for the SVR model that uses topics as a feature.
  • SVR_emotions.ipynb: this is our code for the SVR model that uses detected emotions as a feature.
  • SVR_occupations.ipynb: this is our code for the SVR model that uses occupations as a feature.

text gen

Folder containing Text Generation code

Other Folders

css

Folder containing CSS code

  • style.css: style.css document for our GitHub page

data

Folder containing data