/ptf-kommentare

Notes & code for my Protoypefund project about Machine Learning & news comments & language change

Primary LanguageJupyter NotebookMIT LicenseMIT

PTF-Kommentare

This repository contains code and notes for my Prototype Fund project. It was mainly done between 01.03.2019 and 01.09.2019. The topic: Explaining machine learning and natural language processing at the example of news comments, and visualize language change.

Sub Projects

The work is devided into serveral sub projects:

Create your own Visualizations of Language Change

Here is a short guide on how to create your own videos. An example video here.

  1. Divide your data in time slices & create a word embedding for each slice
  2. Save the embedding in KeyedVectors format of gensim (using hyperhyper to create stable word embeddings is advised)
  3. Install ffmpeg
  4. pip install git+https://github.com/jfilter/adjustText && pip install gensim scikit-learn matplotlib colormath
  5. Adopt the code in this notebook (so you also need to have either Jupyter Lab or Jupyter Notebook installed.)

Right now, it's not that easy to create those videos. However, it's doable and I'm willing to help you. The 'important' part of the code is commented thoroughly. Please contact me for assistance.

Two papers for a more scientific background:

Some more papers here.

Sponsoring

This work was funded by the German Federal Ministry of Education and Research.