schokoblume/Data-Literacy

Data preperation

Closed this issue · 0 comments

  1. Standardize by all articles published for that day from Ausland and Politik [ ]

  2. Join data: article number (after we standardized) and sonntagsfrage werte.

  • Problem: They have different times, survey only every couple weeks.
  • Option 1: articles from one / two weeks before survey count for that survey.
  • Option 2: all articles after survey n - 1 until survey n count for survey n -> no
  • Option 3: shortest or longest time between surveys, or median

Start with median. Important: Write script so it is easy to change the time frame!