nateraluis/systemiq-test

Data cleaning & merging

Closed this issue · 0 comments

  1. Please load the users and activity tables into Python.

    • How many variables are in the datasets?
    • How many observations are in the datasets?
  2. How many

    • Male users are in the dataset?
    • Female users are in the dataset?
    • For how many users is no gender information available?

If there is no gender information available, please assume that the gender is male (“M”) for all further questions.

  1. Merge the two datasets and:
    • Calculate the days since registration for each activity.
    • What is the total mean and median revenue generated per user (ie across all activities)?
    • How do you interpret the result?
  2. Please visualise the distribution of the revenues by user.
  3. What is the average week 1 revenue, ie the revenue generated by the user in their first week (ie within the first 7 days since registration)?