/shopify_data_science_challenge

Shopify Technical Challenge for Fall 2022 Data Science Internship

Primary LanguageJupyter Notebook

shopify_data_science_challenge

Shopify Technical Challenge Submission for Fall 2022 Data Science Internship

Solutions can viewed in the Jupyter Notebook or the PDF file.

Project Description

This project was completed as part of my application for Shopify's Fall 2022 Data Science Internship. The project specifications are included below:

Question 1

Given some sample data, write a program to answer the following:

On Shopify, we have exactly 100 sneaker shops, and each of these shops sells only one model of shoe. We want to do some analysis of the average order value (AOV). When we look at orders data over a 30 day window, we naively calculate an AOV of $3145.13. Given that we know these shops are selling sneakers, a relatively affordable item, something seems wrong with our analysis.

  1. Think about what could be going wrong with our calculation. Think about a better way to evaluate this data.
  2. What metric would you report for this dataset?
  3. What is its value?

Question 2

For this question you’ll need to use SQL. Follow this link to access the data set required for the challenge. Please use queries to answer the following questions. Paste your queries along with your final numerical answers below.

  1. How many orders were shipped by Speedy Express in total?
  2. What is the last name of the employee with the most orders?
  3. What product was ordered the most by customers in Germany?