Data Science Summer Intern assignment 2022

Assignment for candidates

This is a mandatory assignment for everyone applying for the data science internship. Please return your answers along with your application.

Table of Contents

  1. Overview
  2. Choosing the data
  3. Choosing the approach
  4. Working with the data
  5. Your background and Wolt

Overview

Thank you for applying for Wolt's 2022 Data Science Internship! The idea is not to spend an extensive amount of time on this, but to show how you think and approach problems.

There are two parts to this assignment:

  • first, you get to pick a dataset, explore it and prepare a couple of models based on it,
  • secondly, we'll take a look at your academic and extracurricular interests.

Present your answers in a reproducible way and in an open format. Make sure you include the code and your reasoning, too. It pays to have a proper idea of Wolt's product and business, so it might be a good idea to take a look at our blog or some of the recent articles around the web.

There are many ways to successfully complete this assignment. Make sure the person reading the document you produce understands why you made the choices you made. If you have a unique approach and the right set of skills, this will be your chance to set yourself apart and shine!


Choosing the data

We have prepared a couple of options for you. Feel free to choose which data you use based on your background and ambitions. You only need to choose one.

  • Time series. Consider the flow of orders in the provided file as a process fluctuating in time.
  • Routing. Every order in the provided file has a pick-up and a drop-off location, and they are connected via a route on a map. OpenStreetMap has the suitable map data, and you can use tools such as OSMnx and NetworkX to utilize it.
  • Image processing. Select a freely available food image classification dataset. For inspiration, take a look at this repo with food categories classification data and code or Kaggle. A suitable dataset for this task will have category labels associated with pictures of food.

You don't have to limit your choices to these, and you can also freely combine and enrich datasets.


Choosing the approach

Armed with a dataset, come up with a modeling task that is relevant to Wolt. To give you an idea what we are looking for, the task might look something like these:

  • How many orders are we going to get tomorrow? Or next week?
  • Where will the orders be delivered in an hour?
  • Routing will obviously affect the total delivery time. Are there some routes or route segments that should be avoided at certain times? * How well does the route explain the delivery time?
  • Not all dishes listed on Wolt have all the ingredients listed, but we have pictures of most dishes. Can we try to recommend only vegan dishes to vegans, based on the associated images?
  • …and so on!

The minimum requirement is that your approach will result in some kind of predictive model. Choose a task that properly showcases your skills!


Working with the data

Exploration

Produce interesting statistics and graphs about the dataset. Show the most important features and explain what you see.

Modeling

Why did you choose the approach, what kind of benefits do you see in solving it? What kind of metrics can you use to evaluate how good the solution is?

Based on the approach you choose, produce a model suitable for the task. You should include the preparation work, feature engineering and your thought process in your answer.

Evaluation

Are you happy with the results? What kind of results would you expect to see, if this was deployed to production?

Further development

Make slight modifications to the model or take a completely different method to solve it. Compare your two solutions. Strengths, weaknesses? What should you consider when you compare different models? If you had more time and resources, what kind of development could be done to make the solution better?


Your background and Wolt

After the practical work, let's discuss what you have learned and what your ambitions are. Write a bit about the problems you like to work with. Have you written your thesis or a larger piece of coursework about something that you would see beneficial for Wolt? If you already have work history, are there some things that you would like to try here? Based on your knowledge about us, are there some problems you would like to help us solve? Do you have some relevant, interesting minors or side projects? We are always interested in enthusiastic people with fresh ideas, and this could be the opportunity to put something you recently learned into use!