Welcome to the Data Engineer Intern assessment for Shopify! This assessment is designed to evaluate your skills in SQL, data manipulation, and problem-solving. Please follow the instructions below to complete the assessment.
The assessment is based on a simulated dataset containing sales information from an e-commerce platform. The dataset is provided in the `/data` directory as a set of CSV files named `<table-name>_data.csv`. The dataset includes columns such as `product_id`, `sales_amount`, `customer_id`, etc.
- Fork the Repository: Fork this repository to your own account, then clone your fork to your local machine.
- Create a New Branch: Create a new branch to store your work in.
- Data Understanding: Review the `schema.sql` file and the `data.csv` files to understand their structure and columns.
- Write SQL Queries: Create SQL files (`task1.sql`, `task2.sql`, `task3.sql`) in the `/sql` directory to solve each task mentioned above.
- Submit Your Work: Once completed, create a pull request with your changes to the `main` branch and submit the link to your PR.
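
For the Data Understanding step, a small script can list the columns of each CSV file before you write any SQL. The following is a minimal sketch, not part of the assessment: the helper name `list_csv_columns` is hypothetical, and it assumes that every `*_data.csv` file starts with a header row and that the script is run from the repository root.

```python
import csv
from pathlib import Path


def list_csv_columns(data_dir):
    """Return {file name: [column names]} for every *_data.csv file in data_dir."""
    columns = {}
    for path in sorted(Path(data_dir).glob("*_data.csv")):
        with path.open(newline="", encoding="utf-8") as handle:
            # Assumption: the first row of each file is a header row.
            header = next(csv.reader(handle), [])
        columns[path.name] = header
    return columns


if __name__ == "__main__":
    # Assumption: run from the repository root, where the data directory lives.
    for name, header in list_csv_columns("data").items():
        print(f"{name}: {', '.join(header)}")
```

Comparing this output with the table definitions in `schema.sql` is a quick way to confirm which file corresponds to which table.
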
- Ensure your SQL files contain clear and commented queries for each task.
- Use the provided datasets (`<table-name>_data.csv`) for all tasks.
- Feel free to ask any clarifying questions by creating an issue in this repository.
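
To illustrate what a clear, commented query might look like, and one way to check it locally, here is a hedged sketch. Everything specific in it is an assumption: the table name `sales`, the column types, the sample rows, and the use of SQLite as the engine are all hypothetical. The real table names and types come from `schema.sql`, and the README does not say which database the queries will be run against.

```python
import csv
import io
import sqlite3

# Hypothetical sample rows standing in for one <table-name>_data.csv file.
SAMPLE_CSV = """product_id,sales_amount,customer_id
101,20.00,1
102,35.50,2
101,10.00,3
"""

# The kind of commented query a task file might contain.
QUERY = """
-- Total sales per product, highest first.
SELECT product_id,
       SUM(sales_amount) AS total_sales  -- sum over all rows for the product
FROM sales
GROUP BY product_id
ORDER BY total_sales DESC;
"""


def run_query(csv_text, query):
    """Load csv_text into an in-memory `sales` table and return the query rows."""
    conn = sqlite3.connect(":memory:")
    try:
        # Assumed schema for illustration only; use schema.sql for the real one.
        conn.execute(
            "CREATE TABLE sales "
            "(product_id INTEGER, sales_amount REAL, customer_id INTEGER)"
        )
        rows = [
            (int(r["product_id"]), float(r["sales_amount"]), int(r["customer_id"]))
            for r in csv.DictReader(io.StringIO(csv_text))
        ]
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
        return conn.execute(query).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for product_id, total_sales in run_query(SAMPLE_CSV, QUERY):
        print(product_id, total_sales)
```

The text inside `QUERY` is what would go into a file such as `task1.sql`; the surrounding Python only exists to run it against sample data.
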
Good luck, and we look forward to reviewing your work!