This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Last update: Jan 11, 2022

Overview

Data-Science-Intern-Challenge

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Summer 2022 Data Science Intern Challenge

Please complete the following questions, and provide your thought process/work. You can attach your work in a text file, link, etc. on the application page. Please ensure answers are easily visible for reviewers!

Question 1: Given some sample data, write a program to answer the following: click here to access the required data set

On Shopify, we have exactly 100 sneaker shops, and each of these shops sells only one model of shoe. We want to do some analysis of the average order value (AOV). When we look at orders data over a 30 day window, we naively calculate an AOV of $3145.13. Given that we know these shops are selling sneakers, a relatively affordable item, something seems wrong with our analysis.

Think about what could be going wrong with our calculation. Think about a better way to evaluate this data.

Answer: The wrong average was calculated using this method: total of all order values/ number of order_values. This is wrong because the formula didn't consider the fact that an order can have multiple items. I have tried to explain the problem with code. Click Here to view it.

What metric would you report for this dataset?

Answer: The correct approach would be to divide the total of all order_values by the sum of total_items. By following this method, we would consider the fact that an order can have multiple items.

What is its value?

Answer: $357.92

Question 2: For this question you’ll need to use SQL. Follow this link to access the data set required for the challenge. Please use queries to answer the following questions. Paste your queries along with your final numerical answers below.

How many orders were shipped by Speedy Express in total?

Answer: 54

What is the last name of the employee with the most orders?

Answer: Peacock

What product was ordered the most by customers in Germany?

Answer: Boston Crab Meat. This product was ordered 160 times in total.

Click here to check the sql queries.

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

Related tags

Overview

Data-Science-Intern-Challenge

Summer 2022 Data Science Intern Challenge

Owner

This is the official implementation for the paper "(Almost) Free Incentivized Exploration from Decentralized Learning Agents" in NeurIPS 2021.

NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.

Specificity-preserving RGB-D Saliency Detection

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

E2C implementation in PyTorch

The Noise Contrastive Estimation for softmax output written in Pytorch

Predicts an answer in yes or no.

Artificial Neural network regression model to predict the energy output in a combined cycle power plant.

ruptures: change point detection in Python

clustering moroccan stocks time series data using k-means with dtw (dynamic time warping)

A simple log parser and summariser for IIS web server logs

A simple root calculater for python

[ICCV21] Official implementation of the "Social NCE: Contrastive Learning of Socially-aware Motion Representations" in PyTorch.

Politecnico of Turin Thesis: "Implementation and Evaluation of an Educational Chatbot based on NLP Techniques"

Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.

The repo of Feedback Networks, CVPR17

Pytorch implementation of Learning with Opponent-Learning Awareness