Product pricing gets even harder at scale, considering just how many products are sold online. Clothing has strong seasonal pricing trends and is heavily influenced by brand names, while electronics have fluctuating prices based on product specs.
Mercari, Japan’s biggest community-powered shopping app, knows this problem deeply. They’d like to offer pricing suggestions to sellers, but this is tough because their sellers can put just about anything, or any bundle of things, on Mercari's marketplace.
This model automatically suggests product prices given user-supplied text descriptions of products, including details like product category name, brand name, and item condition.
The files consist of a list of product listings and are tab-delimited (train.tsv and test.tsv). A minimal loading sketch follows the field list below.
- train_id or test_id - the id of the listing
- name - the title of the listing. Note that we have cleaned the data to remove text that looks like prices (e.g. $20) to avoid leakage. These removed prices are represented as [rm]
- item_condition_id - the condition of the items provided by the seller
- category_name - category of the listing
- brand_name - the brand of the item, where provided
- price - the price that the item was sold for, in USD. This is the target variable that you will predict; this column doesn't exist in test.tsv.
- shipping - 1 if shipping fee is paid by seller and 0 by buyer
- item_description - the full description of the item. Note that we have cleaned the data to remove text that looks like prices (e.g. $20) to avoid leakage. These removed prices are represented as [rm]
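
A minimal loading sketch with pandas, assuming the files sit in the project root (the paths are assumptions):

```python
import pandas as pd

# The listings are tab-delimited, so sep='\t' is required.
train = pd.read_csv("train.tsv", sep="\t")
test = pd.read_csv("test.tsv", sep="\t")

# train contains the target column 'price'; test does not.
print(train.shape, test.shape)
print(train.columns.tolist())
```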
Though a separate EDA notebook with a detailed description is included in this project, the interactive graphs do not render properly in GitHub's IPython notebook viewer. Please refer to the following link for the complete analysis: https://www.kaggle.com/nehaytamore/my-eda-of-mercari
-
For categorical variables:
- Shipping : Shipping information is present for all products. The EDA showed that products whose shipping fee is paid by the seller tend to fall in a slightly higher price range than the others. For modeling, shipping is represented as a binary variable.
- Item condition id : The condition id is a categorical variable taking values 1 to 5. From a price perspective it is tricky to deduce which id corresponds to the most used or unused condition, since the variation in prices is driven largely by the product being sold. The condition id could be one-hot encoded, but instead it is embedded into a 5-dimensional space so that the network can learn additional structure, if any; the model also performs slightly better when embeddings are used for the item condition id (see the model sketch after the text-features list below).
- Brand name (feature imputation) : Brand name is the only feature with almost half of its values missing. The missing values were filled by a simple but effective string-matching pass over the item name and description, which improved RMSLE by about 0.02 (a sketch of the idea appears after this list).
- Category : Splitting the category into its sub-levels adds some value to the model, but the time vs. RMSLE trade-off did not justify including it in this model.
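
The exact string-matching algorithm for brand imputation is in the notebook; the sketch below is only one possible interpretation, assuming missing brands are recovered by scanning the item name and description for tokens that match an already-known brand (single-token brands only):

```python
import pandas as pd

def impute_brand(df: pd.DataFrame) -> pd.DataFrame:
    """Fill missing brand_name values by string matching against known brands."""
    known_brands = set(df["brand_name"].dropna().unique())

    def find_brand(row):
        if pd.notna(row["brand_name"]):
            return row["brand_name"]
        text = str(row["name"]) + " " + str(row["item_description"])
        # Naive scan: return the first token that matches a known brand.
        for token in text.split():
            if token in known_brands:
                return token
        return "missing"

    df["brand_name"] = df.apply(find_brand, axis=1)
    return df
```

Multi-word brands would need a slightly smarter match, e.g. checking whether each known brand appears as a substring of the name.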
-
Text features : The text features were embedded into appropriately sized spaces and GRUs were used to extract information from them (see the model sketch after this list):
- name : The length of the name has some correlation with the price of the product.
- item description : Using the item description length does not harm performance either (future scope : using YAKE to extract keywords from the description could be helpful here).
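
A minimal Keras sketch of the architecture described above: GRUs over the tokenized name and item description, a 5-dimensional embedding for the item condition id, and shipping as a raw binary input. The vocabulary size, sequence lengths, GRU units, and dense-layer width are placeholders, not the values used in the project; the brand and category inputs are omitted for brevity.

```python
from tensorflow.keras import layers, Model

# Placeholder sizes -- the real vocabulary size, sequence lengths and
# embedding dimensions used in the project may differ.
NAME_LEN, DESC_LEN = 10, 75
VOCAB_SIZE, TEXT_EMB_DIM = 50000, 32

# Text inputs: tokenized name and item_description sequences.
name_in = layers.Input(shape=(NAME_LEN,), name="name")
desc_in = layers.Input(shape=(DESC_LEN,), name="item_description")
name_emb = layers.Embedding(VOCAB_SIZE, TEXT_EMB_DIM)(name_in)
desc_emb = layers.Embedding(VOCAB_SIZE, TEXT_EMB_DIM)(desc_in)

# GRUs extract a fixed-size representation from each text feature.
name_gru = layers.GRU(8)(name_emb)
desc_gru = layers.GRU(16)(desc_emb)

# item_condition_id (values 1-5) embedded into a 5-dimensional space.
cond_in = layers.Input(shape=(1,), name="item_condition_id")
cond_emb = layers.Flatten()(layers.Embedding(6, 5)(cond_in))

# shipping is used as a plain binary input.
ship_in = layers.Input(shape=(1,), name="shipping")

x = layers.concatenate([name_gru, desc_gru, cond_emb, ship_in])
x = layers.Dense(128, activation="relu")(x)
out = layers.Dense(1)(x)  # predicts the transformed (scaled log) price

model = Model([name_in, desc_in, cond_in, ship_in], out)
model.compile(loss="mse", optimizer="adam")
```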
-
Numerical variable :
- price : A major improvement in model performance was observed after transforming the price variable. The distribution of the target is skewed, so taking the log and then applying a min-max scaler improved the score by about 20% (see the sketch below).
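
A sketch of the target transformation, assuming log1p (to tolerate zero prices) followed by scikit-learn's MinMaxScaler; the project may use a plain log instead:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# The price distribution is right-skewed: compress it with log1p,
# then scale into [0, 1] so the regression target is well-conditioned.
scaler = MinMaxScaler()
y_log = np.log1p(train["price"].values).reshape(-1, 1)
y_scaled = scaler.fit_transform(y_log)

# Invert both steps to turn model predictions back into dollar prices.
def inverse_price(pred_scaled):
    return np.expm1(scaler.inverse_transform(pred_scaled.reshape(-1, 1)))
```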
-
There were some duplicate products, but removing them did not improve the RMSLE score, so deduplication was left out of the baseline model.