/dark_market_ml

Use machine learning regression models to predict the bitcoin price of dark market cocaine.

Primary LanguageJupyter Notebook

dark_market_ml

Use various machine learning models to predict the btc price of dark market cocaine. Our Project_code.ipynb contains the complete workflow of our process: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Modeling, and Evaluation.

Project contributors include: Shirley Li, Jingxian Li, Michael Schulze, & Mundy Reimer

Our presentation slides can be found here.


Public Data Set found here: https://www.kaggle.com/everling/cocaine-listings

Description:

The dataset is approximately 1,400 cleaned and standardized product listings from Dream Market's "Cocaine" category. It was collected with web-scraping and text extraction techniques in July 2017.

Extracted features for each listing include: product_title ships_from_to quantity in grams quality btc_price vendor details shipping dummy variables (true/false columns)

For further details on the creation of this dataset and what it contains, see the blog post here: https://medium.com/thought-skipper/dark-market-regression-calculating-the-price-distribution-of-cocaine-from-market-listings-10aeff1e89e0


Files:

Final_code: Contains all our code from modeling to visualization.

dream_market_cocaine_listings: Original CSV file we downloaded from Kaggle.

fixed_cocaine_listings: Dataset we used after cleaning the original Kaggle CSV.