Pinned Repositories
credit-card-fraud
Exploration of binary classification models for credit card fraud detection, using a dataset of European cardholder transactions from September 2013. The dataset was provided through a collaboration between Worldline and the Machine Learning Group of Université Libre de Bruxelles.
cristobalmitchell
data-science-resources
Curated list of learning resources covering a wide range of topics within data science
hnsc-biomarkers
Statistical analysis of TCGA-HNSC miRNA sequences for identification of biomarkers indicating lymphovascular invasion presence across clinical stages
labelspark
This library makes it easy to take unstructured data in your data lake and prepare it for analysis and AI work in Databricks. The Labelbox Connector for Apache Spark takes in a Spark DataFrame to create a dataset in Labelbox, and it brings labeled, structured data back into Databricks as a Spark DataFrame.
pokedex
Simple web scraping example for collecting Pokémon stats from a Pokédex site
python-data-viz
Overview of common data visualizations using Python
sport-scraper
Simple web scraper package for gathering sports data from ESPN.com
time-series-forecasting
Exploration of time series forecasting concepts and techniques
cristobalmitchell's Repositories
cristobalmitchell/pokedex
Simple web scraping example for collecting Pokémon stats from a Pokédex site
cristobalmitchell/sport-scraper
Simple web scraper package for gathering sports data from ESPN.com
cristobalmitchell/credit-card-fraud
Exploration of binary classification models for credit card fraud detection, using a dataset of European cardholder transactions from September 2013. The dataset was provided through a collaboration between Worldline and the Machine Learning Group of Université Libre de Bruxelles.
cristobalmitchell/cristobalmitchell
cristobalmitchell/data-science-resources
Curated list of learning resources covering a wide range of topics within data science
cristobalmitchell/hnsc-biomarkers
Statistical analysis of TCGA-HNSC miRNA sequences for identification of biomarkers indicating lymphovascular invasion presence across clinical stages
cristobalmitchell/labelspark
This library makes it easy to take unstructured data in your data lake and prepare it for analysis and AI work in Databricks. The Labelbox Connector for Apache Spark takes in a Spark DataFrame to create a dataset in Labelbox, and it brings labeled, structured data back into Databricks as a Spark DataFrame.
cristobalmitchell/python-data-viz
Overview of common data visualizations using Python
cristobalmitchell/time-series-forecasting
Exploration of time series forecasting concepts and techniques