/Data-Science-Bootcamp

Data Science Bootcamp projects.

Week 2: SQL

The goal of this project is to practice writing SQL queries.

We used the Yellow Taxi data in this project.

tools:

1. blazingsql
2. pandas

Week 3: Tableau

In this project we build a story in Tableau using any dataset. We used the Yellow Taxi data; you can find the story here.

Week 4: R - simulation

We built a movie theater simulation using R and RStudio.

tools:

1. base R
2. Graphics
3. plotrix

Week 5: R - Data Munging

Clean data and visualize it using the Tidyverse packages. I chose the Coffee_Ratings data from the #TidyTuesday repository.

tools:

1. dplyr
2. tidyr
3. ggplot2
4. lubridate

Week 6: R - Shiny Web app

In this project we build a Shiny web app to display our visualizations and analysis in a simple, reactive web page. I built a simple Shiny app using the Palmer Penguins dataset.

Week 7: Python - Monte Carlo simulation

Build a Monte Carlo simulation for the birthday problem using Python.
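As a rough illustration of the idea (not the project's actual notebook code), a Monte Carlo estimate of the birthday problem can be sketched as below. The function names, the 365-day uniform birthday assumption, and the default trial count are all assumptions made for this sketch. The exact formula is included only as a sanity check on the simulated estimate.

```python
import random


def has_shared_birthday(group_size, rng):
    """Return True if at least two people in one random group share a birthday."""
    # Assumption: 365 equally likely birthdays, no leap years.
    birthdays = [rng.randrange(365) for _ in range(group_size)]
    return len(set(birthdays)) < group_size


def estimate_shared_birthday_probability(group_size, trials=10_000, seed=0):
    """Monte Carlo estimate: fraction of simulated groups with a shared birthday."""
    rng = random.Random(seed)  # seeded so repeated runs give the same estimate
    hits = sum(has_shared_birthday(group_size, rng) for _ in range(trials))
    return hits / trials


def exact_shared_birthday_probability(group_size):
    """Closed-form probability, used to check the simulation."""
    p_all_distinct = 1.0
    for k in range(group_size):
        p_all_distinct *= (365 - k) / 365
    return 1.0 - p_all_distinct


if __name__ == "__main__":
    for n in (10, 23, 50):
        estimate = estimate_shared_birthday_probability(n)
        exact = exact_shared_birthday_probability(n)
        print(f"group of {n}: simulated {estimate:.3f}, exact {exact:.3f}")
```

Increasing `trials` narrows the gap between the simulated and exact values, at the cost of a longer run time.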

Week 8: Python - Exploratory Data Analysis (EDA)

In this project we choose a dataset and do EDA to understand it. The default data is the Titanic dataset; we did EDA on the Titanic dataset and on a second dataset, Palmer Penguins.

tools:

1. pandas
2. seaborn

Week 9: Python - Plotly App

The goal of this project is to build a Plotly app to display visualizations in an interactive way.

We used the Titanic dataset to build the app.

Week 10: Machine Learning project 1

In this project our goal is to identify and predict customer retention using EDA and logistic regression.

tools:

1. pandas
2. seaborn
3. sklearn

Week 11: Machine Learning project 2

I used the Mushroom dataset for exploratory data analysis and modeling. Since the data was clean, there was no need to preprocess it.

tools:

1. pandas
2. seaborn
3. sklearn

Week 12: Machine Learning project 3

This week's project was to participate in an ongoing Kaggle challenge called Tabular Playground Series. This is my team's notebook.

tools:

1. cuml
2. cudf
3. pandas
4. numpy
5. matplotlib & seaborn