Data-Science-Projects-With-Python-Second-Edition

This is the repository for the second edition of Data Science Projects with Python, published by Packt. It contains all of the supporting files needed for you to work through the course from start to finish.

Data Science Projects with Python, 2nd Edition

About Data Science Projects with Python

Data Science Projects with Python uses a case study approach to gaining valuable insights from real data with machine learning. By the end of this book, you will be able to:

  • Think critically about data and use it to form and test a hypothesis
  • Choose an appropriate machine learning model and train it on your data
  • Communicate data-driven insights with confidence and clarity

What You Will Learn

  • Load, explore, and process data using the pandas Python package
  • Use Matplotlib to create compelling data visualizations
  • Implement predictive machine learning models with scikit-learn
  • Use lasso and ridge regression to reduce model overfitting
  • Evaluate random forest and logistic regression model performance
  • Deliver business insights by presenting clear, convincing conclusions

Installing Anaconda and Setting Up an Environment

Install Anaconda by following the instructions at this link: https://www.anaconda.com/products/individual

It is recommended to create an environment in Anaconda to do the exercises and activities in this book, which have been tested against the software versions indicated here. Once you have Anaconda installed, open a Terminal if you're using macOS or Linux, or a Command Prompt window in Windows, and do the following:

  1. Create an environment with most required packages. You can call it whatever you want; here it’s called dspwp2. Copy and paste, or type the entire statement here on one line in the terminal:

conda create -n dspwp2 python=3.8.2 jupyter=1.0.0 pandas=1.2.1 scikit-learn=0.23.2 numpy=1.19.2 matplotlib=3.3.2 seaborn=0.11.1 python-graphviz=0.15 xlrd=2.0.1

Type 'y' and [Enter] when prompted.

  1. Activate the environment:

conda activate dspwp2

  1. Install the remaining packages:

conda install -c conda-forge xgboost=1.3.0 shap=0.37.0

Type 'y' and [Enter] when prompted.

  1. You are ready to use the environment. To deactivate it when finished:

conda deactivate

Related Books

If you've found this book useful, you might want to check out some of our other titles:

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781800564480