This is the code repository for Mastering Pandas Second Edition, published by Packt.
A complete guide to pandas, from installation to advanced data analysis techniques
pandas is a popular Python library used by data scientists and analysts worldwide to manipulate and analyze their data. This book presents useful techniques and real-world examples on getting the most out of pandas for expert-level data manipulation, analysis and visualization.
This book covers the following exciting features:
- Speed up your data analysis by importing data into pandas
- Keep relevant data points by selecting subsets of your data
- Create a high-quality dataset by cleaning data and fixing missing values
- Compute actionable analytics with grouping and aggregation in pandas
- Master time series data analysis in pandas
- Make powerful reports in pandas using Jupyter notebooks
If you feel this book is for you, get your copy today!
All of the code is organized into folders. For example,
The code will look like the following:
source_python("titanic.py")
titanic_in_r <- get_data_head("titanic.csv")
Following is what you need for this book: This book is for data scientists, analysts and Python developers who wish to explore advanced data analysis and scientific computing techniques using pandas. Some fundamental understanding of Python programming and familiarity with the basic data analysis concepts is all you need to get started with this book.
With the following software and hardware list you can run all code files present in the book (Chapter 1-11).
Chapter | Software required | OS required |
---|---|---|
All | Python 3.6 or higher | Windows, Mac OS X, and Linux (Any) |
All | Jupyter notebook | Windows, Mac OS X, and Linux (Any) |
All | Pandas, R, IPython, scikit-learn | Windows, Mac OS X, and Linux (Any) |
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
Ashish Kumar is a seasoned data science professional, a publisher author and a thought leader in the field of data science and machine learning. An IIT Madras graduate and a Young India Fellow, he has around 7 years of experience in implementing and deploying data science and machine learning solutions for challenging industry problems in both hands-on and leadership roles. Natural Language Procession, IoT Analytics, R Shiny product development, Ensemble ML methods etc. are his core areas of expertise. He is fluent in Python and R and teaches a popular ML course at Simplilearn. When not crunching data, Ashish sneaks off to the next hip beach around and enjoys the company of his Kindle. He also trains and mentors data science aspirants and fledgling start-ups.
Click here if you have any feedback or suggestions.
If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.