/Hands-On-Data-Analysis-with-Scala

Hands-On Data Analysis with Scala, published by Packt

Primary LanguageScalaMIT LicenseMIT

$5 Tech Unlocked 2021!

If you have read this book, please leave a review on Amazon.com. Potential readers can then use your unbiased opinion to help them make purchase decisions. Thank you. The $5 campaign runs from December 15th 2020 to January 13th 2021.

Hands-On Data Analysis with Scala

Hands-On Data Analysis with Scala

This is the code repository for Hands-On Data Analysis with Scala , published by Packt.

Perform data collection, processing, manipulation, and visualization with Scala

What is this book about?

Efficient business decisions with an accurate sense of business data helps in delivering better performance across products and services. This book helps you to leverage the popular Scala libraries and tools for performing core data analysis tasks with ease.

This book covers the following exciting features:

  • Techniques to determine the validity and confidence level of data
  • Apply quartiles and n-tiles to datasets to see how data is distributed into many buckets
  • Create data pipelines that combine multiple data lifecycle steps
  • Use built-in features to gain a deeper understanding of the data
  • Apply Lasso regression analysis method to your data

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders.

The code will look like the following:

scala> def factorial(n: Int): Long = if (n <= 1) 1 else n * factorial(n-1)
factorial: (n: Int)Int
scala> factorial(5)
res0: Long = 120

Following is what you need for this book: If you are a data scientist or a data analyst who wants to learn how to perform data analysis using Scala, this book is for you. All you need is knowledge of the basic fundamentals of Scala programming.

With the following software and hardware list you can run all code files present in the book (Chapter 1-9).

Software and Hardware List

Chapter Software required OS required
1-5 IntelliJ Community Edition 2019.1 or above Linux, Mac OS preferred. Windows can also be used
1-5 Scala Build Tool 1.2.8 or above Linux, Mac OS preferred. Windows can also be used
6-9 Apache Spark 2.4.0 or above Linux, Mac OS preferred. Windows can also be used

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Related products

Get to Know the Author

Rajesh Gupta is a hands-on big data tech lead and enterprise architect with extensive experience in the full life cycle of enterprise grade software development. He has successfully architected, developed, and deployed highly scalable data solutions using Spark, Scala, and the Hadoop technology stack for several US-based enterprises. A passionate, hands-on technologist, Rajesh has master's degrees in mathematics and computer science from BITS, Pilani (India).

Suggestions and Feedback

Click here if you have any feedback or suggestions.

Download a free PDF

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781789346114