This is the code repository for Comet for Data Science, published by Packt.
Enhance your ability to manage and optimize the life cycle of your data science project
This book provides concepts and practical use cases which can be used to quickly build, monitor, and optimize data science projects. Using Comet, you will learn how to manage almost every step of the data science process from data collection through to creating, deploying, and monitoring a machine learning model.
This book covers the following exciting features:
- Prepare for your project with the right data
- Understand the purposes of different machine learning algorithms
- Get up and running with Comet to manage and monitor your pipelines
- Understand how Comet works and how to get the most out of it
- See how you can use Comet for machine learning
- Discover how to integrate Comet with GitLab
- Work with Comet for NLP, deep learning, and time series analysis
If you feel this book is for you, get your copy today!
All of the code is organized into folders.
The code will look like the following:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from datetime import datetime
Following is what you need for this book: This book is for anyone who has programming experience, and wants to learn how to manage and optimize a complete data science lifecycle using Comet and other DevOps platforms. Although an understanding of basic data science concepts and programming concepts is needed, no prior knowledge of Comet and DevOps is required.
With the following software and hardware list you can run all code files present in the book (Chapter 1-11).
Python 3.8 Python libraries: | 0. comet-ml==3.23.0
-
findspark==1.4.2
-
gradio==3.2.2
-
matplotlib==3.4.3
-
numpy==1.19.5
-
pandas==1.3.4
-
pandas-profiling==3.1.0
-
pyspark==3.2.1
-
scikit-learn==1.0
-
seaborn==0.11.2
-
shap==0.40.0
-
spark-nlp==3.4.4
-
sweetviz==2.1.3
-
tensorflow==2.8.2
Windows, macOS, or Linux If you are using macOS, please make sure that the chip is not Apple M1. To overcome this problem, you can use Google Colab
Java SE Development Kit 17.0.2 (optional for Chapter 4, Workspaces, Projects, Experiments, and Models)
|
Java libraries (optional for Chapter 4,|Workspaces, Projects, Experiments, and Models):
- comet-java-sdk-1.1.10 weka 3.8.6
R software (optional)
R libraries (optional):
- caret
- cometr
Docker Kubernetes git Java 8 (required for Chapter 9, Comet for Natural Language Processing) Apache Spark 3.1.2
You should notice that the code examples described in Chapter 4, Workspaces, Projects, Experiments, and Models, require a different version of Java with respect to those described in Chapter 9, Comet for Natural Language Processing. In addition, to make Comet work, you need to sign up to the Comet platform (https://www.comet.com/signup) and create an account.
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
-
Applied Machine Learning Explainability Techniques [Packt] [Amazon]
-
Practical Deep Learning at Scale with MLflow [Packt] [Amazon]
Angelica Lo Duca is a researcher at the Institute of Informatics and Telematics of the National Research Council, Italy. She is also an external professor of data journalism at the University of Pisa. Her research includes data science, data journalism, and web applications. She used to work on network security, semantic web, linked data, and blockchain. She has published more than 40 scientific papers at national and international conferences and journals and has participated in many international projects and events, including as a member of the Program Committee. She is also part of the editorial team of the HighTech And Innovation Journal. She owns a personal blog, where she publishes articles on her research interests.