Pinned Repositories
Amazing-Feature-Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
BeamKatasColab
Apache Beam Katas, demonstrated on Colab.
bigfunctions
Supercharge BigQuery with BigFunctions
bigquery-ml-utils
Machine-Learning-with-BigQuery-ML
Machine Learning with BigQuery ML, published by Packt
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
DataAnalysisWithPythonAndPySpark
Code repository for the "PySpark in Action" book
LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
kalona's Repositories
kalona/training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
kalona/Amazing-Feature-Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
kalona/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
kalona/bigfunctions
Supercharge BigQuery with BigFunctions
kalona/bigquery-ml-utils
kalona/bigquery-notebooks
kalona/Data-Science-Projects-with-Python
A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn
kalona/deep-learning-with-python-notebooks
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
kalona/Machine-Learning-with-BigQuery-ML
Machine Learning with BigQuery ML, published by Packt
kalona/cbt-intro-java
kalona/deep_learning_for_structured_data
Production repo to accompany Deep Learning with Structured Data book from Manning: https://www.manning.com/books/deep-learning-with-structured-data
kalona/first-steps-with-python-training
repository of the 3-day course "First steps with Python in Life Sciences" from SIB-training
kalona/fraudfinder
Fraudfinder: A comprehensive lab series on how to build a real-time fraud detection system on Google Cloud
kalona/Hands-On-Data-Analysis-with-Pandas-2nd-edition
Materials for following along with Hands-On Data Analysis with Pandas – Second Edition
kalona/introduction_to_ml_with_python
Notebooks and code for the book "Introduction to Machine Learning with Python"
kalona/Learn-Python-by-Building-Data-Science-Applications
Learn Python by Building Data Science Applications, published by Packt
kalona/machine-learning-for-tabular-data
Repository of course materials for a multi-day course on machine learning for tabular data using Scikit-Learn and XGBoost
kalona/ml_on_tabular_data
Code for the new Manning book on machine learning on tabular datasets
kalona/net.jgp.books.spark.ch01
Spark in Action, 2nd edition - chapter 1 - Introduction
kalona/net.jgp.books.spark.ch02
Spark in Action, 2nd edition - chapter 2
kalona/net.jgp.books.spark.ch03
Spark in Action, 2nd edition - chapter 3
kalona/oci-data-science-ai-samples
This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.
kalona/pandas-in-action
Complete source code (datasets and Jupyter Notebooks) for Pandas In Action
kalona/polars-for-data-science-oreilly-course
kalona/professional-services
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
kalona/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
kalona/python-statistics-essential-training-4433355
This is a repository for the LinkedIn Learning course Python Statistics Essential Training
kalona/PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
kalona/scikit-learn-Cookbook-Second-Edition
scikit-learn Cookbook Second Edition, published by Packt
kalona/scikit-learn-mooc
Machine learning in Python with scikit-learn MOOC