Data Engineering Workshops with Spark (PySpark), Pandas, Dask, Ray etc - some of the most popular libraries in the field. Supports Google Colab, click on the badge next to each notebook's link.
These serve as practical notes and references on common machine learning algorithms with an introduction to Pandas and Numpy.
Pronounced "cooler" provides 3 approaches to building a regular expressions engine - a toy overview, a backtracking based implementation and a finite-automata based approach.
Discover key colours in a painting or photograph using K-Means clustering. Also provides the proportions of the colours. Some results (more in the repo):
- Deeplearning.ai + Coursera - scored 100% across the board.
- Topics in JavaScript, using Jupyter Notebooks
- Building x.509 certificates for MongoDB - was written originally to be used with MongoDB, may be outdated, see latest Mongo documentation. Still a good step-by-step on how to build a certificate.
- DC/OS Training - no longer current, it's better to reference the DC/OS documentation.