/pipedrive-materials

A list of resources for my friends at Pipedrive =)

pipedrive-materials

A list of resources for my friends at Pipedrive =)

Good Collections of Resources

General Resources

Things to Give to Customers to help them understand ML

  • Google ML Glossary Gloassary of Machine Learning terms.
  • R2D3 A visual introduction to Machine Learning. Very pretty!

HPC Resources

  • Dask Parallel processing for Python Analytics encompassing Numpy, Pandas and SKL.
  • Modin Speed up Pandas with one line of code. Limited but easy.
  • Dask on AWS with Fargate Setting up a Dask cluster on AWS using Fargate (Amazon's version of Kubernetes). Not tried this so may not work!!

Specific things I've mentioned

  • ScatterText Interactive sentiment visualisation.
  • YellowBrick Machine Learning visualisation. Being developed quite quickly - everytime I check back there seems to be new things!
  • FastText Facebook developed library for text classification. Very easy to use.
  • Textract Python library that's brilliant for extract text from various documents (pdf, word, pictures etc.)
  • VADER Lexicon based sentiment analysis library. Very quick and easy to use. May have been incorporated into NLTK.
  • MLFlow Not had a chance to check this out properly yet, but I've heard very good things about it. Helps manage the Machine Learning lifecycle.
  • LIME Again, not had a chance to check this out properly yet, but friends say it's good! Explainability framework for Python / R and can be used with SKL / Keras etc.
  • DfT Data Science Team Processes The processes my team came up with @ DfT. This includes general stuff as well as a script to make Jupyter Notebooks easier to QA through nbstripout
  • The Institute for Ethical ML This is run by my friend Alejandro and has advice on how to develop ethical ML systems. Also he runs an ML Engineer newsletter that contains lots of cutting edge resources.
  • Uber Ludwig A Deep Learning platform that allows training / testing of Deep Learning models without the need to write code.

Recommender System Resources