yangp18's Stars
cromano8/Snowflake_ML_Intro
Introduction to performing Machine Learning on Snowflake
linuxacademy/Content-AWS-Certified-Data-Analytics---Speciality
DAS-C01 ACG/LA by Brock Tubre and John Hanna
keredson/wordninja
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
datarobot-community/tutorials-for-data-scientists
Peleja/odsc_mlops_from_model_to_prod
Machine Learning Operations (MLOps) are essential to build successful Data Science use-cases. Today, ML is powering data driven use-cases that are transforming industries around the world. In order to seize and hold it's competitive advantage business needs to reduce risk therefore a new expertise rises to include data science models in operational systems. According to Gartner Research “While many organizations have experimented with AI proofs of concept, there are still major blockers to operationalizing its development. IT leaders must strive to move beyond the POC to ensure that more projects get to production and that they do so at scale to deliver business value. (July 2020)”. In this session, we will discuss the role of MLOps and how they can help data science models from deployment to maintenance with focus on: keep track of performance degradation overtime from model predictions quality, setting up continuous evaluation metrics and tuning the model performance in both training and serving pipelines that are deployed in production.
sujitpal/pytorch-gnn-tutorial-odsc2021
Repository for GNN tutorial using Pytorch and Pytorch Geometric (PyG) for ODSC 2021
stefmolin/python-data-viz-workshop
A workshop on data visualization in Python with notebooks and exercises for following along. Slides contain all solutions.
ACloudGuru-Resources/Course_AWS_Certified_Machine_Learning
data-science-on-aws/data-science-on-aws
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
bayesian-optimization/BayesianOptimization
A Python implementation of global optimization with gaussian processes.
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
dswah/pyGAM
[HELP REQUESTED] Generalized Additive Models in Python
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
awslabs/python-deequ
Python API for Deequ
aws/aws-sdk-go
AWS SDK for the Go programming language (In Maintenance Mode, End-of-Life on 07/31/2025). The AWS SDK for Go v2 is available here: https://github.com/aws/aws-sdk-go-v2
microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
shap/shap
A game theoretic approach to explain the output of any machine learning model.
happyrabbit/DataScienceWorkshop2019
Data Science Workshop 2019
UNCG-CSE/Bat_Echolocation