anjapago's Stars
fugue-project/fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
lumiata/tech_blog
Follow the Lumiata Tech Blog on Medium!
linkedin/greykite
A flexible, intuitive and fast forecasting library
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
mlrun/mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
GoogleCloudPlatform/ml-design-patterns
Source code accompanying O'Reilly book: Machine Learning Design Patterns
microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
monte-carlo-data/data-downtime-challenge
vasanthk/how-web-works
What happens behind the scenes when we type www.google.com in a browser?
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
sql-machine-learning/playground
Deploy SQLFlow service mesh on Windows, macOS, and Linux desktop computers
jupyterhub/jupyterhub
Multi-user server for Jupyter notebooks
optuna/optuna
A hyperparameter optimization framework
KevinFasusi/supplychainpy
Supplychainpy is a Python library for supply chain analysis, modelling and simulation. The library assists a workflow that is reliant on Excel and VBA.
Funkmyster/awesome-supply-chain
A curated list of awesome supply chain blogs, podcasts, standards, projects, and examples.
sktime/sktime
A unified framework for machine learning with time series
cesium-ml/cesium
Machine Learning Time-Series Platform
r3ap3rpy/sphinxer
This repository holds demos for the sphinx tutorial.
shogun-toolbox/shogun
Shōgun
scikit-learn-contrib/metric-learn
Metric learning algorithms in Python
google-research/google-research
Google Research
olivia-ai/the-math-behind-a-neural-network
Mathematics paper recapitulating the calculus behind a neural network and its back propagation
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
PeterFogh/dvc_dask_use_case
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
stdiff/dvc-usecase
A skelton of a data science project with DVC
peopledoc/mlvtools-tutorial
Tutorial for a new versioning Machine Learning pipeline
peopledoc/mlvtools
Public repository for versioning machine learning data
pytorch/captum
Model interpretability and understanding for PyTorch