mrMakaronka's Stars
pydantic/pydantic
Data validation using Python type hints
flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
ydb-platform/ydb
YDB is an open source Distributed SQL Database that combines high availability and scalability with strong consistency and ACID transactions
abhishekkrthakur/colabcode
Run VSCode (codeserver) on Google Colab or Kaggle Notebooks
learning-at-home/hivemind
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
frankmcsherry/blog
Some notes on things I find interesting and important.
microsoft/debugpy
An implementation of the Debug Adapter Protocol for Python
dstackai/dstack
dstack is an open-source alternative to Kubernetes, designed to simplify development, training, and deployment of AI across any cloud or on-prem. It supports NVIDIA, AMD, and TPU.
jupyter-incubator/sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
alexmojaki/snoop
A powerful set of Python debugging tools, based on PySnooper
lz4/lz4-java
LZ4 compression for Java
ScottOaks/JavaPerformanceTuning
Examples for O'Reilly & Associates Java Performance Tuning: The Definitive Guide
kubeflow-kale/kale
Kubeflow’s superfood for Data Scientists
marnikitta/stattests
Source code to reproduce experiments from the article Practitioner’s Guide to Statistical Tests
pypi/stdlib-list
A list of Python Standard Libraries (2.6-7, 3.2-12).
duckstax/otterbrix
Otterbrix is an open-source framework for developing conventional and analytical applications
acroz/pylivy
A Python client for Apache Livy, enabling use of remote Apache Spark clusters.
lambdazy/lzy
Platform for a hybrid execution of ML workflows that transparently integrates local and remote runtimes
intellistream/StreamProcessing_ReadingList
stream processing reading list
JetBrains/projector-demo
A simple sample application demonstrating running Swing applications remotely
futujaos/crowdom
Tool for simplifying data labeling
TU-Berlin-DIMA/out-of-order-datagenerator
An open source stream generator which generates reproducible and deterministic out-of-order streams, simulating arbitrary fractions of out-of-order tuples and their respective delays.
akhvorov/vgram
Feature extraction from sequential data
lambdazy/serialzy
solariq/sensearch
isaintnik/erc
Experiment Release Cycle repository
rebryk/neurox
Simple job manager for the Neuromation Platform
AntonYermilov/article-writing-assistant
Experiments in making suggestions while writing articles
streamreasoning/slangs
GlebSolovev/flink-anomaly
Flink at-least-once violation anomaly