persiyanov
Pragmatic ML and Data specialist. Ex ranking infra at Constructor.io. Ex dialogue systems & research at Yandex. MIPT / YSDA alumnus.
@extruct-ai
Barcelona
persiyanov's Stars
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
avito-tech/playbook
AvitoTech team playbook
pinterest/querybook
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
agronholm/apscheduler
Task scheduling library for Python
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
KeyviDev/keyvi
Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
ipython-contrib/jupyter_contrib_nbextensions
A collection of various notebook extensions for Jupyter
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
python-attrs/attrs
Python Classes Without Boilerplate
nalepae/pandarallel
A simple and efficient tool to parallelize Pandas operations on all available CPUs
facebookresearch/nevergrad
A Python toolbox for performing gradient-free optimization
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
PAIR-code/facets
Visualizations for machine learning datasets
ttzt/catalog_of_requirements_for_ai_products
The purpose of the catalog is to help data science teams collect all the requirements to consider while building an ML model and productionizing it.
allenai/allennlp
An open-source NLP research library, built on PyTorch.
facebookresearch/StarSpace
Learning embeddings for classification, retrieval and ranking.
ResidentMario/missingno
Missing data visualization module for Python.
pytries/marisa-trie
Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.
flameshot-org/flameshot
Powerful yet simple to use screenshot software 🖥️ 📸
dreamquark-ai/tabnet
PyTorch implementation of TabNet paper: https://arxiv.org/pdf/1908.07442.pdf
trent-b/iterative-stratification
scikit-learn cross validators for iterative stratification of multilabel data
i3/i3
A tiling window manager for X11
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
ramitsurana/awesome-kubernetes
A curated list for awesome kubernetes sources 🚢🎉
ing-bank/popmon
Monitor the stability of a Pandas or Spark dataframe ⚙︎
prometheus-operator/kube-prometheus
Use Prometheus to monitor Kubernetes and applications running on Kubernetes
prometheus-operator/prometheus-operator
Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes