martinsotir's Stars
rasbt/mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
koaning/embetter
just a bunch of useful embeddings
xl0/lovely-tensors
Tensors, for human consumption
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
scikit-learn-contrib/MAPIE
A scikit-learn-compatible module to estimate prediction intervals and control risks based on conformal predictions.
huggingface/diffusers
๐ค Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
IBM/lale
Library for Semi-Automated Data Science
ibis-project/ibis
the portable Python dataframe library
nbQA-dev/nbQA
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
mljar/bloxs
Build dashboards in Jupyter Notebook with numeric and chart boxes
inducer/pudb
Full-screen console debugger for Python
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
rgerum/pylustrator
Visualisations of data are at the core of every publication of scientific research results. They have to be as clear as possible to facilitate the communication of research. As data can have different formats and shapes, the visualisations often have to be adapted to reflect the data as well as possible. We developed Pylustrator, an interface to directly edit python generated matplotlib graphs to finalize them for publication. Therefore, subplots can be resized and dragged around by the mouse, text and annotations can be added. The changes can be saved to the initial plot file as python code.
google-research/jax3d
diffgram/diffgram
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
aesara-devs/aesara
Aesara is a Python library for defining, optimizing, and efficiently evaluating mathematical expressions involving multi-dimensional arrays.
jupyter-book/jupyterlab-myst
Use MyST Markdown directly in Jupyter Lab
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
mljar/mercury
Convert Jupyter Notebooks to Web Apps
flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
oneapi-src/oneDAL
oneAPI Data Analytics Library (oneDAL)
NannyML/nannyml
nannyml: post-deployment data science in python
IndustryEssentials/ymir
YMIR, a streamlined model development product.
lostintangent/gistpad
VS Code extension for managing and sharing code snippets, notes and interactive samples using GitHub Gists
bloomberg/memray
Memray is a memory profiler for Python
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
evidentlyai/evidently
Evidently is โโan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
allegroai/clearml-serving
ClearML - Model-Serving Orchestration and Repository Solution
reloadware/reloadium
Hot Reloading and Profiling for Python
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. ๐ Provides visibility into data quality & model performance over time. ๐ก๏ธ Supports privacy-preserving data collection, ensuring safety & robustness. ๐