Best-of Machine Learning with Python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 820 awesome open-source projects with a total of 2.6M stars grouped into 32 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue , submit a pull request , or directly edit the projects.yaml . Contributions are very welcome!
📫 Subscribe to our weekly newsletter for updates on the best machine-learning libraries and tools.
Get notified on trending projects, new additions, detailed comparisons, and more!
🥇🥈🥉 Combined project-quality score
⭐️ Star count from GitHub
🐣 New project (less than 6 months old)
💤 Inactive project (6 months no activity)
💀 Dead project (12 months no activity)
📈📉 Project is trending up or down
➕ Project was recently added
❗️ Warning (e.g. missing/risky license)
👨💻 Contributors count from GitHub
🔀 Fork count from GitHub
📋 Issue count from GitHub
⏱️ Last update timestamp on package manager
📥 Download count from package manager
📦 Number of dependent projects
Tensorflow related project
Sklearn related project
PyTorch related project
MxNet related project
Apache Spark related project
Jupyter related project
PaddlePaddle related project
Pandas related project
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇44 · ⭐ 150K) - An Open Source Machine Learning Framework for Everyone. Apache-2
GitHub (👨💻 3.5K · 🔀 84K · 📦 120K · 📋 30K - 14% open · ⏱️ 12.01.2021):
git clone https://github.com/tensorflow/tensorflow
PyPi (📥 5.3M / month · 📦 23K · ⏱️ 14.12.2020):
Conda (📥 2.3M · ⏱️ 15.07.2020):
conda install -c conda-forge tensorflow
Docker Hub (📥 47M · ⭐ 1.8K · ⏱️ 12.01.2021):
docker pull tensorflow/tensorflow
scikit-learn (🥇41 · ⭐ 44K) - scikit-learn: machine learning in Python. BSD-3
GitHub (👨💻 2.1K · 🔀 21K · 📥 630 · 📦 180K · 📋 8.9K - 26% open · ⏱️ 12.01.2021):
git clone https://github.com/scikit-learn/scikit-learn
PyPi (📥 10M / month · 📦 38K · ⏱️ 22.12.2020):
Conda (📥 6.3M · ⏱️ 22.12.2020):
conda install -c conda-forge scikit-learn
PyTorch (🥇39 · ⭐ 45K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
GitHub (👨💻 2.5K · 🔀 12K · 📦 58K · 📋 20K - 36% open · ⏱️ 12.01.2021):
git clone https://github.com/pytorch/pytorch
PyPi (📥 1.7M / month · 📦 6.7K · ⏱️ 10.12.2020):
Conda (📥 9.7M · ⏱️ 10.12.2020):
conda install -c pytorch pytorch
PySpark (🥇38 · ⭐ 29K) - Apache Spark Python API. Apache-2
GitHub (👨💻 2.4K · 🔀 23K · 📦 540 · ⏱️ 12.01.2021):
git clone https://github.com/apache/spark
PyPi (📥 6.6M / month · 📦 760 · ⏱️ 07.09.2020):
Conda (📥 870K · ⏱️ 07.09.2020):
conda install -c conda-forge pyspark
StatsModels (🥇36 · ⭐ 5.9K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
GitHub (👨💻 300 · 🔀 2.1K · 📥 25 · 📦 36K · 📋 4.3K - 47% open · ⏱️ 12.01.2021):
git clone https://github.com/statsmodels/statsmodels
PyPi (📥 1.9M / month · 📦 6.7K · ⏱️ 29.10.2020):
Conda (📥 3.1M · ⏱️ 12.01.2021):
conda install -c conda-forge statsmodels
Keras (🥇35 · ⭐ 51K) - Deep Learning for humans. MIT
GitHub (👨💻 900 · 🔀 19K · 📋 10K - 30% open · ⏱️ 11.01.2021):
git clone https://github.com/keras-team/keras
PyPi (📥 1.9M / month · 📦 15K · ⏱️ 24.06.2020):
Conda (📥 1.4M · ⏱️ 25.06.2020):
conda install -c conda-forge keras
XGBoost (🥇35 · ⭐ 20K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
GitHub (👨💻 500 · 🔀 7.9K · 📥 1.9K · 📦 13K · 📋 3.9K - 6% open · ⏱️ 12.01.2021):
git clone https://github.com/dmlc/xgboost
PyPi (📥 2.5M / month · 📦 1.6K · ⏱️ 09.12.2020):
Conda (📥 1.3M · ⏱️ 10.12.2020):
conda install -c conda-forge xgboost
LightGBM (🥇35 · ⭐ 12K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
GitHub (👨💻 200 · 🔀 3.2K · 📥 85K · 📦 5.3K · 📋 2K - 4% open · ⏱️ 11.01.2021):
git clone https://github.com/microsoft/LightGBM
PyPi (📥 1.4M / month · 📦 560 · ⏱️ 08.12.2020):
Conda (📥 460K · ⏱️ 08.12.2020):
conda install -c conda-forge lightgbm
MXNet (🥈34 · ⭐ 19K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
GitHub (👨💻 950 · 🔀 6.8K · 📥 23K · 📦 1.7K · 📋 9.4K - 19% open · ⏱️ 12.01.2021):
git clone https://github.com/apache/incubator-mxnet
PyPi (📥 150K / month · 📦 440 · ⏱️ 28.08.2020):
Conda (📥 5.7K · ⏱️ 29.02.2020):
conda install -c anaconda mxnet
Theano (🥈34 · ⭐ 9.3K) - Theano is a Python library that allows you to define, optimize, and.. BSD-3
GitHub (👨💻 380 · 🔀 2.5K · 📦 10K · 📋 2.8K - 24% open · ⏱️ 05.09.2020):
git clone https://github.com/Theano/Theano
PyPi (📥 200K / month · 📦 5.5K · ⏱️ 27.07.2020):
Conda (📥 1.3M · ⏱️ 01.11.2020):
conda install -c conda-forge theano
pytorch-lightning (🥈33 · ⭐ 11K) - The lightweight PyTorch wrapper for high-performance.. Apache-2
GitHub (👨💻 360 · 🔀 1.3K · 📥 34 · 📦 1.6K · 📋 2.7K - 12% open · ⏱️ 12.01.2021):
git clone https://github.com/PyTorchLightning/pytorch-lightning
PyPi (📥 94K / month · 📦 14 · ⏱️ 06.01.2021):
pip install pytorch-lightning
Conda (📥 14K · ⏱️ 06.01.2021):
conda install -c conda-forge pytorch-lightning
Fastai (🥈32 · ⭐ 20K) - The fastai deep learning library. Apache-2
jax (🥈32 · ⭐ 11K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
GitHub (👨💻 240 · 🔀 930 · 📦 980 · 📋 2.1K - 34% open · ⏱️ 12.01.2021):
git clone https://github.com/google/jax
PyPi (📥 84K / month · 📦 46 · ⏱️ 12.01.2021):
Conda (📥 67K · ⏱️ 14.10.2020):
conda install -c conda-forge jaxlib
Thinc (🥈32 · ⭐ 2.1K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
GitHub (👨💻 36 · 🔀 200 · 📦 11K · 📋 98 - 17% open · ⏱️ 05.01.2021):
git clone https://github.com/explosion/thinc
PyPi (📥 830K / month · 📦 1.1K · ⏱️ 16.12.2020):
Conda (📥 840K · ⏱️ 18.12.2020):
conda install -c conda-forge thinc
Catboost (🥈31 · ⭐ 5.6K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
GitHub (👨💻 720 · 🔀 860 · 📥 47K · 📋 1.3K - 22% open · ⏱️ 12.01.2021):
git clone https://github.com/catboost/catboost
PyPi (📥 760K / month · 📦 160 · ⏱️ 27.12.2020):
Conda (📥 550K · ⏱️ 29.12.2020):
conda install -c conda-forge catboost
Chainer (🥈31 · ⭐ 5.5K) - A flexible framework of neural networks for deep learning. MIT
TFlearn (🥈30 · ⭐ 9.5K) - Deep learning library featuring a higher-level API for TensorFlow. MIT
Vowpal Wabbit (🥈30 · ⭐ 7.4K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
PaddlePaddle (🥈29 · ⭐ 14K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
Turi Create (🥈29 · ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
tensorpack (🥈29 · ⭐ 5.9K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
Sonnet (🥉28 · ⭐ 8.7K) - TensorFlow-based neural network library. Apache-2
GitHub (👨💻 48 · 🔀 1.2K · 📦 480 · 📋 150 - 10% open · ⏱️ 08.10.2020):
git clone https://github.com/deepmind/sonnet
PyPi (📥 59K / month · 📦 82 · ⏱️ 27.03.2020):
Conda (📥 6.9K · ⏱️ 14.11.2020):
conda install -c conda-forge sonnet
Flax (🥉28 · ⭐ 1.4K) - Flax is a neural network ecosystem for JAX that is designed for.. Apache-2
jax
CNTK (🥉27 · ⭐ 17K · 💤) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
skorch (🥉27 · ⭐ 3.7K) - A scikit-learn compatible neural network library that wraps.. BSD-3
GitHub (👨💻 39 · 🔀 270 · 📦 250 · 📋 360 - 13% open · ⏱️ 25.12.2020):
git clone https://github.com/skorch-dev/skorch
PyPi (📥 6.2K / month · 📦 13 · ⏱️ 30.08.2020):
Conda (📥 160K · ⏱️ 19.09.2020):
conda install -c conda-forge skorch
Ignite (🥉27 · ⭐ 3.2K) - High-level library to help with training and evaluating neural.. BSD-3
GitHub (👨💻 120 · 🔀 400 · 📦 690 · 📋 680 - 14% open · ⏱️ 11.01.2021):
git clone https://github.com/pytorch/ignite
PyPi (📥 31K / month · 📦 62 · ⏱️ 12.01.2021):
pip install pytorch-ignite
Conda (📥 51K · ⏱️ 20.09.2020):
conda install -c pytorch ignite
dyNET (🥉27 · ⭐ 3.2K) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
Ludwig (🥉26 · ⭐ 7.4K) - Ludwig is a toolbox that allows to train and evaluate deep.. Apache-2
mlpack (🥉26 · ⭐ 3.5K) - mlpack: a scalable C++ machine learning library --. BSD-3
GitHub (👨💻 260 · 🔀 1.3K · 📋 1.3K - 11% open · ⏱️ 12.01.2021):
git clone https://github.com/mlpack/mlpack
PyPi (📥 240 / month · ⏱️ 28.10.2020):
Conda (📥 62K · ⏱️ 29.10.2020):
conda install -c conda-forge mlpack
Neural Network Libraries (🥉25 · ⭐ 2.4K) - Neural Network Libraries. Apache-2
tensorflow-upstream (🥉25 · ⭐ 530 · ➕) - TensorFlow ROCm port. Apache-2
einops (🥉24 · ⭐ 2.1K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
GitHub (👨💻 10 · 🔀 68 · 📦 180 · 📋 61 - 36% open · ⏱️ 09.01.2021):
git clone https://github.com/arogozhnikov/einops
PyPi (📥 11K / month · 📦 10 · ⏱️ 08.09.2020):
Conda (📥 3.5K · ⏱️ 15.10.2020):
conda install -c conda-forge einops
ktrain (🥉24 · ⭐ 710) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
xLearn (🥉23 · ⭐ 2.8K · 💤) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
SHOGUN (🥉23 · ⭐ 2.8K) - Unified and efficient Machine Learning. BSD-3
GitHub (👨💻 250 · 🔀 1K · 📋 1.6K - 33% open · ⏱️ 08.12.2020):
git clone https://github.com/shogun-toolbox/shogun
Conda (📥 90K · ⏱️ 25.06.2018):
conda install -c conda-forge shogun
Docker Hub (📥 1.4K · ⭐ 1 · ⏱️ 31.01.2019):
docker pull shogun/shogun
fklearn (🥉22 · ⭐ 1.3K · ➕) - fklearn: Functional Machine Learning. Apache-2
mace (🥉21 · ⭐ 4.3K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
Neural Tangents (🥉21 · ⭐ 1.3K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
ThunderSVM (🥉20 · ⭐ 1.3K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
Haiku (🥉20 · ⭐ 900) - JAX-based neural network library. Apache-2
Torchbearer (🥉18 · ⭐ 580 · 💤) - torchbearer: A model fitting library for PyTorch. MIT
ThunderGBM (🥉16 · ⭐ 580) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
NeoML (🥉13 · ⭐ 550) - Machine learning framework for both deep learning and traditional.. Apache-2
Show 8 hidden projects...
dlib (🥈33 · ⭐ 9.8K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
NuPIC (🥉25 · ⭐ 6.2K · 💀) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
Lasagne (🥉24 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano. MIT
neon (🥉23 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best.. Apache-2
MindsDB (🥉20 · ⭐ 3.1K) - Predictive AI layer for existing databases. ❗️GPL-3.0
NeuPy (🥉20 · ⭐ 660 · 💀) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
elegy (🥉16 · ⭐ 150) - Elegy is a Neural Networks framework based on Jax and inspired.. Apache-2
jax
StarSpace (🥉13 · ⭐ 3.5K · 💀) - Learning embeddings for classification, retrieval and ranking. MIT
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇41 · ⭐ 13K · 📈) - matplotlib: plotting with Python. Python-2.0
GitHub (👨💻 1.2K · 🔀 5.6K · 📦 310K · 📋 7.6K - 21% open · ⏱️ 12.01.2021):
git clone https://github.com/matplotlib/matplotlib
PyPi (📥 9.2M / month · 📦 79K · ⏱️ 12.11.2020):
Conda (📥 7.7M · ⏱️ 18.11.2020):
conda install -c conda-forge matplotlib
Plotly (🥇35 · ⭐ 8.6K) - The interactive graphing library for Python (includes Plotly Express). MIT
GitHub (👨💻 160 · 🔀 1.7K · 📦 5 · 📋 1.8K - 42% open · ⏱️ 12.01.2021):
git clone https://github.com/plotly/plotly.py
PyPi (📥 2.7M / month · 📦 5K · ⏱️ 12.01.2021):
Conda (📥 1.1M · ⏱️ 12.01.2021):
conda install -c conda-forge plotly
NPM (📥 26K / month · 📦 4 · ⏱️ 12.01.2021):
Seaborn (🥇35 · ⭐ 8K) - Statistical data visualization using matplotlib. BSD-3
GitHub (👨💻 140 · 🔀 1.4K · 📥 110 · 📦 77K · 📋 1.7K - 4% open · ⏱️ 10.01.2021):
git clone https://github.com/mwaskom/seaborn
PyPi (📥 1.9M / month · 📦 13K · ⏱️ 20.12.2020):
Conda (📥 1.9M · ⏱️ 21.12.2020):
conda install -c conda-forge seaborn
dash (🥇34 · ⭐ 14K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
GitHub (👨💻 70 · 🔀 1.4K · 📦 16K · 📋 950 - 42% open · ⏱️ 07.01.2021):
git clone https://github.com/plotly/dash
PyPi (📥 220K / month · 📦 1.6K · ⏱️ 09.12.2020):
Conda (📥 210K · ⏱️ 11.12.2020):
conda install -c conda-forge dash
Bokeh (🥇33 · ⭐ 15K) - Interactive Data Visualization in the browser, from Python. BSD-3
GitHub (👨💻 540 · 🔀 3.6K · 📦 30K · 📋 6.3K - 9% open · ⏱️ 11.01.2021):
git clone https://github.com/bokeh/bokeh
PyPi (📥 1M / month · 📦 5.9K · ⏱️ 11.01.2021):
Conda (📥 3.6M · ⏱️ 23.11.2020):
conda install -c conda-forge bokeh
pyecharts (🥈31 · ⭐ 10K) - Python Echarts Plotting Library. MIT
wordcloud (🥈31 · ⭐ 7.8K) - A little word cloud generator in Python. MIT
GitHub (👨💻 58 · 🔀 2K · 📦 8.4K · 📋 430 - 20% open · ⏱️ 11.11.2020):
git clone https://github.com/amueller/word_cloud
PyPi (📥 230K / month · 📦 1.1K · ⏱️ 11.11.2020):
Conda (📥 180K · ⏱️ 16.11.2020):
conda install -c conda-forge wordcloud
Altair (🥈31 · ⭐ 6.3K) - Declarative statistical visualization library for Python. BSD-3
GitHub (👨💻 120 · 🔀 560 · 📦 7.2K · 📋 1.5K - 18% open · ⏱️ 12.01.2021):
git clone https://github.com/altair-viz/altair
PyPi (📥 650K / month · 📦 370 · ⏱️ 01.04.2020):
Conda (📥 580K · ⏱️ 01.04.2020):
conda install -c conda-forge altair
bqplot (🥈30 · ⭐ 3K) - Plotting library for IPython/Jupyter notebooks. Apache-2
GitHub (👨💻 51 · 🔀 400 · 📦 1.2K · 📋 500 - 37% open · ⏱️ 08.01.2021):
git clone https://github.com/bqplot/bqplot
PyPi (📥 11K / month · 📦 110 · ⏱️ 08.01.2021):
Conda (📥 450K · ⏱️ 08.01.2021):
conda install -c conda-forge bqplot
NPM (📥 120K / month · 📦 10 · ⏱️ 05.11.2020):
pandas-profiling (🥈29 · ⭐ 6.6K) - Create HTML profiling reports from pandas DataFrame.. MIT
GitHub (👨💻 65 · 🔀 980 · 📦 2.8K · 📋 410 - 13% open · ⏱️ 12.01.2021):
git clone https://github.com/pandas-profiling/pandas-profiling
PyPi (📥 200K / month · 📦 160 · ⏱️ 03.09.2020):
pip install pandas-profiling
Conda (📥 96K · ⏱️ 09.01.2021):
conda install -c conda-forge pandas-profiling
UMAP (🥈29 · ⭐ 4.4K · 📈) - Uniform Manifold Approximation and Projection. BSD-3
PyQtGraph (🥈29 · ⭐ 2.3K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
GitHub (👨💻 180 · 🔀 790 · 📋 720 - 39% open · ⏱️ 06.01.2021):
git clone https://github.com/pyqtgraph/pyqtgraph
PyPi (📥 27K / month · 📦 890 · ⏱️ 20.12.2020):
Conda (📥 160K · ⏱️ 20.12.2020):
conda install -c conda-forge pyqtgraph
HoloViews (🥈29 · ⭐ 1.8K) - With Holoviews, your data visualizes itself. BSD-3
GitHub (👨💻 100 · 🔀 300 · 📋 2.5K - 27% open · ⏱️ 12.01.2021):
git clone https://github.com/holoviz/holoviews
PyPi (📥 85K / month · 📦 170 · ⏱️ 27.12.2020):
Conda (📥 400K · ⏱️ 28.12.2020):
conda install -c conda-forge holoviews
NPM (📥 5.7K / month · ⏱️ 24.05.2020):
npm install @pyviz/jupyterlab_pyviz
Graphviz (🥈29 · ⭐ 890) - Simple Python interface for Graphviz. MIT
VisPy (🥈28 · ⭐ 2.6K) - High-performance interactive 2D/3D data visualization library. BSD-3
GitHub (👨💻 140 · 🔀 540 · 📦 440 · 📋 1.1K - 31% open · ⏱️ 28.11.2020):
git clone https://github.com/vispy/vispy
PyPi (📥 13K / month · 📦 120 · ⏱️ 28.11.2020):
Conda (📥 120K · ⏱️ 28.11.2020):
conda install -c conda-forge vispy
NPM (📥 130 / month · ⏱️ 15.03.2020):
datashader (🥈28 · ⭐ 2.4K) - Quickly and accurately render even the largest data. BSD-3
GitHub (👨💻 43 · 🔀 310 · 📦 550 · 📋 460 - 31% open · ⏱️ 07.01.2021):
git clone https://github.com/holoviz/datashader
PyPi (📥 11K / month · 📦 70 · ⏱️ 07.01.2021):
Conda (📥 130K · ⏱️ 08.01.2021):
conda install -c conda-forge datashader
missingno (🥈27 · ⭐ 2.6K) - Missing data visualization module for Python. MIT
GitHub (👨💻 15 · 🔀 330 · 📦 2.6K · 📋 100 - 14% open · ⏱️ 28.12.2020):
git clone https://github.com/ResidentMario/missingno
PyPi (📥 170K / month · 📦 76 · ⏱️ 29.06.2018):
Conda (📥 68K · ⏱️ 15.02.2020):
conda install -c conda-forge missingno
data-validation (🥈27 · ⭐ 500 · ➕) - Library for exploring and validating machine learning.. Apache-2
Perspective (🥉26 · ⭐ 3.1K) - Streaming pivot visualization via WebAssembly. Apache-2
GitHub (👨💻 61 · 🔀 340 · 📦 160 · 📋 370 - 19% open · ⏱️ 08.01.2021):
git clone https://github.com/finos/perspective
PyPi (📥 460 / month · 📦 4 · ⏱️ 15.10.2020):
pip install perspective-python
NPM (📥 940 / month · ⏱️ 08.01.2021):
npm install @finos/perspective-jupyterlab
Cufflinks (🥉26 · ⭐ 2K) - Productivity Tools for Plotly + Pandas. MIT
PyVista (🥉26 · ⭐ 650) - 3D plotting and mesh analysis through a streamlined interface for the.. MIT
GitHub (👨💻 48 · 🔀 130 · 📥 37 · 📦 230 · 📋 380 - 30% open · ⏱️ 11.01.2021):
git clone https://github.com/pyvista/pyvista
PyPi (📥 7.4K / month · 📦 26 · ⏱️ 10.12.2020):
Conda (📥 53K · ⏱️ 10.12.2020):
conda install -c conda-forge pyvista
HyperTools (🥉25 · ⭐ 1.6K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
hvPlot (🥉25 · ⭐ 330) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
GitHub (👨💻 22 · 🔀 49 · 📦 420 · 📋 310 - 31% open · ⏱️ 06.01.2021):
git clone https://github.com/holoviz/hvplot
PyPi (📥 45K / month · 📦 15 · ⏱️ 02.06.2020):
Conda (📥 57K · ⏱️ 06.01.2021):
conda install -c conda-forge hvplot
Chartify (🥉24 · ⭐ 2.8K) - Python library that makes it easy for data scientists to create.. Apache-2
GitHub (👨💻 19 · 🔀 250 · 📦 52 · 📋 70 - 57% open · ⏱️ 02.11.2020):
git clone https://github.com/spotify/chartify
PyPi (📥 5.6K / month · 📦 5 · ⏱️ 02.11.2020):
Conda (📥 12K · ⏱️ 07.11.2020):
conda install -c conda-forge chartify
pythreejs (🥉24 · ⭐ 700) - A Jupyter - Three.js bridge. BSD-3
GitHub (👨💻 24 · 🔀 160 · 📦 15 · 📋 200 - 30% open · ⏱️ 09.10.2020):
git clone https://github.com/jupyter-widgets/pythreejs
PyPi (📥 5.8K / month · 📦 13 · ⏱️ 09.10.2020):
Conda (📥 260K · ⏱️ 12.10.2020):
conda install -c conda-forge pythreejs
NPM (📥 4.1K / month · 📦 8 · ⏱️ 19.03.2020):
npm install jupyter-threejs
Facets Overview (🥉23 · ⭐ 6.5K) - Visualizations for machine learning datasets. Apache-2
Multicore-TSNE (🥉23 · ⭐ 1.5K · ➕) - Parallel t-SNE implementation with Python and Torch.. BSD-3
GitHub (👨💻 15 · 🔀 190 · 📦 200 · 📋 53 - 62% open · ⏱️ 19.08.2020):
git clone https://github.com/DmitryUlyanov/Multicore-TSNE
PyPi (📥 2.9K / month · 📦 14 · ⏱️ 08.11.2017):
pip install MulticoreTSNE
Conda (📥 5.8K · ⏱️ 12.11.2018):
conda install -c conda-forge multicore-tsne
openTSNE (🥉23 · ⭐ 740) - Extensible, parallel implementations of t-SNE. BSD-3
GitHub (👨💻 10 · 🔀 82 · 📦 170 · 📋 71 - 4% open · ⏱️ 08.01.2021):
git clone https://github.com/pavlin-policar/openTSNE
PyPi (📥 8.9K / month · 📦 4 · ⏱️ 08.01.2021):
Conda (📥 73K · ⏱️ 08.01.2021):
conda install -c conda-forge opentsne
PandasGUI (🥉22 · ⭐ 2K) - A GUI for Pandas DataFrames. MIT
Pandas-Bokeh (🥉22 · ⭐ 600) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
python-ternary (🥉22 · ⭐ 380 · ➕) - Ternary plotting library for python with matplotlib. MIT
GitHub (👨💻 25 · 🔀 110 · 📥 14 · 📦 50 · 📋 100 - 23% open · ⏱️ 05.01.2021):
git clone https://github.com/marcharper/python-ternary
PyPi (📥 1K / month · 📦 10 · ⏱️ 10.05.2020):
pip install python-ternary
Conda (📥 47K · ⏱️ 10.05.2020):
conda install -c conda-forge python-ternary
vega (🥉22 · ⭐ 290) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
GitHub (👨💻 9 · 🔀 46 · 📋 87 - 9% open · ⏱️ 11.01.2021):
git clone https://github.com/vega/ipyvega
PyPi (📥 6K / month · 📦 150 · ⏱️ 15.05.2020):
Conda (📥 370K · ⏱️ 10.12.2020):
conda install -c conda-forge vega
joypy (🥉21 · ⭐ 310 · ➕) - Joyplots in Python with matplotlib & pandas. MIT
GitHub (👨💻 5 · 🔀 32 · 📦 61 · 📋 38 - 10% open · ⏱️ 28.12.2020):
git clone https://github.com/sbebo/joypy
PyPi (📥 3.2K / month · 📦 6 · ⏱️ 28.12.2020):
Conda (📥 7.8K · ⏱️ 28.12.2020):
conda install -c conda-forge joypy
HiPlot (🥉19 · ⭐ 1.9K) - HiPlot makes understanding high dimensional data easy. MIT
GitHub (👨💻 6 · 🔀 88 · 📦 2 · 📋 47 - 12% open · ⏱️ 11.01.2021):
git clone https://github.com/facebookresearch/hiplot
PyPi (📥 2K / month · ⏱️ 23.12.2020):
Conda (📥 41K · ⏱️ 23.12.2020):
conda install -c conda-forge hiplot
Sweetviz (🥉19 · ⭐ 1.2K) - Visualize and compare datasets, target values and associations, with one.. MIT
lets-plot (🥉19 · ⭐ 470 · ➕) - An open-source plotting library for statistical data. MIT
animatplot (🥉19 · ⭐ 350 · ➕) - A python package for animating plots build on matplotlib. MIT
GitHub (👨💻 7 · 🔀 33 · 📦 14 · 📋 31 - 51% open · ⏱️ 05.10.2020):
git clone https://github.com/t-makaro/animatplot
PyPi (📥 100 / month · 📦 1 · ⏱️ 05.10.2020):
Conda (📥 5K · ⏱️ 06.10.2020):
conda install -c conda-forge animatplot
AutoViz (🥉19 · ⭐ 280) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
PyWaffle (🥉18 · ⭐ 380) - Make Waffle Charts in Python. MIT
nx-altair (🥉13 · ⭐ 150 · 💤) - Draw interactive NetworkX graphs with Altair. MIT
Show 6 hidden projects...
plotnine (🥈28 · ⭐ 2.5K · ➕) - A grammar of graphics for Python. ❗️GPL-2.0
PDPbox (🥉22 · ⭐ 520 · 💀) - python partial dependence plot toolbox. MIT
pivottablejs (🥉19 · ⭐ 410 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
ivis (🥉18 · ⭐ 220 · ➕) - Dimensionality reduction in very large datasets using Siamese.. ❗️GPL-2.0
pdvega (🥉16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite. MIT
nptsne (🥉14 · ⭐ 24) - nptsne is a numpy compatible python binary package that offers a number.. Apache-2
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
spaCy (🥇37 · ⭐ 18K) - Industrial-strength Natural Language Processing (NLP) with Python and Cython. MIT
GitHub (👨💻 540 · 🔀 3.2K · 📥 2.9K · 📦 20K · 📋 4.2K - 2% open · ⏱️ 08.01.2021):
git clone https://github.com/explosion/spaCy
PyPi (📥 880K / month · 📦 3.1K · ⏱️ 11.12.2020):
Conda (📥 1.4M · ⏱️ 18.12.2020):
conda install -c conda-forge spacy
transformers (🥇36 · ⭐ 39K) - Transformers: State-of-the-art Natural Language.. Apache-2
GitHub (👨💻 750 · 🔀 9.6K · 📥 1.2K · 📦 6.8K · 📋 5.6K - 9% open · ⏱️ 12.01.2021):
git clone https://github.com/huggingface/transformers
PyPi (📥 670K / month · 📦 130 · ⏱️ 17.12.2020):
Conda (📥 14K · ⏱️ 19.12.2020):
conda install -c conda-forge transformers
nltk (🥇34 · ⭐ 9.5K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
GitHub (👨💻 390 · 🔀 2.4K · 📦 86K · 📋 1.5K - 16% open · ⏱️ 02.01.2021):
git clone https://github.com/nltk/nltk
PyPi (📥 4.6M / month · 📦 21K · ⏱️ 12.04.2020):
Conda (📥 610K · ⏱️ 08.08.2019):
conda install -c conda-forge nltk
Rasa (🥇32 · ⭐ 11K) - Open source machine learning framework to automate text- and voice-.. Apache-2
fairseq (🥇31 · ⭐ 11K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
ChatterBot (🥇31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
sentencepiece (🥇31 · ⭐ 4.7K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
GitHub (👨💻 48 · 🔀 630 · 📥 9.5K · 📦 5.3K · 📋 410 - 5% open · ⏱️ 12.01.2021):
git clone https://github.com/google/sentencepiece
PyPi (📥 1M / month · 📦 240 · ⏱️ 10.01.2021):
pip install sentencepiece
Conda (📥 22K · ⏱️ 08.01.2021):
conda install -c conda-forge sentencepiece
flair (🥇30 · ⭐ 9.8K) - A very simple framework for state-of-the-art Natural Language.. MIT
torchtext (🥇30 · ⭐ 2.6K) - Data loaders and abstractions for text and NLP. BSD-3
fastText (🥈29 · ⭐ 22K) - Library for fast text representation and classification. MIT
GitHub (👨💻 58 · 🔀 4.2K · 📦 1.4K · 📋 990 - 40% open · ⏱️ 18.07.2020):
git clone https://github.com/facebookresearch/fastText
PyPi (📥 110K / month · 📦 190 · ⏱️ 28.04.2020):
Conda (📥 17K · ⏱️ 12.10.2020):
conda install -c conda-forge fasttext
AllenNLP (🥈29 · ⭐ 9.6K) - An open-source NLP research library, built on PyTorch. Apache-2
TextBlob (🥈29 · ⭐ 7.5K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
GitHub (👨💻 33 · 🔀 950 · 📥 88 · 📦 9.8K · 📋 220 - 31% open · ⏱️ 11.01.2021):
git clone https://github.com/sloria/TextBlob
PyPi (📥 250K / month · 📦 2.5K · ⏱️ 24.02.2019):
Conda (📥 110K · ⏱️ 24.02.2019):
conda install -c conda-forge textblob
snowballstemmer (🥈29 · ⭐ 460 · ➕) - Snowball compiler and stemming algorithms. BSD-3
GitHub (👨💻 24 · 🔀 130 · 📦 42K · 📋 61 - 31% open · ⏱️ 23.11.2020):
git clone https://github.com/snowballstem/snowball
PyPi (📥 2.3M / month · 📦 13K · ⏱️ 03.10.2019):
pip install snowballstemmer
Conda (📥 1.9M · ⏱️ 03.10.2019):
conda install -c conda-forge snowballstemmer
Dedupe (🥈28 · ⭐ 2.9K) - A python library for accurate and scalable fuzzy matching, record.. MIT
phonenumbers (🥈28 · ⭐ 2.6K) - Python port of Google's libphonenumber. Apache-2
GitHub (👨💻 22 · 🔀 320 · 📋 110 - 2% open · ⏱️ 12.01.2021):
git clone https://github.com/daviddrysdale/python-phonenumbers
PyPi (📥 710K / month · 📦 2.3K · ⏱️ 12.01.2021):
Conda (📥 370K · ⏱️ 04.08.2019):
conda install -c conda-forge phonenumbers
inflect (🥈28 · ⭐ 470) - Correctly generate plurals, ordinals, indefinite articles; convert numbers.. MIT
GitHub (👨💻 25 · 🔀 57 · 📋 71 - 18% open · ⏱️ 15.11.2020):
git clone https://github.com/jaraco/inflect
PyPi (📥 890K / month · 📦 1.4K · ⏱️ 15.11.2020):
Conda (📥 120K · ⏱️ 08.01.2021):
conda install -c conda-forge inflect
OpenNMT (🥈27 · ⭐ 4.8K) - Open Source Neural Machine Translation in PyTorch. MIT
Tokenizers (🥈27 · ⭐ 4.2K · 📉) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
GitHub (👨💻 40 · 🔀 300 · 📦 22 · 📋 370 - 21% open · ⏱️ 12.01.2021):
git clone https://github.com/huggingface/tokenizers
PyPi (📥 870K / month · ⏱️ 08.12.2020):
Conda (📥 17K · ⏱️ 19.11.2020):
conda install -c conda-forge tokenizers
GluonNLP (🥈27 · ⭐ 2.2K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
textacy (🥈27 · ⭐ 1.6K) - NLP, before and after spaCy. Apache-2
GitHub (👨💻 29 · 🔀 210 · 📋 220 - 13% open · ⏱️ 09.01.2021):
git clone https://github.com/chartbeat-labs/textacy
PyPi (📥 20K / month · 📦 77 · ⏱️ 29.08.2020):
Conda (📥 74K · ⏱️ 19.11.2020):
conda install -c conda-forge textacy
DeepPavlov (🥈26 · ⭐ 4.9K) - An open source library for deep learning end-to-end dialog.. Apache-2
Jina (🥈26 · ⭐ 1.8K) - An easier way to build neural search in the cloud. Apache-2
GitHub (👨💻 81 · 🔀 320 · 📦 45 · 📋 580 - 7% open · ⏱️ 12.01.2021):
git clone https://github.com/jina-ai/jina
PyPi (📥 1.7K / month · ⏱️ 12.01.2021):
Docker Hub (📥 70K · ⏱️ 12.01.2021):
TensorFlow Text (🥈26 · ⭐ 680) - Making text a first-class citizen in TensorFlow. Apache-2
ftfy (🥈25 · ⭐ 2.9K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
GitHub (👨💻 17 · 🔀 98 · 📦 2.6K · 📋 110 - 13% open · ⏱️ 17.07.2020):
git clone https://github.com/LuminosoInsight/python-ftfy
PyPi (📥 260K / month · 📦 760 · ⏱️ 20.07.2020):
Conda (📥 94K · ⏱️ 03.02.2019):
conda install -c conda-forge ftfy
vaderSentiment (🥈25 · ⭐ 2.7K · 💤) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary.. MIT
TextDistance (🥈25 · ⭐ 1.9K · 💤) - Compute distance between sequences. 30+ algorithms, pure.. MIT
GitHub (👨💻 7 · 🔀 160 · 📥 96 · 📦 290 · ⏱️ 13.04.2020):
git clone https://github.com/life4/textdistance
PyPi (📥 97K / month · 📦 28 · ⏱️ 13.04.2020):
Conda (📥 16K · ⏱️ 10.11.2020):
conda install -c conda-forge textdistance
spark-nlp (🥈25 · ⭐ 1.8K) - State of the Art Natural Language Processing. Apache-2
jellyfish (🥈25 · ⭐ 1.4K) - a python library for doing approximate and phonetic matching of.. BSD-2
GitHub (👨💻 20 · 🔀 120 · 📦 2K · 📋 95 - 9% open · ⏱️ 30.12.2020):
git clone https://github.com/jamesturk/jellyfish
PyPi (📥 760K / month · 📦 650 · ⏱️ 21.05.2020):
Conda (📥 110K · ⏱️ 08.01.2021):
conda install -c conda-forge jellyfish
haystack (🥈25 · ⭐ 1.2K) - Transformers at scale for question answering & neural search. Using.. Apache-2
ParlAI (🥈24 · ⭐ 6.9K) - A framework for training and evaluating AI models on a variety of.. MIT
PyText (🥈24 · ⭐ 6.1K) - A natural language modeling framework based on PyTorch. BSD-3
stanza (🥈24 · ⭐ 5.1K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
GitHub (👨💻 28 · 🔀 630 · 📋 430 - 12% open · ⏱️ 12.01.2021):
git clone https://github.com/stanfordnlp/stanza
PyPi (📥 12K / month · 📦 2 · ⏱️ 13.08.2020):
Conda (📥 3K · ⏱️ 13.08.2020):
conda install -c stanfordnlp stanza
T5 (🥈24 · ⭐ 3.1K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
Sumy (🥈24 · ⭐ 2.5K) - Module for automatic summarization of text documents and HTML pages. Apache-2
fastNLP (🥈24 · ⭐ 2K · 📉) - fastNLP: A Modularized and Extensible NLP Framework. Currently.. Apache-2
PyTextRank (🥈24 · ⭐ 1.4K) - Python implementation of TextRank for phrase extraction and.. MIT
CLTK (🥈24 · ⭐ 630 · 📉) - The Classical Language Toolkit. MIT
pyahocorasick (🥈24 · ⭐ 570) - Python module (C extension and plain python) implementing Aho-.. BSD-3
GitHub (👨💻 20 · 🔀 86 · 📦 470 · 📋 96 - 32% open · ⏱️ 12.01.2021):
git clone https://github.com/WojciechMula/pyahocorasick
PyPi (📥 94K / month · 📦 64 · ⏱️ 14.01.2019):
pip install pyahocorasick
Conda (📥 110K · ⏱️ 13.10.2020):
conda install -c conda-forge pyahocorasick
Ciphey (🥉23 · ⭐ 6.1K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
GitHub (👨💻 38 · 🔀 330 · 📋 220 - 20% open · ⏱️ 02.01.2021):
git clone https://github.com/Ciphey/Ciphey
PyPi (📥 4.5K / month · ⏱️ 02.12.2020):
Docker Hub (📥 7.2K · ⭐ 1 · ⏱️ 17.12.2020):
docker pull remnux/ciphey
flashtext (🥉23 · ⭐ 4.6K · 💤) - Extract Keywords from sentence or Replace keywords in sentences. MIT
textgenrnn (🥉23 · ⭐ 4.2K) - Easily train your own text-generating neural network of any size.. MIT
neuralcoref (🥉23 · ⭐ 2.2K) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
GitHub (👨💻 20 · 🔀 380 · 📥 160 · 📦 270 · 📋 250 - 15% open · ⏱️ 29.12.2020):
git clone https://github.com/huggingface/neuralcoref
PyPi (📥 3K / month · 📦 9 · ⏱️ 08.04.2019):
Conda (📥 5.6K · ⏱️ 21.02.2020):
conda install -c conda-forge neuralcoref
sense2vec (🥉23 · ⭐ 1.1K · 💤) - Contextually-keyed word vectors. MIT
GitHub (👨💻 14 · 🔀 200 · 📥 12K · 📦 48 · 📋 93 - 16% open · ⏱️ 29.05.2020):
git clone https://github.com/explosion/sense2vec
PyPi (📥 2.5K / month · 📦 6 · ⏱️ 22.11.2019):
Conda (📥 14K · ⏱️ 16.03.2020):
conda install -c conda-forge sense2vec
spacy-transformers (🥉23 · ⭐ 860) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT
spacy
SciSpacy (🥉23 · ⭐ 770) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
pySBD (🥉23 · ⭐ 250) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
Snips NLU (🥉22 · ⭐ 3.4K · 💤) - Snips Python library to extract meaning from text. Apache-2
pytorch-nlp (🥉22 · ⭐ 1.8K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
scattertext (🥉22 · ⭐ 1.5K) - Beautiful visualizations of how language differs among document.. Apache-2
GitHub (👨💻 10 · 🔀 200 · 📦 140 · 📋 70 - 24% open · ⏱️ 18.12.2020):
git clone https://github.com/JasonKessler/scattertext
PyPi (📥 2.4K / month · 📦 8 · ⏱️ 14.12.2020):
Conda (📥 43K · ⏱️ 18.12.2020):
conda install -c conda-forge scattertext
fast-bert (🥉22 · ⭐ 1.5K) - Super easy library for BERT based NLP models. Apache-2
NLP Architect (🥉21 · ⭐ 2.6K) - A model library for exploring state-of-the-art deep learning.. Apache-2
Texar (🥉21 · ⭐ 2.1K) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
DeepMatcher (🥉20 · ⭐ 3.4K · 💤) - Python package for performing Entity and Text Matching using.. BSD-3
gpt-2-simple (🥉20 · ⭐ 2.4K · 💤) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT
NeMo (🥉20 · ⭐ 2.3K) - NeMo: a toolkit for conversational AI. Apache-2
Texthero (🥉20 · ⭐ 2K) - Text preprocessing, representation and visualization from zero to hero. MIT
DELTA (🥉20 · ⭐ 1.4K) - DELTA is a deep learning based natural language and speech.. Apache-2
GitHub (👨💻 41 · 🔀 270 · 📋 75 - 12% open · ⏱️ 17.12.2020):
git clone https://github.com/Delta-ML/delta
PyPi (📥 15 / month · ⏱️ 27.03.2020):
Docker Hub (📥 12K · ⏱️ 12.01.2021):
docker pull zh794390558/delta
FARM (🥉20 · ⭐ 1.1K) - Fast & easy transfer learning for NLP. Harvesting language models.. Apache-2
Sockeye (🥉20 · ⭐ 980) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
finetune (🥉20 · ⭐ 630) - Scikit-learn style model finetuning for NLP. MPL-2.0
Kashgari (🥉19 · ⭐ 2K) - Kashgari is a production-level NLP Transfer learning framework.. Apache-2
YouTokenToMe (🥉19 · ⭐ 710 · 💤) - Unsupervised text tokenizer focused on computational efficiency. MIT
textpipe (🥉19 · ⭐ 260) - Textpipe: clean and extract metadata from text. MIT
skift (🥉18 · ⭐ 210 · ➕) - scikit-learn wrappers for Python fastText. MIT
Camphr (🥉17 · ⭐ 320) - spaCy plugin for Transformers , Udify, ELmo, etc. Apache-2
spacy
VizSeq (🥉16 · ⭐ 300) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
Translate (🥉15 · ⭐ 680) - Translate - a PyTorch Language Library. BSD-3
Headliner (🥉15 · ⭐ 220 · 💤) - Easy training and deployment of seq2seq models. MIT
NeuralQA (🥉15 · ⭐ 180) - NeuralQA: A Usable Library for Question Answering on Large Datasets with.. MIT
OpenNRE (🥉14 · ⭐ 2.9K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
TransferNLP (🥉14 · ⭐ 280 · 💤) - NLP library designed for reproducible experimentation.. MIT
textvec (🥉14 · ⭐ 150 · ➕) - Text vectorization tool to outperform TFIDF for classification.. MIT
Show 10 hidden projects...
gensim (🥇35 · ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
fuzzywuzzy (🥈29 · ⭐ 7.8K · 💤) - Fuzzy String Matching in Python. ❗️GPL-2.0
langid (🥈26 · ⭐ 1.7K · 💀) - Stand-alone language identification system. BSD-3
polyglot (🥈24 · ⭐ 1.7K) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
anaGo (🥉22 · ⭐ 1.4K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
MatchZoo (🥉21 · ⭐ 3.3K · 💀) - Facilitating the design, comparison and sharing of deep.. Apache-2
stop-words (🥉20 · ⭐ 120 · 💀) - Get list of common stop words in various languages in Python. BSD-3
pyfasttext (🥉19 · ⭐ 230 · 💀) - Yet another Python binding for fastText. ❗️GPL-3.0
NeuroNER (🥉18 · ⭐ 1.5K · 💀) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
ONNX-T5 (🥉11 · ⭐ 130 · 🐣) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
Pillow (🥇38 · ⭐ 8.1K) - The friendly PIL fork (Python Imaging Library). ❗️PIL
GitHub (👨💻 340 · 🔀 1.6K · 📦 380K · 📋 2.1K - 11% open · ⏱️ 12.01.2021):
git clone https://github.com/python-pillow/Pillow
PyPi (📥 14M / month · 📦 110K · ⏱️ 02.01.2021):
Conda (📥 6.7M · ⏱️ 11.01.2021):
conda install -c conda-forge pillow
torchvision (🥇36 · ⭐ 8.1K · 📈) - Datasets, Transforms and Models specific to Computer.. BSD-3
GitHub (👨💻 350 · 🔀 4.2K · 📦 39K · 📋 1.5K - 29% open · ⏱️ 11.01.2021):
git clone https://github.com/pytorch/vision
PyPi (📥 750K / month · 📦 4.6K · ⏱️ 10.12.2020):
Conda (📥 34K · ⏱️ 14.10.2018):
conda install -c conda-forge torchvision
scikit-image (🥇36 · ⭐ 4.1K) - Image processing in Python. BSD-2
GitHub (👨💻 480 · 🔀 1.7K · 📦 58K · 📋 2.1K - 30% open · ⏱️ 11.01.2021):
git clone https://github.com/scikit-image/scikit-image
PyPi (📥 1.8M / month · 📦 15K · ⏱️ 23.12.2020):
Conda (📥 2M · ⏱️ 23.12.2020):
conda install -c conda-forge scikit-image
imgaug (🥇31 · ⭐ 11K · 💤) - Image augmentation for machine learning experiments. MIT
GitHub (👨💻 36 · 🔀 2K · 📦 5K · 📋 420 - 51% open · ⏱️ 01.06.2020):
git clone https://github.com/aleju/imgaug
PyPi (📥 140K / month · 📦 280 · ⏱️ 05.02.2020):
Conda (📥 31K · ⏱️ 14.02.2020):
conda install -c conda-forge imgaug
imageio (🥇31 · ⭐ 820) - Python library for reading and writing image data. BSD-2
GitHub (👨💻 71 · 🔀 160 · 📦 35K · 📋 330 - 18% open · ⏱️ 08.01.2021):
git clone https://github.com/imageio/imageio
PyPi (📥 1.9M / month · 📦 3.8K · ⏱️ 06.07.2020):
Conda (📥 1.6M · ⏱️ 06.07.2020):
conda install -c conda-forge imageio
opencv-python (🥈30 · ⭐ 1.7K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Wand (🥈30 · ⭐ 1K) - The ctypes-based simple ImageMagick binding for Python. MIT
Face Recognition (🥈29 · ⭐ 38K) - The world's simplest facial recognition api for Python.. MIT
MoviePy (🥈29 · ⭐ 7.2K) - Video editing with Python. MIT
GitHub (👨💻 130 · 🔀 1K · 📦 5.7K · 📋 970 - 34% open · ⏱️ 12.01.2021):
git clone https://github.com/Zulko/moviepy
PyPi (📥 120K / month · 📦 1.1K · ⏱️ 05.10.2020):
Conda (📥 71K · ⏱️ 23.02.2020):
conda install -c conda-forge moviepy
Albumentations (🥈28 · ⭐ 7K · 📉) - Fast image augmentation library and easy to use wrapper.. MIT
GitHub (👨💻 72 · 🔀 900 · 📦 2.5K · 📋 400 - 42% open · ⏱️ 30.12.2020):
git clone https://github.com/albumentations-team/albumentations
PyPi (📥 58K / month · 📦 130 · ⏱️ 29.11.2020):
pip install albumentations
Conda (📥 14K · ⏱️ 29.11.2020):
conda install -c conda-forge albumentations
GluonCV (🥈28 · ⭐ 4.5K) - Gluon CV Toolkit. Apache-2
Kornia (🥈28 · ⭐ 3.5K) - Open Source Differentiable Computer Vision Library for PyTorch. Apache-2
ImageHash (🥈28 · ⭐ 1.8K) - A Python Perceptual Image Hashing Module. BSD-2
GitHub (👨💻 17 · 🔀 250 · 📦 1.8K · 📋 87 - 19% open · ⏱️ 03.01.2021):
git clone https://github.com/JohannesBuchner/imagehash
PyPi (📥 350K / month · 📦 530 · ⏱️ 19.11.2020):
Conda (📥 97K · ⏱️ 19.11.2020):
conda install -c conda-forge imagehash
PyTorch Image Models (🥈27 · ⭐ 6.5K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
imageai (🥈27 · ⭐ 5.8K) - A python library built to empower developers to build applications and.. MIT
detectron2 (🥈26 · ⭐ 14K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2
InsightFace (🥈26 · ⭐ 8.4K) - Face Analysis Project on MXNet. MIT
MMDetection (🥈25 · ⭐ 13K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
Augmentor (🥈25 · ⭐ 4.3K · 💤) - Image augmentation library in Python for machine learning. MIT
facenet-pytorch (🥈25 · ⭐ 1.8K · 📉) - Pretrained Pytorch face detection (MTCNN) and.. MIT
chainercv (🥈25 · ⭐ 1.4K · 💤) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
mahotas (🥈25 · ⭐ 660 · ➕) - Computer Vision in Python. MIT
GitHub (👨💻 30 · 🔀 120 · 📦 550 · 📋 69 - 18% open · ⏱️ 16.08.2020):
git clone https://github.com/luispedro/mahotas
PyPi (📥 8.8K / month · 📦 190 · ⏱️ 16.08.2020):
Conda (📥 250K · ⏱️ 01.11.2020):
conda install -c conda-forge mahotas
PyTorch3D (🥉24 · ⭐ 4.2K) - PyTorch3D is FAIR's library of reusable components for deep.. MIT
GitHub (👨💻 50 · 🔀 460 · 📦 40 · 📋 460 - 12% open · ⏱️ 11.01.2021):
git clone https://github.com/facebookresearch/pytorch3d
PyPi (📥 4.4K / month · ⏱️ 12.11.2020):
Conda (📥 5.5K · ⏱️ 12.11.2020):
conda install -c pytorch3d pytorch3d
mtcnn (🥉24 · ⭐ 1.4K · 💤) - MTCNN face detection implementation for TensorFlow, as a PIP.. MIT
Face Alignment (🥉23 · ⭐ 4.6K) - 2D and 3D Face alignment library build using pytorch. BSD-3
segmentation_models (🥉23 · ⭐ 2.9K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT
vidgear (🥉23 · ⭐ 1.6K · ➕) - High-performance cross-platform Video Processing Python.. Apache-2
Image Deduplicator (🥉22 · ⭐ 3.3K) - Finding duplicate images made easy!. Apache-2
CellProfiler (🥉22 · ⭐ 530 · ➕) - An open-source application for biological image analysis. BSD-3
pyvips (🥉22 · ⭐ 290) - python binding for libvips using cffi. MIT
GitHub (👨💻 10 · 🔀 25 · 📦 130 · 📋 200 - 33% open · ⏱️ 27.12.2020):
git clone https://github.com/libvips/pyvips
PyPi (📥 3.3K / month · 📦 22 · ⏱️ 18.12.2020):
Conda (📥 5.9K · ⏱️ 14.10.2020):
conda install -c conda-forge pyvips
MMF (🥉21 · ⭐ 4K) - A modular framework for vision & language multimodal research from.. BSD-3
Image Super-Resolution (🥉21 · ⭐ 2.4K) - Super-scale your images and run experiments with.. Apache-2
GitHub (👨💻 9 · 🔀 460 · 📦 38 · 📋 140 - 33% open · ⏱️ 11.11.2020):
git clone https://github.com/idealo/image-super-resolution
PyPi (📥 2.7K / month · 📦 4 · ⏱️ 08.01.2020):
Docker Hub (📥 120 · ⏱️ 01.04.2019):
docker pull idealo/image-super-resolution-gpu
tensorflow-graphics (🥉21 · ⭐ 2.4K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
Luminoth (🥉21 · ⭐ 2.3K · 💤) - Deep Learning toolkit for Computer Vision. BSD-3
caer (🥉21 · ⭐ 300 · 🐣) - A lightweight, scalable Computer Vision library for high-performance AI.. MIT
image-match (🥉20 · ⭐ 2.5K · ➕) - Quickly search over billions of images. Apache-2
Classy Vision (🥉20 · ⭐ 1K) - An end-to-end PyTorch framework for image and video.. MIT
GitHub (👨💻 53 · 🔀 190 · 📋 88 - 52% open · ⏱️ 06.01.2021):
git clone https://github.com/facebookresearch/ClassyVision
PyPi (📥 170 / month · ⏱️ 20.11.2020):
pip install classy_vision
Conda (📥 5.5K · ⏱️ 11.12.2020):
conda install -c conda-forge classy_vision
nude.py (🥉20 · ⭐ 790) - Nudity detection with Python. MIT
vit-pytorch (🥉19 · ⭐ 2.2K · 🐣) - Implementation of Vision Transformer, a simple way to.. MIT
Torch Points 3D (🥉19 · ⭐ 980) - Pytorch framework for doing deep learning on point clouds. BSD-3
Norfair (🥉18 · ⭐ 760 · 🐣) - Lightweight Python library for adding real-time 2D object tracking.. BSD-3
PaddleDetection (🥉17 · ⭐ 2.2K) - Object detection and instance segmentation toolkit.. Apache-2
lightly (🥉16 · ⭐ 390 · 🐣) - A python library for self-supervised learning. MIT
DE⫶TR (🥉14 · ⭐ 5.9K) - End-to-End Object Detection with Transformers. Apache-2
PySlowFast (🥉14 · ⭐ 3.3K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
pycls (🥉14 · ⭐ 1.4K) - Codebase for Image Classification Research, written in PyTorch. MIT
Show 3 hidden projects...
glfw (🥈29 · ⭐ 7.1K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
imutils (🥈27 · ⭐ 3.5K · 💀) - A series of convenience functions to make basic image processing.. MIT
Pillow-SIMD (🥉23 · ⭐ 1.5K · 💤) - The friendly PIL fork. ❗️PIL
Libraries for graph processing, clustering, embedding, and machine learning tasks.
networkx (🥇37 · ⭐ 8.5K) - Network Analysis in Python. BSD-3
GitHub (👨💻 490 · 🔀 2.2K · 📥 51 · 📦 64K · 📋 2.5K - 9% open · ⏱️ 12.01.2021):
git clone https://github.com/networkx/networkx
PyPi (📥 5.2M / month · 📦 21K · ⏱️ 22.08.2020):
Conda (📥 2.8M · ⏱️ 23.08.2020):
conda install -c conda-forge networkx
PyTorch Geometric (🥇28 · ⭐ 9.9K) - Geometric Deep Learning Extension Library for PyTorch. MIT
dgl (🥇27 · ⭐ 6.5K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
StellarGraph (🥈25 · ⭐ 1.7K) - StellarGraph - Machine Learning on Graphs. Apache-2
Spektral (🥈23 · ⭐ 1.6K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
Node2Vec (🥈22 · ⭐ 620) - Implementation of the node2vec algorithm. MIT
GitHub (👨💻 7 · 🔀 160 · 📦 110 · ⏱️ 09.01.2021):
git clone https://github.com/eliorc/node2vec
PyPi (📥 4.4K / month · 📦 10 · ⏱️ 09.01.2021):
Conda (📥 15K · ⏱️ 25.04.2020):
conda install -c conda-forge node2vec
ogb (🥈21 · ⭐ 700) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
torch-cluster (🥈21 · ⭐ 320) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
AmpliGraph (🥉20 · ⭐ 1.4K) - Python library for Representation Learning on Knowledge.. Apache-2
graph-nets (🥉19 · ⭐ 4.7K) - Build Graph Nets in Tensorflow. Apache-2
PyTorch-BigGraph (🥉19 · ⭐ 2.6K) - Generate embeddings from large-scale graph-structured.. BSD-3
Paddle Graph Learning (🥉19 · ⭐ 870) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
PyKEEN (🥉19 · ⭐ 280) - A Python library for learning and evaluating knowledge graph embeddings. MIT
kglib (🥉17 · ⭐ 380) - Grakn Knowledge Graph Library (ML R&D). Apache-2
DeepGraph (🥉17 · ⭐ 230) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
GitHub (👨💻 2 · 🔀 33 · 📦 1 · 📋 12 - 58% open · ⏱️ 01.10.2020):
git clone https://github.com/deepgraph/deepgraph
PyPi (📥 240 / month · ⏱️ 01.10.2020):
Conda (📥 76K · ⏱️ 13.10.2020):
conda install -c conda-forge deepgraph
pytorch_geometric_temporal (🥉16 · ⭐ 310 · ➕) - A Temporal Extension Library for PyTorch Geometric. MIT
AutoGL (🥉15 · ⭐ 560 · 🐣) - An autoML framework & toolkit for machine learning on graphs. MIT
Euler (🥉14 · ⭐ 2.5K) - A distributed graph deep learning framework. Apache-2
GraphEmbedding (🥉14 · ⭐ 1.7K) - Implementation and experiments of graph embedding algorithms. MIT
OpenKE (🥉13 · ⭐ 2.3K · 💤) - An Open-Source Package for Knowledge Embedding (KE). MIT
GraphVite (🥉12 · ⭐ 830 · 💤) - GraphVite: A General and High-performance Graph Embedding.. Apache-2
Show 7 hidden projects...
igraph (🥇27 · ⭐ 760) - Python interface for igraph. ❗️GPL-2.0
pygal (🥈26 · ⭐ 2.3K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
Karate Club (🥈21 · ⭐ 1.1K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
DeepWalk (🥉19 · ⭐ 2.2K · 💤) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
Sematch (🥉17 · ⭐ 340 · 💀) - semantic similarity framework for knowledge graph. Apache-2
GraphSAGE (🥉14 · ⭐ 2.1K · 💀) - Representation learning on large graphs using stochastic.. MIT
OpenNE (🥉14 · ⭐ 1.4K · 💀) - An Open-Source Package for Network Embedding (NE). MIT
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
DeepSpeech (🥇31 · ⭐ 16K) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0
Magenta (🥇29 · ⭐ 16K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
Pydub (🥇29 · ⭐ 5K) - Manipulate audio with a simple and easy high level interface. MIT
GitHub (👨💻 79 · 🔀 670 · 📦 5.6K · 📋 400 - 41% open · ⏱️ 14.12.2020):
git clone https://github.com/jiaaro/pydub
PyPi (📥 120K / month · 📦 1.4K · ⏱️ 03.06.2020):
Conda (📥 12K · ⏱️ 02.02.2019):
conda install -c conda-forge pydub
torchaudio (🥇29 · ⭐ 1.2K) - Data manipulation and transformation for audio signal.. BSD-2
audioread (🥈27 · ⭐ 360 · ➕) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio.. MIT
GitHub (👨💻 20 · 🔀 83 · 📦 3.9K · 📋 75 - 41% open · ⏱️ 20.10.2020):
git clone https://github.com/beetbox/audioread
PyPi (📥 280K / month · 📦 590 · ⏱️ 20.10.2020):
Conda (📥 190K · ⏱️ 08.12.2020):
conda install -c conda-forge audioread
librosa (🥈26 · ⭐ 4.2K) - Python library for audio and music analysis. ISC
GitHub (👨💻 80 · 🔀 680 · 📦 8.8K · 📋 830 - 6% open · ⏱️ 11.09.2020):
git clone https://github.com/librosa/librosa
PyPi (📥 300K / month · 📦 1.8K · ⏱️ 22.07.2020):
Conda (📥 240K · ⏱️ 22.07.2020):
conda install -c conda-forge librosa
spleeter (🥈25 · ⭐ 15K · 📉) - Deezer source separation library including pretrained models. MIT
GitHub (👨💻 17 · 🔀 1.5K · 📥 890K · 📋 500 - 18% open · ⏱️ 11.01.2021):
git clone https://github.com/deezer/spleeter
PyPi (📥 5.8K / month · ⏱️ 08.01.2021):
Conda (📥 41K · ⏱️ 30.06.2020):
conda install -c conda-forge spleeter
pyAudioAnalysis (🥈25 · ⭐ 3.7K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
espnet (🥈25 · ⭐ 3.3K) - End-to-End Speech Processing Toolkit. Apache-2
python-soundfile (🥈25 · ⭐ 350) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
python_speech_features (🥉24 · ⭐ 1.8K · ➕) - This library provides common speech features for ASR.. MIT
tinytag (🥉23 · ⭐ 430 · ➕) - Read music meta data and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA.. MIT
DDSP (🥉22 · ⭐ 1.7K) - DDSP: Differentiable Digital Signal Processing. Apache-2
kapre (🥉22 · ⭐ 690 · ➕) - kapre: Keras Audio Preprocessors. MIT
Porcupine (🥉21 · ⭐ 2.3K) - On-device wake word detection powered by deep learning. Apache-2
Dejavu (🥉20 · ⭐ 5.3K · 💤) - Audio fingerprinting and recognition in Python. MIT
TTS (🥉17 · ⭐ 3K · ➕) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
Muda (🥉16 · ⭐ 170) - A library for augmenting annotated audio data. ISC
Julius (🥉12 · ⭐ 160 · 🐣) - Fast PyTorch based DSP for audio and 1D signals. MIT
Show 4 hidden projects...
SpeechRecognition (🥇30 · ⭐ 5.3K · 💀) - Speech recognition module for Python, supporting.. BSD-3
aubio (🥈26 · ⭐ 2K) - a library for audio and music analysis. ❗️GPL-3.0
Essentia (🥉23 · ⭐ 1.7K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
Madmom (🥉21 · ⭐ 700 · 💀) - Python audio and music signal processing library. BSD-3
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
geopy (🥇33 · ⭐ 3.2K) - Geocoding library for Python. MIT
GitHub (👨💻 120 · 🔀 520 · 📦 15K · 📋 240 - 10% open · ⏱️ 27.12.2020):
git clone https://github.com/geopy/geopy
PyPi (📥 3.8M / month · 📦 7.2K · ⏱️ 27.12.2020):
Conda (📥 500K · ⏱️ 27.12.2020):
conda install -c conda-forge geopy
Shapely (🥇33 · ⭐ 2.1K · ➕) - Manipulation and analysis of geometric objects. BSD-3
GitHub (👨💻 100 · 🔀 370 · 📦 15K · 📋 700 - 19% open · ⏱️ 15.12.2020):
git clone https://github.com/Toblerity/Shapely
PyPi (📥 2M / month · 📦 5.5K · ⏱️ 20.08.2020):
Conda (📥 2M · ⏱️ 13.10.2020):
conda install -c conda-forge shapely
folium (🥇32 · ⭐ 5.1K) - Python Data. Leaflet.js Maps. MIT
GitHub (👨💻 120 · 🔀 1.9K · 📦 8K · 📋 830 - 17% open · ⏱️ 04.01.2021):
git clone https://github.com/python-visualization/folium
PyPi (📥 210K / month · 📦 970 · ⏱️ 06.01.2021):
Conda (📥 310K · ⏱️ 06.01.2021):
conda install -c conda-forge folium
GeoPandas (🥈30 · ⭐ 2.4K) - Python tools for geographic data. BSD-3
GitHub (👨💻 130 · 🔀 530 · 📥 840 · 📦 6.8K · 📋 950 - 30% open · ⏱️ 10.01.2021):
git clone https://github.com/geopandas/geopandas
PyPi (📥 390K / month · 📦 1.2K · ⏱️ 24.06.2020):
Conda (📥 810K · ⏱️ 16.07.2020):
conda install -c conda-forge geopandas
Rasterio (🥈30 · ⭐ 1.4K) - Rasterio reads and writes geospatial raster datasets. BSD-3
GitHub (👨💻 110 · 🔀 390 · 📥 700 · 📦 2.6K · 📋 1.3K - 10% open · ⏱️ 12.01.2021):
git clone https://github.com/mapbox/rasterio
PyPi (📥 130K / month · 📦 850 · ⏱️ 12.01.2021):
Conda (📥 840K · ⏱️ 30.10.2020):
conda install -c conda-forge rasterio
Fiona (🥈30 · ⭐ 770) - Fiona reads and writes geographic data files. BSD-3
GitHub (👨💻 65 · 🔀 160 · 📦 4.8K · 📋 620 - 11% open · ⏱️ 30.11.2020):
git clone https://github.com/Toblerity/Fiona
PyPi (📥 450K / month · 📦 1.2K · ⏱️ 17.11.2020):
Conda (📥 1.6M · ⏱️ 17.11.2020):
conda install -c conda-forge fiona
pyproj (🥈29 · ⭐ 560) - Python interface to PROJ (cartographic projections and coordinate.. MIT
GitHub (👨💻 39 · 🔀 150 · 📦 8.1K · 📋 380 - 1% open · ⏱️ 08.01.2021):
git clone https://github.com/pyproj4/pyproj
PyPi (📥 1.1M / month · 📦 2.5K · ⏱️ 05.11.2020):
Conda (📥 1.9M · ⏱️ 06.11.2020):
conda install -c conda-forge pyproj
ipyleaflet (🥈27 · ⭐ 1.1K) - A Jupyter - Leaflet.js bridge. MIT
GitHub (👨💻 63 · 🔀 280 · 📦 660 · 📋 380 - 34% open · ⏱️ 05.01.2021):
git clone https://github.com/jupyter-widgets/ipyleaflet
PyPi (📥 13K / month · 📦 98 · ⏱️ 05.01.2021):
Conda (📥 580K · ⏱️ 05.01.2021):
conda install -c conda-forge ipyleaflet
NPM (📥 130K / month · 📦 2 · ⏱️ 05.01.2021):
npm install jupyter-leaflet
geojson (🥈27 · ⭐ 590) - Python bindings and utilities for GeoJSON. BSD-3
GitHub (👨💻 44 · 🔀 78 · 📦 5.5K · 📋 68 - 26% open · ⏱️ 25.11.2020):
git clone https://github.com/jazzband/geojson
PyPi (📥 410K / month · 📦 1.6K · ⏱️ 09.08.2019):
Conda (📥 360K · ⏱️ 11.08.2019):
conda install -c conda-forge geojson
ArcGIS API (🥉25 · ⭐ 940) - Documentation and samples for ArcGIS API for Python. Apache-2
GitHub (👨💻 60 · 🔀 680 · 📋 320 - 39% open · ⏱️ 06.01.2021):
git clone https://github.com/Esri/arcgis-python-api
PyPi (📥 19K / month · 📦 10 · ⏱️ 30.11.2020):
Docker Hub (📥 3.6K · ⭐ 29 · ⏱️ 06.03.2020):
docker pull esridocker/arcgis-api-python-notebook
PySAL (🥉24 · ⭐ 810) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
GitHub (👨💻 70 · 🔀 240 · 📋 630 - 9% open · ⏱️ 12.01.2021):
git clone https://github.com/pysal/pysal
PyPi (📥 10K / month · 📦 18 · ⏱️ 30.07.2020):
Conda (📥 400K · ⏱️ 30.07.2020):
conda install -c conda-forge pysal
GeoViews (🥉24 · ⭐ 330) - Simple, concise geographical visualization in Python. BSD-3
GitHub (👨💻 21 · 🔀 60 · 📦 190 · 📋 250 - 31% open · ⏱️ 21.09.2020):
git clone https://github.com/holoviz/geoviews
PyPi (📥 1.2K / month · 📦 10 · ⏱️ 30.03.2020):
Conda (📥 54K · ⏱️ 23.09.2020):
conda install -c conda-forge geoviews
EarthPy (🥉22 · ⭐ 220) - A package built to support working with spatial data using open source.. BSD-3
GitHub (👨💻 38 · 🔀 96 · 📦 72 · 📋 220 - 11% open · ⏱️ 03.12.2020):
git clone https://github.com/earthlab/earthpy
PyPi (📥 1.7K / month · 📦 6 · ⏱️ 18.06.2020):
Conda (📥 26K · ⏱️ 19.06.2020):
conda install -c conda-forge earthpy
pymap3d (🥉21 · ⭐ 170) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
GitHub (👨💻 8 · 🔀 55 · 📋 24 - 8% open · ⏱️ 23.12.2020):
git clone https://github.com/geospace-code/pymap3d
PyPi (📥 9K / month · 📦 3 · ⏱️ 21.09.2020):
Conda (📥 5.9K · ⏱️ 24.09.2020):
conda install -c conda-forge pymap3d
Show 7 hidden projects...
Geocoder (🥈29 · ⭐ 1.3K · 💀) - Python Geocoder. MIT
Cartopy (🥈27 · ⭐ 1.4K) - Rasterio reads and writes geospatial raster datasets. ❗️LGPL-3.0
Satpy (🥉25 · ⭐ 660) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
gmaps (🥉21 · ⭐ 700 · 💀) - Google maps for Jupyter notebooks. BSD-3
Sentinelsat (🥉21 · ⭐ 540) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
Mapbox GL (🥉20 · ⭐ 550 · 💀) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
geoplotlib (🥉19 · ⭐ 890 · 💀) - python toolbox for visualizing geographical data and making maps. MIT
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
zipline (🥇30 · ⭐ 13K) - Zipline, a Pythonic Algorithmic Trading Library. Apache-2
yfinance (🥇29 · ⭐ 3.9K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
GitHub (👨💻 27 · 🔀 990 · 📦 2.6K · 📋 470 - 63% open · ⏱️ 11.01.2021):
git clone https://github.com/ranaroussi/yfinance
PyPi (📥 100K / month · 📦 26 · ⏱️ 05.10.2020):
Conda (📥 30K · ⏱️ 27.12.2019):
conda install -c ranaroussi yfinance
Alpha Vantage (🥇27 · ⭐ 3K) - A python wrapper for Alpha Vantage API for financial data. MIT
ta (🥇27 · ⭐ 1.7K · ➕) - Technical Analysis Library using Pandas and Numpy. MIT
pyfolio (🥈25 · ⭐ 3.4K) - Portfolio and risk analytics in Python. Apache-2
GitHub (👨💻 55 · 🔀 1.1K · 📦 190 · 📋 390 - 31% open · ⏱️ 15.07.2020):
git clone https://github.com/quantopian/pyfolio
PyPi (📥 6.8K / month · 📦 54 · ⏱️ 15.04.2019):
Conda (📥 5.4K · ⏱️ 16.05.2020):
conda install -c conda-forge pyfolio
empyrical (🥈25 · ⭐ 680) - Common financial risk and performance metrics. Used by zipline and.. Apache-2
GitHub (👨💻 22 · 🔀 220 · 📦 500 · 📋 53 - 50% open · ⏱️ 14.10.2020):
git clone https://github.com/quantopian/empyrical
PyPi (📥 25K / month · 📦 220 · ⏱️ 13.10.2020):
Conda (📥 9K · ⏱️ 14.10.2020):
conda install -c conda-forge empyrical
Alphalens (🥈24 · ⭐ 1.7K · 💤) - Performance analysis of predictive (alpha) stock factors. Apache-2
GitHub (👨💻 25 · 🔀 620 · 📦 350 · 📋 180 - 20% open · ⏱️ 27.04.2020):
git clone https://github.com/quantopian/alphalens
PyPi (📥 2K / month · 📦 14 · ⏱️ 27.04.2020):
Conda (📥 10K · ⏱️ 16.05.2020):
conda install -c conda-forge alphalens
IB-insync (🥈24 · ⭐ 1.2K) - Python sync/async framework for Interactive Brokers API. BSD-2
GitHub (👨💻 25 · 🔀 380 · 📋 280 - 1% open · ⏱️ 12.01.2021):
git clone https://github.com/erdewit/ib_insync
PyPi (📥 3K / month · 📦 12 · ⏱️ 03.11.2020):
Conda (📥 7.1K · ⏱️ 07.11.2020):
conda install -c conda-forge ib-insync
ffn (🥈24 · ⭐ 740) - ffn - a financial function library for Python. MIT
Enigma Catalyst (🥉23 · ⭐ 1.9K) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
stockstats (🥉23 · ⭐ 690 · ➕) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
bt (🥉22 · ⭐ 930) - bt - flexible backtesting for Python. MIT
TensorTrade (🥉21 · ⭐ 2.2K) - An open source reinforcement learning framework for training,.. Apache-2
finmarketpy (🥉20 · ⭐ 2.4K) - Python library for backtesting trading strategies & analyzing.. Apache-2
Qlib (🥉19 · ⭐ 3.6K · 🐣) - Qlib is an AI-oriented quantitative investment platform, which aims.. MIT
tf-quant-finance (🥉19 · ⭐ 1.4K · ➕) - High-performance TensorFlow library for quantitative.. Apache-2
Crypto Signals (🥉18 · ⭐ 2.4K) - Github.com/CryptoSignal - #1 Quant Trading & Technical Analysis.. MIT
Show 6 hidden projects...
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇30 · ⭐ 12K) - Tool for producing high quality forecasts for time series data that has.. MIT
tsfresh (🥇27 · ⭐ 5.3K · 📈) - Automatic extraction of relevant features from time series:. MIT
GitHub (👨💻 71 · 🔀 840 · 📋 430 - 7% open · ⏱️ 12.01.2021):
git clone https://github.com/blue-yonder/tsfresh
PyPi (📥 170K / month · 📦 66 · ⏱️ 09.09.2020):
Conda (📥 26K · ⏱️ 10.09.2020):
conda install -c conda-forge tsfresh
tslearn (🥇27 · ⭐ 1.4K) - A machine learning toolkit dedicated to time-series data. BSD-2
GitHub (👨💻 28 · 🔀 220 · 📦 180 · 📋 220 - 23% open · ⏱️ 08.01.2021):
git clone https://github.com/tslearn-team/tslearn
PyPi (📥 67K / month · 📦 11 · ⏱️ 18.06.2020):
Conda (📥 180K · ⏱️ 19.06.2020):
conda install -c conda-forge tslearn
sktime (🥈26 · ⭐ 3.4K) - A unified framework for machine learning with time series. BSD-3
pmdarima (🥈26 · ⭐ 800 · ➕) - A statistical library designed to fill the void in Python's time.. MIT
Streamz (🥈24 · ⭐ 900) - Real-time stream processing for python. BSD-3
GitHub (👨💻 38 · 🔀 110 · 📦 180 · 📋 220 - 41% open · ⏱️ 12.01.2021):
git clone https://github.com/python-streamz/streamz
PyPi (📥 2.1K / month · 📦 16 · ⏱️ 02.11.2020):
Conda (📥 100K · ⏱️ 03.11.2020):
conda install -c conda-forge streamz
GluonTS (🥈23 · ⭐ 1.7K) - Probabilistic time series modeling in Python. Apache-2
STUMPY (🥉22 · ⭐ 1.6K) - STUMPY is a powerful and scalable Python library for computing a Matrix.. BSD-3
GitHub (👨💻 20 · 🔀 160 · 📋 200 - 13% open · ⏱️ 04.01.2021):
git clone https://github.com/TDAmeritrade/stumpy
PyPi (📥 18K / month · ⏱️ 31.12.2020):
Conda (📥 15K · ⏱️ 31.12.2020):
conda install -c conda-forge stumpy
Darts (🥉21 · ⭐ 690) - A python library for easy manipulation and forecasting of time series. Apache-2
GitHub (👨💻 23 · 🔀 84 · 📦 11 · 📋 57 - 28% open · ⏱️ 09.11.2020):
git clone https://github.com/unit8co/darts
PyPi (📥 910 / month · ⏱️ 09.11.2020):
Docker Hub (📥 75 · ⏱️ 06.10.2020):
pyts (🥉20 · ⭐ 850 · 💤) - A Python package for time series classification. BSD-3
GitHub (👨💻 7 · 🔀 92 · 📦 79 · 📋 41 - 56% open · ⏱️ 30.04.2020):
git clone https://github.com/johannfaouzi/pyts
PyPi (📥 2.1K / month · 📦 2 · ⏱️ 21.03.2020):
Conda (📥 5.8K · ⏱️ 21.03.2020):
conda install -c conda-forge pyts
matrixprofile-ts (🥉19 · ⭐ 600 · 💤) - A Python library for detecting patterns and anomalies.. Apache-2
pytorch-forecasting (🥉19 · ⭐ 590 · 🐣) - Time series forecasting with PyTorch. MIT
seglearn (🥉19 · ⭐ 410) - Python module for machine learning time series:. BSD-3
ADTK (🥉17 · ⭐ 580 · 💤) - A Python toolkit for rule-based/unsupervised anomaly detection in time.. MPL-2.0
atspy (🥉17 · ⭐ 320 · ➕) - AtsPy: Automated Time Series Models in Python (by @firmai). MIT
tick (🥉17 · ⭐ 320 · 💤) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
Show 4 hidden projects...
luminol (🥈23 · ⭐ 860 · 💀) - Anomaly Detection and Correlation library. Apache-2
PyFlux (🥉22 · ⭐ 1.8K · 💀) - Open source time series library for Python. BSD-3
pydlm (🥉20 · ⭐ 350 · 💀) - A python library for Bayesian time series modeling. BSD-3
Auto TS (🥉17 · ⭐ 140) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and ML Models on.. Apache-2
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (🥇31 · ⭐ 1.5K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
GitHub (👨💻 230 · 🔀 840 · 📦 780 · 📋 3.4K - 8% open · ⏱️ 12.01.2021):
git clone https://github.com/mne-tools/mne-python
PyPi (📥 20K / month · 📦 200 · ⏱️ 17.12.2020):
Conda (📥 96K · ⏱️ 20.12.2020):
conda install -c conda-forge mne
Nilearn (🥇30 · ⭐ 690) - Machine learning for NeuroImaging in Python. BSD-3
GitHub (👨💻 160 · 🔀 370 · 📦 850 · 📋 1.4K - 26% open · ⏱️ 11.01.2021):
git clone https://github.com/nilearn/nilearn
PyPi (📥 8.8K / month · 📦 300 · ⏱️ 12.11.2020):
Conda (📥 76K · ⏱️ 12.11.2020):
conda install -c conda-forge nilearn
Lifelines (🥈29 · ⭐ 1.5K) - Survival analysis in Python. MIT
GitHub (👨💻 90 · 🔀 410 · 📦 480 · 📋 770 - 24% open · ⏱️ 05.01.2021):
git clone https://github.com/CamDavidsonPilon/lifelines
PyPi (📥 100K / month · 📦 130 · ⏱️ 09.12.2020):
Conda (📥 120K · ⏱️ 10.12.2020):
conda install -c conda-forge lifelines
NIPYPE (🥈29 · ⭐ 540) - Workflows and interfaces for neuroimaging packages. Apache-2
GitHub (👨💻 210 · 🔀 440 · 📦 480 · 📋 1.2K - 26% open · ⏱️ 08.12.2020):
git clone https://github.com/nipy/nipype
PyPi (📥 11K / month · 📦 190 · ⏱️ 16.08.2020):
Conda (📥 350K · ⏱️ 28.11.2020):
conda install -c conda-forge nipype
NiBabel (🥈29 · ⭐ 380) - Python package to access a cacophony of neuro-imaging file formats. MIT
GitHub (👨💻 89 · 🔀 210 · 📦 3.5K · 📋 390 - 27% open · ⏱️ 05.01.2021):
git clone https://github.com/nipy/nibabel
PyPi (📥 46K / month · 📦 1.3K · ⏱️ 28.11.2020):
Conda (📥 290K · ⏱️ 29.11.2020):
conda install -c conda-forge nibabel
DIPY (🥈28 · ⭐ 380) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
GitHub (👨💻 120 · 🔀 310 · 📦 320 · 📋 730 - 21% open · ⏱️ 10.12.2020):
git clone https://github.com/dipy/dipy
PyPi (📥 6.6K / month · 📦 94 · ⏱️ 05.11.2020):
Conda (📥 170K · ⏱️ 14.11.2020):
conda install -c conda-forge dipy
Hail (🥈23 · ⭐ 690) - Scalable genomic data analysis. MIT
MONAI (🥈22 · ⭐ 1.7K) - AI Toolkit for Healthcare Imaging. Apache-2
NiftyNet (🥈22 · ⭐ 1.3K · 💤) - [unmaintained] An open-source convolutional neural.. Apache-2
DeepVariant (🥉21 · ⭐ 2.2K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
Glow (🥉19 · ⭐ 150) - An open-source toolkit for large-scale genomic analysis. Apache-2
Brainiak (🥉18 · ⭐ 230) - Brain Imaging Analysis Kit. Apache-2
GitHub (👨💻 32 · 🔀 100 · 📦 11 · 📋 180 - 35% open · ⏱️ 15.10.2020):
git clone https://github.com/brainiak/brainiak
PyPi (📥 86 / month · 📦 1 · ⏱️ 15.10.2020):
Docker Hub (📥 460 · ⭐ 1 · ⏱️ 15.10.2020):
docker pull brainiak/brainiak
Medical Detection Toolkit (🥉12 · ⭐ 890 · 💤) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
MedicalNet (🥉11 · ⭐ 1K) - Many studies have shown that the performance on deep learning is.. MIT
Show 5 hidden projects...
NIPY (🥉21 · ⭐ 290) - Neuroimaging in Python FMRI analysis package. ❗️DSDP
MedPy (🥉20 · ⭐ 310 · 💤) - Medical image processing in Python. ❗️GPL-3.0
DLTK (🥉19 · ⭐ 1.2K · 💀) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
MedicalTorch (🥉15 · ⭐ 700 · 💀) - A medical imaging framework for Pytorch. Apache-2
DeepNeuro (🥉14 · ⭐ 98 · 💤) - A deep learning python package for neuroimaging data. Made by:. MIT
Optical Character Recognition
Libraries for optical character recognition (OCR) and text extraction from images or videos.
Tesseract (🥇30 · ⭐ 3.4K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
GitHub (👨💻 36 · 🔀 500 · 📋 240 - 2% open · ⏱️ 04.01.2021):
git clone https://github.com/madmaze/pytesseract
PyPi (📥 440K / month · 📦 1.4K · ⏱️ 15.12.2020):
Conda (📥 38K · ⏱️ 20.11.2020):
conda install -c conda-forge pytesseract
EasyOCR (🥈27 · ⭐ 9.9K) - Ready-to-use OCR with 80+ supported languages and all popular.. Apache-2
OCRmyPDF (🥈27 · ⭐ 3.7K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
tesserocr (🥈26 · ⭐ 1.4K) - A Python wrapper for the tesseract-ocr API. MIT
GitHub (👨💻 23 · 🔀 180 · 📦 440 · 📋 200 - 29% open · ⏱️ 17.11.2020):
git clone https://github.com/sirfz/tesserocr
PyPi (📥 26K / month · 📦 50 · ⏱️ 17.03.2020):
Conda (📥 31K · ⏱️ 14.10.2020):
conda install -c conda-forge tesserocr
PaddleOCR (🥉24 · ⭐ 8.3K) - Awesome multilingual OCR toolkits based on PaddlePaddle.. Apache-2
attention-ocr (🥉20 · ⭐ 820) - A Tensorflow model for text recognition (CNN + seq2seq with.. MIT
keras-ocr (🥉20 · ⭐ 740) - A packaged and flexible version of the CRAFT text detector and.. MIT
doc2text (🥉19 · ⭐ 1.2K) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python module.. MIT
calamari (🥉18 · ⭐ 760) - Line based ATR Engine based on OCRopy. Apache-2
Show 1 hidden projects...
Data Containers & Structures
General-purpose data containers & structures as well as utilities & extensions for pandas.
pandas (🥇43 · ⭐ 28K) - Flexible and powerful data analysis / manipulation library for.. BSD-3
GitHub (👨💻 2.6K · 🔀 12K · 📥 73K · 📦 360K · 📋 20K - 17% open · ⏱️ 12.01.2021):
git clone https://github.com/pandas-dev/pandas
PyPi (📥 24M / month · 📦 77K · ⏱️ 26.12.2020):
Conda (📥 13M · ⏱️ 27.12.2020):
conda install -c conda-forge pandas
numpy (🥇42 · ⭐ 16K) - The fundamental package for scientific computing with Python. BSD-3
GitHub (👨💻 1.2K · 🔀 5.2K · 📥 290K · 📦 600K · 📋 9.5K - 23% open · ⏱️ 11.01.2021):
git clone https://github.com/numpy/numpy
PyPi (📥 39M / month · 📦 170K · ⏱️ 05.01.2021):
Conda (📥 15M · ⏱️ 11.01.2021):
conda install -c conda-forge numpy
h5py (🥇36 · ⭐ 1.5K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5.. BSD-3
GitHub (👨💻 160 · 🔀 390 · 📥 770 · 📦 98K · 📋 1.1K - 16% open · ⏱️ 11.01.2021):
git clone https://github.com/h5py/h5py
PyPi (📥 5.3M / month · 📦 23K · ⏱️ 06.11.2020):
Conda (📥 4.4M · ⏱️ 01.01.2021):
conda install -c conda-forge h5py
Arrow (🥈35 · ⭐ 6.9K) - Apache Arrow is a cross-language development platform for in-memory.. Apache-2
GitHub (👨💻 610 · 🔀 1.7K · 📦 29 · 📋 670 - 19% open · ⏱️ 12.01.2021):
git clone https://github.com/apache/arrow
PyPi (📥 14M / month · 📦 990 · ⏱️ 19.10.2020):
Conda (📥 450K · ⏱️ 12.01.2021):
conda install -c conda-forge arrow
xarray (🥈32 · ⭐ 1.9K) - N-D labeled arrays and datasets in Python. Apache-2
GitHub (👨💻 290 · 🔀 610 · 📦 4.8K · 📋 2.6K - 29% open · ⏱️ 12.01.2021):
git clone https://github.com/pydata/xarray
PyPi (📥 270K / month · 📦 1.1K · ⏱️ 30.11.2020):
Conda (📥 2.2M · ⏱️ 01.12.2020):
conda install -c conda-forge xarray
numexpr (🥈31 · ⭐ 1.5K) - Fast numerical array expression evaluator for Python, NumPy, PyTables,.. MIT
GitHub (👨💻 55 · 🔀 160 · 📋 300 - 17% open · ⏱️ 04.01.2021):
git clone https://github.com/pydata/numexpr
PyPi (📥 700K / month · 📦 5.5K · ⏱️ 05.01.2020):
Conda (📥 2M · ⏱️ 08.01.2021):
conda install -c conda-forge numexpr
Modin (🥈29 · ⭐ 5.6K) - Modin: Speed up your Pandas workflows by changing a single line of.. Apache-2
TinyDB (🥈29 · ⭐ 3.9K) - TinyDB is a lightweight document oriented database optimized for your.. MIT
GitHub (👨💻 62 · 🔀 350 · 📋 250 - 5% open · ⏱️ 04.01.2021):
git clone https://github.com/msiemens/tinydb
PyPi (📥 130K / month · 📦 1.1K · ⏱️ 14.11.2020):
Conda (📥 99K · ⏱️ 14.11.2020):
conda install -c conda-forge tinydb
Koalas (🥈29 · ⭐ 2.6K) - Koalas: pandas API on Apache Spark. Apache-2
GitHub (👨💻 47 · 🔀 290 · 📥 1K · 📦 70 · 📋 500 - 16% open · ⏱️ 12.01.2021):
git clone https://github.com/databricks/koalas
PyPi (📥 860K / month · 📦 1 · ⏱️ 11.12.2020):
Conda (📥 74K · ⏱️ 11.12.2020):
conda install -c conda-forge koalas
PyTables (🥈28 · ⭐ 1K) - A Python package to manage extremely large amounts of data. BSD-3
GitHub (👨💻 96 · 🔀 180 · 📥 120 · 📋 590 - 25% open · ⏱️ 10.01.2021):
git clone https://github.com/PyTables/PyTables
PyPi (📥 380K / month · 📦 3.8K · ⏱️ 30.10.2019):
Conda (📥 2.1M · ⏱️ 09.01.2021):
conda install -c conda-forge pytables
Bottleneck (🥈28 · ⭐ 570) - Fast NumPy array functions written in C. BSD-2
GitHub (👨💻 20 · 🔀 62 · 📦 18K · 📋 200 - 11% open · ⏱️ 25.11.2020):
git clone https://github.com/pydata/bottleneck
PyPi (📥 200K / month · 📦 2.9K · ⏱️ 21.02.2020):
Conda (📥 1.4M · ⏱️ 12.10.2020):
conda install -c conda-forge bottleneck
datasketch (🥉27 · ⭐ 1.4K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog,.. MIT
zarr (🥉26 · ⭐ 620) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
GitHub (👨💻 35 · 🔀 110 · 📦 510 · 📋 410 - 43% open · ⏱️ 04.01.2021):
git clone https://github.com/zarr-developers/zarr-python
PyPi (📥 18K / month · 📦 72 · ⏱️ 02.12.2020):
Conda (📥 520K · ⏱️ 03.12.2020):
conda install -c conda-forge zarr
swifter (🥉25 · ⭐ 1.5K) - A package which efficiently applies any function to a pandas.. MIT
GitHub (👨💻 14 · 🔀 69 · 📦 270 · 📋 90 - 15% open · ⏱️ 19.12.2020):
git clone https://github.com/jmcarpenter2/swifter
PyPi (📥 51K / month · 📦 16 · ⏱️ 11.10.2020):
Conda (📥 73K · ⏱️ 20.09.2020):
conda install -c conda-forge swifter
bcolz (🥉25 · ⭐ 910 · ➕) - A columnar data container that can be compressed. BSD-3
GitHub (👨💻 33 · 🔀 120 · 📦 1.5K · 📋 250 - 51% open · ⏱️ 10.09.2020):
git clone https://github.com/Blosc/bcolz
PyPi (📥 20K / month · 📦 970 · ⏱️ 13.04.2018):
Conda (📥 200K · ⏱️ 05.11.2019):
conda install -c conda-forge bcolz
Pandaral·lel (🥉23 · ⭐ 1.3K) - A simple and efficient tool to parallelize Pandas.. BSD-3
Vaex (🥉22 · ⭐ 5.6K) - Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data.. MIT
GitHub (👨💻 35 · 🔀 430 · 📥 200 · 📋 650 - 40% open · ⏱️ 08.01.2021):
git clone https://github.com/vaexio/vaex
PyPi (📥 3.5K / month · 📦 2 · ⏱️ 08.12.2020):
Conda (📥 94K · ⏱️ 01.06.2020):
conda install -c conda-forge vaex
datatable (🥉20 · ⭐ 1.1K) - A Python package for manipulating 2-dimensional tabular data.. MPL-2.0
fletcher (🥉20 · ⭐ 210) - Pandas ExtensionDType/Array backed by Apache Arrow. MIT
GitHub (👨💻 23 · 🔀 32 · 📥 12 · 📦 3 · 📋 72 - 45% open · ⏱️ 29.12.2020):
git clone https://github.com/xhochy/fletcher
PyPi (📥 310 / month · ⏱️ 07.12.2020):
Conda (📥 18K · ⏱️ 29.12.2020):
conda install -c conda-forge fletcher
StaticFrame (🥉20 · ⭐ 200) - The StaticFrame library defines the Series and Frame, immutable data.. MIT
GitHub (👨💻 14 · 🔀 20 · 📦 5 · 📋 270 - 12% open · ⏱️ 12.01.2021):
git clone https://github.com/InvestmentSystems/static-frame
PyPi (📥 1.1K / month · ⏱️ 12.01.2021):
Conda (📥 60K · ⏱️ 12.01.2021):
conda install -c conda-forge static-frame
Bounter (🥉18 · ⭐ 890) - Efficient Counter that uses a limited (bounded) amount of memory.. MIT
PandaPy (🥉15 · ⭐ 470) - PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x.. MIT
Show 6 hidden projects...
Blaze (🥈28 · ⭐ 2.9K · 💀) - NumPy and Pandas interface to Big Data. BSD-3
sklearn-pandas (🥈28 · ⭐ 2.3K) - Pandas integration with sklearn. ❗️Zlib
Arctic (🥉24 · ⭐ 2.1K) - Arctic is a high performance datastore for numeric data. ❗️LGPL-2.1
pandasql (🥉22 · ⭐ 930 · 💀) - sqldf for pandas. MIT
pickleDB (🥉21 · ⭐ 530 · 💀) - pickleDB is an open source key-value store using Python's json.. BSD-3
Pandas Summary (🥉21 · ⭐ 360 · 💀) - An extension to pandas dataframes describe function. MIT
Data Loading & Extraction
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
Faker (🥇36 · ⭐ 12K) - Faker is a Python package that generates fake data for you. MIT
GitHub (👨💻 380 · 🔀 1.3K · 📦 22K · 📋 480 - 28% open · ⏱️ 12.01.2021):
git clone https://github.com/joke2k/faker
PyPi (📥 1.9M / month · 📦 4.7K · ⏱️ 12.01.2021):
Conda (📥 360K · ⏱️ 11.01.2021):
conda install -c conda-forge faker
xlrd (🥇33 · ⭐ 1.8K) - Please use openpyxl where you can... BSD-3
GitHub (👨💻 50 · 🔀 410 · 📦 61K · ⏱️ 12.12.2020):
git clone https://github.com/python-excel/xlrd
PyPi (📥 5.3M / month · 📦 14K · ⏱️ 11.12.2020):
Conda (📥 1.1M · ⏱️ 09.01.2021):
conda install -c conda-forge xlrd
xmltodict (🥇32 · ⭐ 4.2K · 💤) - Python module that makes working with XML feel like you are.. MIT
GitHub (👨💻 41 · 🔀 400 · 📦 20K · 📋 200 - 31% open · ⏱️ 26.04.2020):
git clone https://github.com/martinblech/xmltodict
PyPi (📥 3.9M / month · 📦 8.2K · ⏱️ 11.02.2019):
Conda (📥 600K · ⏱️ 11.02.2019):
conda install -c conda-forge xmltodict
Tablib (🥇32 · ⭐ 3.8K) - Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. MIT
GitHub (👨💻 110 · 🔀 560 · 📦 9K · 📋 230 - 14% open · ⏱️ 11.12.2020):
git clone https://github.com/jazzband/tablib
PyPi (📥 550K / month · 📦 2.4K · ⏱️ 05.12.2020):
Conda (📥 56K · ⏱️ 05.12.2020):
conda install -c conda-forge tablib
TensorFlow Datasets (🥇32 · ⭐ 2.6K) - TFDS is a collection of datasets ready to use with.. Apache-2
smart-open (🥈30 · ⭐ 1.9K) - Utils for streaming large files (S3, HDFS, gzip, bz2...). MIT
python-magic (🥈30 · ⭐ 1.8K) - A python wrapper for libmagic. MIT
GitHub (👨💻 47 · 🔀 210 · 📦 11K · 📋 150 - 18% open · ⏱️ 11.12.2020):
git clone https://github.com/ahupp/python-magic
PyPi (📥 1.4M / month · 📦 5.1K · ⏱️ 06.05.2020):
Conda (📥 74K · ⏱️ 24.12.2020):
conda install -c conda-forge python-magic
Datasets (🥈29 · ⭐ 6.4K) - The largest hub of ready-to-use NLP datasets for ML models with.. Apache-2
pandas-datareader (🥈29 · ⭐ 1.8K) - Extract data from a wide range of Internet sources.. BSD-3
GitHub (👨💻 77 · 🔀 490 · 📦 7.4K · 📋 440 - 15% open · ⏱️ 31.12.2020):
git clone https://github.com/pydata/pandas-datareader
PyPi (📥 120K / month · 📦 1.4K · ⏱️ 10.07.2020):
pip install pandas-datareader
Conda (📥 85K · ⏱️ 20.11.2019):
conda install -c conda-forge pandas-datareader
csvkit (🥉27 · ⭐ 4.4K) - A suite of utilities for converting to and working with CSV, the king of.. MIT
GitHub (👨💻 91 · 🔀 540 · 📦 800 · 📋 800 - 7% open · ⏱️ 30.10.2020):
git clone https://github.com/wireservice/csvkit
PyPi (📥 26K / month · 📦 700 · ⏱️ 03.03.2020):
Conda (📥 45K · ⏱️ 28.05.2019):
conda install -c conda-forge csvkit
snorkel (🥉27 · ⭐ 4.4K) - A system for quickly generating training data with weak supervision. Apache-2
GitHub (👨💻 62 · 🔀 710 · 📥 490 · 📦 63 · 📋 940 - 3% open · ⏱️ 05.09.2020):
git clone https://github.com/snorkel-team/snorkel
PyPi (📥 68K / month · 📦 4 · ⏱️ 07.04.2020):
Conda (📥 15K · ⏱️ 10.04.2020):
conda install -c conda-forge snorkel
PDFMiner (🥉26 · ⭐ 4.5K · 💤) - Python PDF Parser (Not actively maintained). Check out pdfminer.six. MIT
GitHub (👨💻 28 · 🔀 1K · 📦 2K · 📋 260 - 85% open · ⏱️ 18.01.2020):
git clone https://github.com/euske/pdfminer
PyPi (📥 170K / month · 📦 1.5K · ⏱️ 25.11.2019):
Conda (📥 13K · ⏱️ 03.11.2019):
conda install -c conda-forge pdfminer
tabulator-py (🥉26 · ⭐ 200 · ➕) - Python library for reading and writing tabular data via streams. MIT
GitHub (👨💻 24 · 🔀 40 · 📦 450 · ⏱️ 30.11.2020):
git clone https://github.com/frictionlessdata/tabulator-py
PyPi (📥 94K / month · 📦 100 · ⏱️ 30.11.2020):
Conda (📥 37K · ⏱️ 24.07.2018):
conda install -c conda-forge tabulator-py
Intake (🥉24 · ⭐ 520) - Intake is a lightweight package for finding, investigating, loading and.. BSD-2
GitHub (👨💻 51 · 🔀 91 · 📦 220 · 📋 260 - 28% open · ⏱️ 16.12.2020):
git clone https://github.com/intake/intake
PyPi (📥 2.6K / month · 📦 74 · ⏱️ 03.06.2020):
Conda (📥 55K · ⏱️ 03.06.2020):
conda install -c conda-forge intake
SDV (🥉21 · ⭐ 300) - Synthetic Data Generation for tabular, relational and time series data. MIT
datatest (🥉21 · ⭐ 230 · ➕) - Tools for test driven data-wrangling and data validation. Apache-2
Show 6 hidden projects...
textract (🥉26 · ⭐ 2.9K · 💀) - extract text from any document. no muss. no fuss. MIT
Singer (🥉24 · ⭐ 660) - Standard for moving data between databases, web APIs, files, queues,.. ❗️AGPL-3.0
Camelot (🥉23 · ⭐ 3K · 💀) - Camelot: PDF Table Extraction for Humans. MIT
messytables (🥉23 · ⭐ 360 · 💀) - Tools for parsing messy tabular data. This is now superseded by.. MIT
pyexcel-xlsx (🥉23 · ⭐ 83) - A wrapper library to read, manipulate and write data in xlsx and.. BSD-3
rows (🥉20 · ⭐ 730 · ➕) - A common, beautiful interface to tabular data, no matter the format. ❗️LGPL-3.0
Libraries for web scraping, crawling, downloading, and mining as well as libraries.
🔗 best-of-web-python - Web Scraping ( ⭐ 2) - Collection of web-scraping and crawling libraries.
Data Pipelines & Streaming
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
Celery (🥇37 · ⭐ 17K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
GitHub (👨💻 1.1K · 🔀 3.9K · 📦 47K · 📋 4.4K - 11% open · ⏱️ 12.01.2021):
git clone https://github.com/celery/celery
PyPi (📥 2M / month · 📦 28K · ⏱️ 16.12.2020):
Conda (📥 360K · ⏱️ 11.11.2020):
conda install -c conda-forge celery
Airflow (🥇35 · ⭐ 20K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
GitHub (👨💻 1.7K · 🔀 7.8K · 📥 75K · 📋 2.8K - 35% open · ⏱️ 12.01.2021):
git clone https://github.com/apache/airflow
PyPi (📥 550K / month · 📦 290 · ⏱️ 14.12.2020):
pip install apache-airflow
Conda (📥 260K · ⏱️ 26.11.2020):
conda install -c conda-forge airflow
Docker Hub (📥 4M · ⭐ 200 · ⏱️ 12.01.2021):
docker pull apache/airflow
joblib (🥇35 · ⭐ 2.3K · ➕) - Computing with Python functions. BSD-3
GitHub (👨💻 98 · 🔀 290 · 📦 76K · 📋 640 - 45% open · ⏱️ 28.12.2020):
git clone https://github.com/joblib/joblib
PyPi (📥 16M / month · 📦 6.6K · ⏱️ 14.12.2020):
Conda (📥 2.8M · ⏱️ 14.12.2020):
conda install -c conda-forge joblib
luigi (🥇33 · ⭐ 14K) - Luigi is a Python module that helps you build complex pipelines of batch.. Apache-2
GitHub (👨💻 550 · 🔀 2.2K · 📦 1.3K · 📋 900 - 7% open · ⏱️ 29.12.2020):
git clone https://github.com/spotify/luigi
PyPi (📥 360K / month · 📦 680 · ⏱️ 23.09.2020):
Conda (📥 6.2K · ⏱️ 21.07.2020):
conda install -c anaconda luigi
rq (🥇33 · ⭐ 7.5K · ➕) - Simple job queues for Python. BSD-3
GitHub (👨💻 220 · 🔀 1.2K · 📦 7.1K · 📋 810 - 17% open · ⏱️ 09.01.2021):
git clone https://github.com/rq/rq
PyPi (📥 230K / month · 📦 3.3K · ⏱️ 29.11.2020):
Conda (📥 43K · ⏱️ 29.11.2020):
conda install -c conda-forge rq
Beam (🥈32 · ⭐ 4.5K) - Unified programming model to define and execute data processing.. Apache-2
Prefect (🥈30 · ⭐ 5.7K) - The easiest way to automate your data. Apache-2
GitHub (👨💻 170 · 🔀 480 · 📦 230 · 📋 1.5K - 14% open · ⏱️ 12.01.2021):
git clone https://github.com/PrefectHQ/prefect
PyPi (📥 35K / month · 📦 2 · ⏱️ 06.01.2021):
Conda (📥 65K · ⏱️ 06.01.2021):
conda install -c conda-forge prefect
mrjob (🥈30 · ⭐ 2.5K) - Run MapReduce jobs on Hadoop or Amazon Web Services. Apache-2
GitHub (👨💻 140 · 🔀 590 · 📦 650 · 📋 1.3K - 15% open · ⏱️ 16.11.2020):
git clone https://github.com/Yelp/mrjob
PyPi (📥 110K / month · 📦 110 · ⏱️ 17.09.2020):
Conda (📥 310K · ⏱️ 24.12.2020):
conda install -c conda-forge mrjob
Kedro (🥈28 · ⭐ 3.3K) - A Python framework for creating reproducible, maintainable and modular.. Apache-2
dbt (🥈28 · ⭐ 2.3K) - dbt (data build tool) enables data analysts and engineers to transform.. Apache-2
GitHub (👨💻 130 · 🔀 470 · 📦 170 · 📋 1.7K - 15% open · ⏱️ 12.01.2021):
git clone https://github.com/fishtown-analytics/dbt
PyPi (📥 95K / month · 📦 18 · ⏱️ 29.12.2020):
Conda (📥 150K · ⏱️ 14.05.2020):
conda install -c conda-forge dbt
faust (🥈27 · ⭐ 5.2K · 📉) - Python Stream Processing. BSD-3
petl (🥈27 · ⭐ 810) - Python Extract Transform and Load Tables of Data. MIT
GitHub (👨💻 43 · 🔀 150 · 📦 310 · 📋 410 - 16% open · ⏱️ 29.12.2020):
git clone https://github.com/petl-developers/petl
PyPi (📥 13K / month · 📦 110 · ⏱️ 06.10.2020):
Conda (📥 19K · ⏱️ 29.12.2020):
conda install -c conda-forge petl
Dagster (🥈26 · ⭐ 2.5K) - A data orchestrator for machine learning, analytics, and ETL. Apache-2
GitHub (👨💻 110 · 🔀 260 · 📦 130 · 📋 2.4K - 25% open · ⏱️ 12.01.2021):
git clone https://github.com/dagster-io/dagster
PyPi (📥 26K / month · 📦 4 · ⏱️ 11.12.2020):
Conda (📥 120K · ⏱️ 04.12.2020):
conda install -c conda-forge dagster
PyFunctional (🥈26 · ⭐ 1.8K) - Python library for creating data pipelines with chain functional.. MIT
TFX (🥈26 · ⭐ 1.3K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
streamparse (🥉25 · ⭐ 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI.. Apache-2
Great Expectations (🥉23 · ⭐ 3.4K) - Always know what to expect from your data. Apache-2
bonobo (🥉23 · ⭐ 1.4K) - Extract Transform Load for Python 3.5+. Apache-2
Optimus (🥉23 · ⭐ 960) - Agile Data Preparation Workflows madeeasy with dask, cudf,.. Apache-2
pysparkling (🥉23 · ⭐ 230) - A pure Python implementation of Apache Spark's RDD and DStream.. MIT
Pypeline (🥉21 · ⭐ 1.2K) - Concurrent data pipelines in Python . MIT
dpark (🥉20 · ⭐ 2.6K · ➕) - Python clone of Spark, a MapReduce alike framework in Python. BSD-3
mrq (🥉20 · ⭐ 830 · ➕) - Mr. Queue - A distributed worker task queue in Python using Redis & gevent. MIT
pdpipe (🥉20 · ⭐ 580) - Easy pipelines for pandas DataFrames. MIT
riko (🥉19 · ⭐ 1.6K) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
TaskTiger (🥉19 · ⭐ 1K) - Python task queue using Redis. MIT
Databolt Flow (🥉19 · ⭐ 890) - Python library for building highly effective data science workflows. MIT
spark-deep-learning (🥉18 · ⭐ 1.8K · ➕) - Deep Learning Pipelines for Apache Spark. Apache-2
flupy (🥉18 · ⭐ 150 · ➕) - Fluent data pipelines for python and your shell. MIT
Mara Pipelines (🥉17 · ⭐ 1.6K) - A lightweight opinionated ETL framework, halfway between plain.. MIT
BatchFlow (🥉17 · ⭐ 150) - BatchFlow helps you conveniently work with random or sequential.. Apache-2
zenml (🥉15 · ⭐ 370 · 🐣) - ZenML: Bring Zen to your ML with reproducible pipelines. Apache-2
Show 2 hidden projects...
ploomber (🥉18 · ⭐ 110 · ➕) - A convention over configuration workflow orchestrator. Develop.. Apache-2
Botflow (🥉15 · ⭐ 1.2K · 💀) - Python Fast Dataflow programming framework for Data pipeline work(.. BSD-3
Distributed Machine Learning
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
dask (🥇35 · ⭐ 7.7K) - Parallel computing with task scheduling. BSD-3
GitHub (👨💻 420 · 🔀 1.2K · 📦 24K · 📋 3.5K - 20% open · ⏱️ 11.01.2021):
git clone https://github.com/dask/dask
PyPi (📥 1.2M / month · 📦 3.9K · ⏱️ 11.12.2020):
Conda (📥 2.8M · ⏱️ 11.12.2020):
conda install -c conda-forge dask
dask.distributed (🥇34 · ⭐ 1.1K · ➕) - A distributed task scheduler for Dask. BSD-3
GitHub (👨💻 230 · 🔀 500 · 📦 16K · 📋 2K - 35% open · ⏱️ 12.01.2021):
git clone https://github.com/dask/distributed
PyPi (📥 690K / month · 📦 1.8K · ⏱️ 11.12.2020):
Conda (📥 3.5M · ⏱️ 11.12.2020):
conda install -c conda-forge distributed
Ray (🥇33 · ⭐ 14K) - An open source framework that provides a simple, universal API for.. Apache-2
horovod (🥈30 · ⭐ 11K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
ipyparallel (🥈29 · ⭐ 1.8K) - Interactive Parallel Computing in Python. BSD-3
GitHub (👨💻 94 · 🔀 720 · 📦 1.4K · 📋 250 - 56% open · ⏱️ 24.08.2020):
git clone https://github.com/ipython/ipyparallel
PyPi (📥 72K / month · 📦 490 · ⏱️ 05.05.2020):
Conda (📥 360K · ⏱️ 14.12.2020):
conda install -c conda-forge ipyparallel
dask-ml (🥈26 · ⭐ 680) - Scalable Machine Learning with Dask. BSD-3
GitHub (👨💻 61 · 🔀 180 · 📦 320 · 📋 370 - 46% open · ⏱️ 05.01.2021):
git clone https://github.com/dask/dask-ml
PyPi (📥 35K / month · 📦 42 · ⏱️ 24.09.2020):
Conda (📥 190K · ⏱️ 24.09.2020):
conda install -c conda-forge dask-ml
Mesh (🥈26 · ⭐ 620) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
TensorFlowOnSpark (🥈25 · ⭐ 3.6K · 📈) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
mpi4py (🥈25 · ⭐ 370) - Python bindings for MPI. BSD-3
GitHub (👨💻 13 · 🔀 61 · 📥 640 · 📋 4 - 25% open · ⏱️ 12.01.2021):
git clone https://github.com/mpi4py/mpi4py
PyPi (📥 140K / month · 📦 700 · ⏱️ 04.11.2019):
Conda (📥 500K · ⏱️ 08.01.2021):
conda install -c conda-forge mpi4py
BigDL (🥈24 · ⭐ 3.7K) - BigDL: Distributed Deep Learning Framework for Apache Spark. Apache-2
petastorm (🥈24 · ⭐ 1K · ➕) - Petastorm library enables single machine or distributed.. Apache-2
DeepSpeed (🥉23 · ⭐ 3.9K) - DeepSpeed is a deep learning optimization library that makes.. MIT
GitHub (👨💻 37 · 🔀 350 · 📦 8 · 📋 250 - 40% open · ⏱️ 12.01.2021):
git clone https://github.com/microsoft/DeepSpeed
PyPi (📥 1.4K / month · ⏱️ 08.01.2021):
Docker Hub (📥 6.6K · ⭐ 2 · ⏱️ 20.11.2020):
docker pull deepspeed/deepspeed
MMLSpark (🥉23 · ⭐ 2.2K) - Microsoft Machine Learning for Apache Spark. MIT
Elephas (🥉23 · ⭐ 1.4K) - Distributed Deep learning with Keras & Spark. MIT
keras
analytics-zoo (🥉22 · ⭐ 2.2K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
Submit it (🥉20 · ⭐ 290) - Python 3.6+ toolbox for submitting jobs to Slurm. MIT
GitHub (👨💻 8 · 🔀 17 · 📦 73 · 📋 23 - 39% open · ⏱️ 07.01.2021):
git clone https://github.com/facebookincubator/submitit
PyPi (📥 2.2K / month · ⏱️ 01.12.2020):
Conda (📥 750 · ⏱️ 19.11.2020):
conda install -c conda-forge submitit
BytePS (🥉19 · ⭐ 2.6K) - A high performance and generic framework for distributed DNN training. Apache-2
GitHub (👨💻 16 · 🔀 360 · 📋 210 - 35% open · ⏱️ 10.01.2021):
git clone https://github.com/bytedance/byteps
PyPi (📥 66 / month · ⏱️ 04.11.2020):
Docker Hub (📥 950 · ⏱️ 03.03.2020):
docker pull bytepsimage/tensorflow
Apache Singa (🥉19 · ⭐ 2.2K) - a distributed deep learning platform. Apache-2
GitHub (👨💻 70 · 🔀 570 · 📋 81 - 60% open · ⏱️ 11.12.2020):
git clone https://github.com/apache/singa
Conda (📥 240 · ⏱️ 12.01.2021):
conda install -c nusdbsystem singa
Docker Hub (📥 160 · ⭐ 2 · ⏱️ 04.06.2019):
FairScale (🥉18 · ⭐ 630 · 🐣) - PyTorch extensions for high performance and large scale.. BSD-3
sk-dist (🥉18 · ⭐ 250) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
Fiber (🥉17 · ⭐ 830) - Distributed Computing for AI Made Simple. Apache-2
somoclu (🥉17 · ⭐ 220 · ➕) - Massively parallel self-organizing maps: accelerate training on.. MIT
GitHub (👨💻 17 · 🔀 55 · 📥 1.4K · 📋 130 - 19% open · ⏱️ 24.07.2020):
git clone https://github.com/peterwittek/somoclu
PyPi (📥 1.3K / month · 📦 2 · ⏱️ 25.04.2020):
Conda (📥 39K · ⏱️ 13.10.2020):
conda install -c conda-forge somoclu
Hivemind (🥉16 · ⭐ 640) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
Show 3 hidden projects...
Hyperparameter Optimization & AutoML
Libraries for hyperparameter optimization, automl and neural architecture search.
Optuna (🥇31 · ⭐ 3.9K) - A hyperparameter optimization framework. MIT
GitHub (👨💻 120 · 🔀 440 · 📦 850 · 📋 640 - 27% open · ⏱️ 12.01.2021):
git clone https://github.com/optuna/optuna
PyPi (📥 160K / month · 📦 52 · ⏱️ 04.11.2020):
Conda (📥 16K · ⏱️ 11.11.2020):
conda install -c conda-forge optuna
scikit-optimize (🥇31 · ⭐ 2K) - Sequential model-based optimization with a `scipy.optimize`.. BSD-3
GitHub (👨💻 68 · 🔀 370 · 📦 1.3K · 📋 520 - 31% open · ⏱️ 31.12.2020):
git clone https://github.com/scikit-optimize/scikit-optimize
PyPi (📥 480K / month · 📦 160 · ⏱️ 04.09.2020):
pip install scikit-optimize
Conda (📥 180K · ⏱️ 04.09.2020):
conda install -c conda-forge scikit-optimize
Hyperopt (🥇30 · ⭐ 5.3K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
GitHub (👨💻 84 · 🔀 840 · 📦 2.5K · 📋 550 - 60% open · ⏱️ 24.12.2020):
git clone https://github.com/hyperopt/hyperopt
PyPi (📥 400K / month · 📦 500 · ⏱️ 07.10.2020):
Conda (📥 150K · ⏱️ 14.10.2020):
conda install -c conda-forge hyperopt
featuretools (🥇28 · ⭐ 5.3K) - An open source python library for automated feature engineering. BSD-3
GitHub (👨💻 49 · 🔀 690 · 📦 650 · 📋 500 - 22% open · ⏱️ 12.01.2021):
git clone https://github.com/alteryx/featuretools
PyPi (📥 64K / month · 📦 70 · ⏱️ 31.12.2020):
Conda (📥 43K · ⏱️ 05.01.2021):
conda install -c conda-forge featuretools
Keras Tuner (🥇28 · ⭐ 2.2K) - Hyperparameter tuning for humans. Apache-2
NNI (🥈27 · ⭐ 8.7K) - An open source AutoML toolkit for automate machine learning lifecycle,.. MIT
AutoKeras (🥈27 · ⭐ 7.7K) - AutoML library for deep learning. Apache-2
auto-sklearn (🥈26 · ⭐ 5.1K) - Automated Machine Learning with scikit-learn. BSD-3
Bayesian Optimization (🥈26 · ⭐ 4.8K) - A Python implementation of global optimization with.. MIT
AutoGluon (🥈26 · ⭐ 2.9K) - AutoGluon: AutoML for Text, Image, and Tabular Data. Apache-2
BoTorch (🥈26 · ⭐ 1.8K) - Bayesian optimization in PyTorch. MIT
SMAC3 (🥈26 · ⭐ 550) - Sequential Model-based Algorithm Configuration. BSD-3
nevergrad (🥈25 · ⭐ 2.8K · 📉) - A Python toolbox for performing gradient-free optimization. MIT
GitHub (👨💻 42 · 🔀 270 · 📦 110 · 📋 190 - 39% open · ⏱️ 07.01.2021):
git clone https://github.com/facebookresearch/nevergrad
PyPi (📥 7.4K / month · 📦 14 · ⏱️ 10.12.2020):
Conda (📥 5.6K · ⏱️ 14.12.2020):
conda install -c conda-forge nevergrad
Ax (🥈25 · ⭐ 1.4K) - Adaptive Experimentation Platform. MIT
Hyperas (🥈24 · ⭐ 2.1K) - Keras + Hyperopt: A very simple wrapper for convenient.. MIT
GPyOpt (🥈24 · ⭐ 700) - Gaussian Process Optimization using GPy. BSD-3
Talos (🥉22 · ⭐ 1.4K) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch. MIT
Orion (🥉22 · ⭐ 180) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
AdaNet (🥉21 · ⭐ 3.2K) - Fast and flexible AutoML with learning guarantees. Apache-2
optunity (🥉21 · ⭐ 360 · 💤) - optimization routines for hyperparameter tuning. BSD-3
Neuraxle (🥉21 · ⭐ 350) - A Sklearn-like Framework for Hyperparameter Tuning and AutoML in.. Apache-2
mljar-supervised (🥉20 · ⭐ 740) - Automates Machine Learning Pipeline with Feature Engineering.. MIT
Auto ViML (🥉20 · ⭐ 200) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
Test Tube (🥉18 · ⭐ 640 · 💤) - Python library to easily log experiments and parallelize.. MIT
lazypredict (🥉18 · ⭐ 300 · ➕) - Lazy Predict help build a lot of basic models without much.. MIT
Dragonfly (🥉17 · ⭐ 550) - An open source python library for scalable Bayesian optimisation. MIT
HyperparameterHunter (🥉16 · ⭐ 630) - Easy hyperparameter optimization and automatic result.. MIT
AlphaPy (🥉16 · ⭐ 540) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
Auto Tune Models (🥉16 · ⭐ 500 · 💤) - Auto Tune Models - A multi-tenant, multi-data system for.. MIT
Parfit (🥉15 · ⭐ 200 · 💤) - A package for parallelizing the fit and flexibly scoring of.. MIT
ENAS (🥉14 · ⭐ 2.4K · 💤) - PyTorch implementation of Efficient Neural Architecture Search via.. Apache-2
Devol (🥉11 · ⭐ 930) - Genetic neural architecture search with Keras. MIT
Show 13 hidden projects...
TPOT (🥇30 · ⭐ 7.7K · 📈) - A Python Automated Machine Learning tool that optimizes.. ❗️LGPL-3.0
MLBox (🥈23 · ⭐ 1.2K) - MLBox is a powerful Automated Machine Learning python library. ❗️BSD-1-Clause
auto_ml (🥉21 · ⭐ 1.5K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
HpBandSter (🥉19 · ⭐ 440 · 💀) - a distributed Hyperband implementation on Steroids. BSD-3
Advisor (🥉17 · ⭐ 1.3K · 💀) - Open-source implementation of Google Vizier for hyper parameters.. Apache-2
sklearn-deap (🥉17 · ⭐ 620 · 💀) - Use evolutionary algorithms instead of gridsearch in.. MIT
Sherpa (🥉17 · ⭐ 280) - Hyperparameter optimization that enables researchers to experiment,.. ❗️GPL-3.0
automl-gs (🥉16 · ⭐ 1.7K · 💀) - Provide an input CSV and a target field to predict, generate a.. MIT
Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated.. Apache-2
Auptimizer (🥉13 · ⭐ 150) - An automatic ML model optimization tool. ❗️GPL-3.0
Hypermax (🥉13 · ⭐ 96) - Better, faster hyper-parameter optimization. BSD-3
featurewiz (🥉12 · ⭐ 17 · 🐣) - Select the best features from your data set fast with a single.. Apache-2
Hypertunity (🥉10 · ⭐ 120 · 💤) - A toolset for black-box hyperparameter optimisation. Apache-2
Libraries for building and evaluating reinforcement learning & agent-based systems.
OpenAI Gym (🥇35 · ⭐ 23K) - A toolkit for developing and comparing reinforcement learning.. MIT
baselines (🥇28 · ⭐ 11K · 💤) - OpenAI Baselines: high-quality implementations of reinforcement.. MIT
Dopamine (🥈27 · ⭐ 9.3K) - Dopamine is a research framework for fast prototyping of.. Apache-2
TensorLayer (🥈27 · ⭐ 6.5K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
TF-Agents (🥈27 · ⭐ 1.8K) - TF-Agents is a library for Reinforcement Learning in.. Apache-2
Stable Baselines (🥈25 · ⭐ 2.8K) - A fork of OpenAI Baselines, implementations of reinforcement.. MIT
ViZDoom (🥈25 · ⭐ 1.2K) - Doom-based AI Research Platform for Reinforcement Learning from Raw.. MIT
TensorForce (🥉24 · ⭐ 2.8K) - Tensorforce: a TensorFlow library for applied.. Apache-2
Acme (🥉23 · ⭐ 1.9K) - A library of reinforcement learning components and agents. Apache-2
garage (🥉23 · ⭐ 1K) - A toolkit for reproducible reinforcement learning research. MIT
ChainerRL (🥉23 · ⭐ 920) - ChainerRL is a deep reinforcement learning library built on top of.. MIT
PARL (🥉21 · ⭐ 1.8K · 📈) - A high-performance distributed training framework for.. Apache-2
TRFL (🥉20 · ⭐ 3.1K · 💤) - TensorFlow Reinforcement Learning. Apache-2
Coach (🥉19 · ⭐ 1.9K) - Reinforcement Learning Coach by Intel AI Lab enables easy.. Apache-2
PFRL (🥉19 · ⭐ 480) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
ReAgent (🥉16 · ⭐ 2.7K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
RLax (🥉16 · ⭐ 540) - A library of reinforcement learning building blocks in JAX. Apache-2
jax
Show 2 hidden projects...
keras-rl (🥈26 · ⭐ 4.9K · 💀) - Deep Reinforcement Learning for Keras. MIT
DeepMind Lab (🥉17 · ⭐ 6.4K) - A customisable 3D platform for agent-based AI research. ❗️GPL-2.0
Libraries for building and evaluating recommendation systems.
implicit (🥇28 · ⭐ 2.2K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
GitHub (👨💻 28 · 🔀 450 · 📦 340 · 📋 330 - 23% open · ⏱️ 15.11.2020):
git clone https://github.com/benfred/implicit
PyPi (📥 95K / month · 📦 22 · ⏱️ 15.09.2020):
Conda (📥 180K · ⏱️ 24.11.2020):
conda install -c conda-forge implicit
lightfm (🥇27 · ⭐ 3.5K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
GitHub (👨💻 42 · 🔀 560 · 📦 380 · 📋 400 - 31% open · ⏱️ 27.11.2020):
git clone https://github.com/lyst/lightfm
PyPi (📥 92K / month · 📦 28 · ⏱️ 27.11.2020):
Conda (📥 72K · ⏱️ 07.12.2020):
conda install -c conda-forge lightfm
scikit-surprise (🥈26 · ⭐ 4.6K) - A Python scikit for building and analyzing recommender.. BSD-3
GitHub (👨💻 38 · 🔀 810 · 📦 860 · 📋 320 - 10% open · ⏱️ 05.08.2020):
git clone https://github.com/NicolasHug/Surprise
PyPi (📥 39K / month · 📦 24 · ⏱️ 19.07.2020):
pip install scikit-surprise
Conda (📥 150K · ⏱️ 13.10.2020):
conda install -c conda-forge scikit-surprise
Recommenders (🥈21 · ⭐ 9K) - Best Practices on Recommendation Systems. MIT
TF Ranking (🥈21 · ⭐ 2K) - Learning to Rank in TensorFlow. Apache-2
tensorrec (🥈21 · ⭐ 1.1K · 💤) - A TensorFlow recommendation algorithm and framework in.. Apache-2
fastFM (🥉20 · ⭐ 890 · 💤) - fastFM: A Library for Factorization Machines. BSD-3
RecBole (🥉20 · ⭐ 680) - A unified, comprehensive and efficient recommendation library. MIT
GitHub (👨💻 27 · 🔀 94 · 📋 60 - 38% open · ⏱️ 12.01.2021):
git clone https://github.com/RUCAIBox/RecBole
PyPi (📥 86 / month · ⏱️ 06.12.2020):
Conda (📥 170 · ⏱️ 06.12.2020):
conda install -c aibox recbole
TF Recommenders (🥉19 · ⭐ 690) - TensorFlow Recommenders is a library for building.. Apache-2
recmetrics (🥉19 · ⭐ 230) - A library of metrics for evaluating recommender systems. MIT
Spotlight (🥉18 · ⭐ 2.4K · 💤) - Deep recommender models using PyTorch. MIT
OpenRec (🥉16 · ⭐ 350 · 💤) - OpenRec is an open-source and modular library for neural network-.. Apache-2
Case Recommender (🥉16 · ⭐ 310 · 💤) - Case Recommender: A Flexible and Extensible Python.. MIT
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
PySyft (🥇26 · ⭐ 6.7K) - A library for answering questions using data you cannot see. MIT
TensorFlow Privacy (🥈21 · ⭐ 1.3K) - Library for training machine learning models with.. Apache-2
TFEncrypted (🥈21 · ⭐ 810) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
Opacus (🥈21 · ⭐ 710) - Training PyTorch models with differential privacy. Apache-2
FATE (🥉20 · ⭐ 2.7K) - An Industrial Grade Federated Learning Framework. Apache-2
CrypTen (🥉16 · ⭐ 690) - A framework for Privacy Preserving Machine Learning. MIT
Workflow & Experiment Tracking
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🥇36 · ⭐ 5.1K) - TensorFlow's Visualization Toolkit. Apache-2
GitHub (👨💻 250 · 🔀 1.3K · 📦 51K · 📋 1.5K - 38% open · ⏱️ 12.01.2021):
git clone https://github.com/tensorflow/tensorboard
PyPi (📥 5.2M / month · 📦 3.6K · ⏱️ 12.11.2020):
Conda (📥 1.5M · ⏱️ 12.11.2020):
conda install -c conda-forge tensorboard
mlflow (🥇34 · ⭐ 8.1K) - Open source platform for the machine learning lifecycle. Apache-2
GitHub (👨💻 270 · 🔀 1.8K · 📦 1.8K · 📋 1.7K - 38% open · ⏱️ 12.01.2021):
git clone https://github.com/mlflow/mlflow
PyPi (📥 2.5M / month · 📦 150 · ⏱️ 31.12.2020):
Conda (📥 200K · ⏱️ 11.01.2021):
conda install -c conda-forge mlflow
DVC (🥇30 · ⭐ 7K) - Data Version Control | Git for Data & Models. Apache-2
GitHub (👨💻 200 · 🔀 660 · 📥 17K · 📦 520 · 📋 2.6K - 18% open · ⏱️ 12.01.2021):
git clone https://github.com/iterative/dvc
PyPi (📥 49K / month · 📦 46 · ⏱️ 05.01.2021):
Conda (📥 440K · ⏱️ 05.01.2021):
conda install -c conda-forge dvc
wandb client (🥇30 · ⭐ 2.6K · ➕) - A tool for visualizing and tracking your machine learning.. MIT
SageMaker SDK (🥇30 · ⭐ 1.3K) - A library for training and deploying machine learning.. Apache-2
kaggle (🥈29 · ⭐ 3.8K) - Official Kaggle API. Apache-2
GitHub (👨💻 35 · 🔀 760 · 📦 4.2K · 📋 270 - 60% open · ⏱️ 30.11.2020):
git clone https://github.com/Kaggle/kaggle-api
PyPi (📥 280K / month · 📦 560 · ⏱️ 30.11.2020):
Conda (📥 40K · ⏱️ 30.11.2020):
conda install -c conda-forge kaggle
sacred (🥈29 · ⭐ 3.3K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
snakemake (🥈29 · ⭐ 800) - This is the development home of the workflow management system.. MIT
GitHub (👨💻 190 · 🔀 180 · 📦 720 · 📋 510 - 60% open · ⏱️ 12.01.2021):
git clone https://github.com/snakemake/snakemake
PyPi (📥 9.3K / month · 📦 290 · ⏱️ 21.12.2020):
Conda (📥 270K · ⏱️ 22.12.2020):
conda install -c bioconda snakemake
PyCaret (🥈28 · ⭐ 2.8K) - An open-source, low-code machine learning library in Python. MIT
AzureML SDK (🥈28 · ⭐ 2.1K) - Python notebooks with ML and deep learning examples with Azure.. MIT
tensorboardX (🥈27 · ⭐ 6.8K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
GitHub (👨💻 64 · 🔀 770 · 📥 290 · 📦 9.7K · 📋 410 - 17% open · ⏱️ 05.07.2020):
git clone https://github.com/lanpa/tensorboardX
PyPi (📥 290K / month · 📦 1.3K · ⏱️ 31.12.2019):
Conda (📥 250K · ⏱️ 06.07.2020):
conda install -c conda-forge tensorboardx
Metaflow (🥈26 · ⭐ 4K) - Build and manage real-life data science projects with ease. Apache-2
GitHub (👨💻 28 · 🔀 310 · 📦 99 · 📋 250 - 43% open · ⏱️ 11.01.2021):
git clone https://github.com/Netflix/metaflow
PyPi (📥 38K / month · 📦 1 · ⏱️ 29.10.2020):
Conda (📥 10K · ⏱️ 12.11.2020):
conda install -c conda-forge metaflow
Catalyst (🥈26 · ⭐ 2.4K) - Accelerated deep learning R&D. Apache-2
TNT (🥈25 · ⭐ 1.3K) - Simple tools for logging and visualizing, loading and training. BSD-3
Hub (🥈24 · ⭐ 540 · ➕) - The fastest way to access and manage datasets for PyTorch and.. MPL-2.0
GitHub (👨💻 47 · 🔀 110 · 📦 110 · 📋 170 - 43% open · ⏱️ 12.01.2021):
git clone https://github.com/activeloopai/Hub
PyPi (📥 1.3K / month · 📦 52 · ⏱️ 10.01.2021):
Conda (📥 93K · ⏱️ 22.04.2020):
conda install -c conda-forge hub
ml-metadata (🥈24 · ⭐ 240) - For recording and retrieving metadata associated with ML.. Apache-2
VisualDL (🥉23 · ⭐ 3.2K) - Deep Learning Visualization Toolkit. Apache-2
ClearML (🥉23 · ⭐ 2.1K) - ClearML - Auto-Magical Suite of tools to streamline your ML.. Apache-2
GitHub (👨💻 24 · 🔀 290 · 📥 250 · 📦 7 · 📋 240 - 37% open · ⏱️ 12.01.2021):
git clone https://github.com/allegroai/clearml
PyPi (📥 990 / month · ⏱️ 11.01.2021):
Docker Hub (📥 30K · ⏱️ 05.10.2020):
docker pull allegroai/trains
livelossplot (🥉23 · ⭐ 1K · ➕) - Live training loss plot in Jupyter Notebook for Keras,.. MIT
TensorWatch (🥉22 · ⭐ 3K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
knockknock (🥉22 · ⭐ 1.9K · 💤) - Knock Knock: Get notified when your training ends with only two.. MIT
GitHub (👨💻 18 · 🔀 160 · 📦 130 · 📋 33 - 36% open · ⏱️ 16.03.2020):
git clone https://github.com/huggingface/knockknock
PyPi (📥 1.4K / month · 📦 3 · ⏱️ 16.03.2020):
Conda (📥 5.3K · ⏱️ 17.03.2020):
conda install -c conda-forge knockknock
Guild AI (🥉22 · ⭐ 520) - Experiment tracking, ML developer tools. Apache-2
lore (🥉21 · ⭐ 1.5K · 💤) - Lore makes machine learning approachable for Software Engineers and.. MIT
gokart (🥉21 · ⭐ 160) - A wrapper of the data pipeline library luigi. MIT
hiddenlayer (🥉20 · ⭐ 1.4K · 💤) - Neural network graphs and training metrics for.. MIT
Studio.ml (🥉20 · ⭐ 370) - Studio: Simplify and expedite model building process. Apache-2
Labml (🥉20 · ⭐ 350) - Monitor PyTorch & TensorFlow model training from your mobile phone. MIT
MXBoard (🥉19 · ⭐ 330 · 💤) - Logging MXNet data for visualization in TensorBoard. Apache-2
quinn (🥉19 · ⭐ 200 · ➕) - pyspark methods to enhance developer productivity. Apache-2
aim (🥉15 · ⭐ 670 · ➕) - Aim a super-easy way to record, search and compare 1000s of ML.. Apache-2
Show 6 hidden projects...
TensorBoard Logger (🥉20 · ⭐ 610 · 💀) - Log TensorBoard events without touching TensorFlow. MIT
SKLL (🥉17 · ⭐ 520) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine.. ❗️BSD-1-Clause
datmo (🥉16 · ⭐ 330 · 💀) - Open source production model management tool for data scientists. MIT
steppy (🥉15 · ⭐ 120 · 💀) - Lightweight, Python library for fast and reproducible experimentation. MIT
ModelChimp (🥉13 · ⭐ 120 · 💀) - Experiment tracking for machine and deep learning projects. BSD-2
traintool (🥉11 · ⭐ 8 · 🐣) - Train off-the-shelf machine learning models in one.. Apache-2
Model Serialization & Conversion
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
onnx (🥇33 · ⭐ 9.6K) - Open standard for machine learning interoperability. Apache-2
GitHub (👨💻 190 · 🔀 1.8K · 📥 9.8K · 📦 2.4K · 📋 1.4K - 35% open · ⏱️ 12.01.2021):
git clone https://github.com/onnx/onnx
PyPi (📥 320K / month · 📦 300 · ⏱️ 06.11.2020):
Conda (📥 180K · ⏱️ 11.01.2021):
conda install -c conda-forge onnx
Core ML Tools (🥇26 · ⭐ 2.1K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
TorchServe (🥈24 · ⭐ 1.5K) - Model Serving on PyTorch. Apache-2
GitHub (👨💻 61 · 🔀 220 · 📥 170 · 📦 26 · 📋 520 - 24% open · ⏱️ 24.12.2020):
git clone https://github.com/pytorch/serve
PyPi (📥 1.5K / month · ⏱️ 17.12.2020):
Conda (📥 5.9K · ⏱️ 17.12.2020):
conda install -c pytorch torchserve
Docker Hub (📥 50K · ⭐ 3 · ⏱️ 18.12.2020):
docker pull pytorch/torchserve
mmdnn (🥈22 · ⭐ 5.2K) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
m2cgen (🥈22 · ⭐ 1.7K · ➕) - Transform ML models into a native code (Java, C, Python, Go,.. MIT
cortex (🥉21 · ⭐ 7.2K) - Run inference at scale. Apache-2
Hummingbird (🥉20 · ⭐ 2.2K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
pytorch2keras (🥉18 · ⭐ 650 · 💤) - PyTorch to Keras model convertor. MIT
tfdeploy (🥉16 · ⭐ 350 · ➕) - Deploy tensorflow graphs for fast evaluation and export to.. BSD-3
Show 2 hidden projects...
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🥇33 · ⭐ 11K) - A game theoretic approach to explain the output of any machine learning model. MIT
GitHub (👨💻 140 · 🔀 1.6K · 📦 1.9K · 📋 1.4K - 64% open · ⏱️ 05.01.2021):
git clone https://github.com/slundberg/shap
PyPi (📥 780K / month · 📦 140 · ⏱️ 04.11.2020):
Conda (📥 350K · ⏱️ 16.12.2020):
conda install -c conda-forge shap
Lime (🥇29 · ⭐ 8.3K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
GitHub (👨💻 57 · 🔀 1.3K · 📦 990 · 📋 480 - 7% open · ⏱️ 12.01.2021):
git clone https://github.com/marcotcr/lime
PyPi (📥 170K / month · 📦 130 · ⏱️ 03.04.2020):
Conda (📥 62K · ⏱️ 28.06.2020):
conda install -c conda-forge lime
eli5 (🥇28 · ⭐ 2.3K · 💤) - A library for debugging/inspecting machine learning classifiers and.. MIT
GitHub (👨💻 14 · 🔀 280 · 📦 830 · 📋 240 - 53% open · ⏱️ 22.01.2020):
git clone https://github.com/TeamHG-Memex/eli5
PyPi (📥 250K / month · 📦 96 · ⏱️ 29.08.2019):
Conda (📥 85K · ⏱️ 15.06.2020):
conda install -c conda-forge eli5
pyLDAvis (🥇28 · ⭐ 1.4K) - Python library for interactive topic model visualization. Port of.. BSD-3
GitHub (👨💻 31 · 🔀 290 · 📦 1.7K · 📋 150 - 61% open · ⏱️ 02.12.2020):
git clone https://github.com/bmabey/pyLDAvis
PyPi (📥 62K / month · 📦 99 · ⏱️ 05.06.2018):
Conda (📥 21K · ⏱️ 22.06.2018):
conda install -c conda-forge pyldavis
InterpretML (🥇27 · ⭐ 3.4K) - Fit interpretable models. Explain blackbox machine learning. MIT
Model Analysis (🥇27 · ⭐ 1K) - Model analysis tools for TensorFlow. Apache-2
arviz (🥇27 · ⭐ 930 · ➕) - Exploratory analysis of Bayesian models with Python. Apache-2
GitHub (👨💻 62 · 🔀 170 · 📥 98 · 📦 660 · 📋 520 - 21% open · ⏱️ 10.01.2021):
git clone https://github.com/arviz-devs/arviz
PyPi (📥 110K / month · 📦 18 · ⏱️ 23.09.2020):
Conda (📥 160K · ⏱️ 24.09.2020):
conda install -c conda-forge arviz
Captum (🥈26 · ⭐ 2.1K · 📈) - Model interpretability and understanding for PyTorch. BSD-3
yellowbrick (🥈25 · ⭐ 3.1K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
dtreeviz (🥈25 · ⭐ 1.3K · ➕) - A python library for decision tree visualization and model.. MIT
Lucid (🥈24 · ⭐ 4K) - A collection of infrastructure and tools for research in neural.. Apache-2
DoWhy (🥈24 · ⭐ 2.6K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
GitHub (👨💻 37 · 🔀 360 · 📥 19 · 📦 24 · 📋 91 - 14% open · ⏱️ 24.12.2020):
git clone https://github.com/Microsoft/dowhy
PyPi (📥 7.9K / month · ⏱️ 12.12.2020):
Conda (📥 880 · ⏱️ 13.12.2020):
conda install -c conda-forge dowhy
Fairness 360 (🥈24 · ⭐ 1.2K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
keras-vis (🥈23 · ⭐ 2.8K · 💤) - Neural network visualization toolkit for keras. MIT
keract (🥈23 · ⭐ 840) - Activation Maps (Layers Outputs) and Gradients in Keras. MIT
TreeInterpreter (🥈23 · ⭐ 640 · 📈) - Package for interpreting scikit-learn's decision tree.. BSD-3
random-forest-importances (🥈23 · ⭐ 400 · ➕) - Code to compute permutation and drop-column.. MIT
Alibi (🥉22 · ⭐ 850) - Algorithms for monitoring and explaining machine learning models. Apache-2
Explainability 360 (🥉22 · ⭐ 750) - Interpretability and explainability of data and machine.. Apache-2
iNNvestigate (🥉21 · ⭐ 750) - A toolbox to iNNvestigate neural networks' predictions!. BSD-2
fairlearn (🥉21 · ⭐ 650 · ➕) - A Python package to assess and improve fairness of machine.. MIT
GitHub (👨💻 29 · 🔀 160 · 📋 180 - 38% open · ⏱️ 12.01.2021):
git clone https://github.com/fairlearn/fairlearn
PyPi (📥 3.4K / month · 📦 1 · ⏱️ 10.11.2020):
Conda (📥 8.1K · ⏱️ 11.11.2020):
conda install -c conda-forge fairlearn
aequitas (🥉21 · ⭐ 340 · ➕) - Bias and Fairness Audit Toolkit. MIT
checklist (🥉20 · ⭐ 1.2K · ➕) - Beyond Accuracy: Behavioral Testing of NLP models with.. MIT
tf-explain (🥉20 · ⭐ 750) - Interpretability Methods for tf.keras models with Tensorflow 2.x. MIT
deeplift (🥉20 · ⭐ 500 · ➕) - Public facing deeplift repo. MIT
sklearn-evaluation (🥉20 · ⭐ 290) - Machine learning model evaluation made easy: plots,.. MIT
explainerdashboard (🥉20 · ⭐ 240 · ➕) - Quickly build Explainable AI dashboards that show the.. MIT
What-If Tool (🥉18 · ⭐ 420) - Source code/webpage/demos for the What-If Tool. Apache-2
GitHub (👨💻 18 · 🔀 89 · 📋 70 - 51% open · ⏱️ 08.01.2021):
git clone https://github.com/PAIR-code/what-if-tool
PyPi (📥 2.1K / month · ⏱️ 28.06.2020):
NPM (📥 1.7K / month · ⏱️ 03.11.2020):
fairness-indicators (🥉18 · ⭐ 170 · ➕) - Tensorflow's Fairness Evaluation and Visualization.. Apache-2
LIT (🥉17 · ⭐ 2.3K · 🐣) - The Language Interpretability Tool: Interactively analyze NLP models.. Apache-2
DiCE (🥉17 · ⭐ 430) - Generate Diverse Counterfactual Explanations for any machine.. MIT
ExplainX.ai (🥉17 · ⭐ 160) - Explainable AI framework for data scientists. Explain & debug any.. MIT
FlashTorch (🥉16 · ⭐ 520 · 💤) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
tcav (🥉16 · ⭐ 420 · ➕) - Code for the TCAV ML interpretability project. Apache-2
LOFO (🥉16 · ⭐ 290) - Leave One Feature Out Importance. MIT
model-card-toolkit (🥉15 · ⭐ 150 · 🐣) - a tool that leverages rich metadata and lineage.. Apache-2
Anchor (🥉14 · ⭐ 610) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
Show 6 hidden projects...
scikit-plot (🥈23 · ⭐ 2K · 💀) - An intuitive library to add plotting functionality to scikit-.. MIT
Skater (🥉20 · ⭐ 960 · 💤) - Python Library for Model Interpretation/Explanations. ❗️UPL-1.0
DALEX (🥉18 · ⭐ 750 · ➕) - moDel Agnostic Language for Exploration and eXplanation. ❗️GPL-3.0
XAI (🥉16 · ⭐ 550 · 💀) - XAI - An eXplainability toolbox for machine learning. MIT
contextual-ai (🥉13 · ⭐ 65 · ➕) - Contextual AI adds explainability to different stages of.. Apache-2
Attribution Priors (🥉11 · ⭐ 72) - Tools for training explainable models using.. MIT
Vector Similarity Search (ANN)
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.
🔗 ANN Benchmarks ( ⭐ 2K) - Benchmarks of approximate nearest neighbor libraries in Python.
Faiss (🥇29 · ⭐ 12K · 📈) - A library for efficient similarity search and clustering of dense vectors. MIT
GitHub (👨💻 72 · 🔀 2.1K · 📦 290 · 📋 1.3K - 7% open · ⏱️ 11.01.2021):
git clone https://github.com/facebookresearch/faiss
PyPi (📥 5.5K / month · 📦 6 · ⏱️ 16.10.2020):
Conda (📥 21K · ⏱️ 12.12.2020):
conda install -c conda-forge faiss
Annoy (🥇29 · ⭐ 8K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
NMSLIB (🥈28 · ⭐ 2.2K) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
GitHub (👨💻 44 · 🔀 330 · 📦 310 · 📋 360 - 12% open · ⏱️ 08.01.2021):
git clone https://github.com/nmslib/nmslib
PyPi (📥 50K / month · 📦 52 · ⏱️ 08.01.2021):
Conda (📥 9.8K · ⏱️ 08.01.2021):
conda install -c conda-forge nmslib
Milvus (🥈25 · ⭐ 4.9K) - An open source embedding vector similarity search engine powered by.. Apache-2
GitHub (👨💻 140 · 🔀 750 · 📋 2.1K - 10% open · ⏱️ 12.01.2021):
git clone https://github.com/milvus-io/milvus
PyPi (📥 5.5K / month · 📦 6 · ⏱️ 16.10.2020):
Docker Hub (📥 220K · ⭐ 9 · ⏱️ 06.01.2021):
docker pull milvusdb/milvus
hnswlib (🥈22 · ⭐ 1.3K) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
Magnitude (🥉21 · ⭐ 1.4K) - A fast, efficient universal vector embedding utility package. MIT
PyNNDescent (🥉20 · ⭐ 360) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
GitHub (👨💻 10 · 🔀 43 · 📋 55 - 47% open · ⏱️ 05.01.2021):
git clone https://github.com/lmcinnes/pynndescent
PyPi (📥 6.6K / month · 📦 3 · ⏱️ 19.11.2020):
Conda (📥 20K · ⏱️ 19.11.2020):
conda install -c conda-forge pynndescent
NGT (🥉19 · ⭐ 610) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
N2 (🥉19 · ⭐ 450) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast.. Apache-2
Show 2 hidden projects...
NearPy (🥉20 · ⭐ 660 · 💀) - Python framework for fast (approximated) nearest neighbour search in.. MIT
PySparNN (🥉11 · ⭐ 840 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3
Probabilistics & Statistics
Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.
PyMC3 (🥇32 · ⭐ 5.5K) - Probabilistic Programming in Python: Bayesian Modeling and.. Apache-2
GitHub (👨💻 300 · 🔀 1.3K · 📥 140 · 📦 1.9K · 📋 2.1K - 7% open · ⏱️ 10.01.2021):
git clone https://github.com/pymc-devs/pymc3
PyPi (📥 120K / month · 📦 290 · ⏱️ 07.12.2020):
Conda (📥 240K · ⏱️ 07.12.2020):
conda install -c conda-forge pymc3
tensorflow-probability (🥇31 · ⭐ 3.2K) - Probabilistic reasoning and statistical analysis in.. Apache-2
GitHub (👨💻 400 · 🔀 840 · 📦 1 · 📋 940 - 45% open · ⏱️ 12.01.2021):
git clone https://github.com/tensorflow/probability
PyPi (📥 310K / month · 📦 250 · ⏱️ 29.12.2020):
pip install tensorflow-probability
Conda (📥 29K · ⏱️ 13.03.2020):
conda install -c conda-forge tensorflow-probability
hmmlearn (🥇29 · ⭐ 2.2K · ➕) - Hidden Markov Models in Python, with scikit-learn like API. BSD-3
GitHub (👨💻 31 · 🔀 600 · 📦 840 · 📋 340 - 18% open · ⏱️ 28.12.2020):
git clone https://github.com/hmmlearn/hmmlearn
PyPi (📥 130K / month · 📦 210 · ⏱️ 12.09.2020):
Conda (📥 58K · ⏱️ 01.11.2020):
conda install -c conda-forge hmmlearn
GPyTorch (🥈28 · ⭐ 2.2K) - A highly efficient and modular implementation of Gaussian Processes.. MIT
Pyro (🥈27 · ⭐ 6.7K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
filterpy (🥈27 · ⭐ 1.6K · ➕) - Python Kalman filtering and optimal estimation library. Implements.. MIT
GitHub (👨💻 33 · 🔀 400 · 📦 680 · 📋 170 - 12% open · ⏱️ 04.01.2021):
git clone https://github.com/rlabbe/filterpy
PyPi (📥 15K / month · 📦 210 · ⏱️ 10.10.2018):
Conda (📥 56K · ⏱️ 05.05.2020):
conda install -c conda-forge filterpy
GPflow (🥈27 · ⭐ 1.4K) - Gaussian processes in TensorFlow. Apache-2
GitHub (👨💻 68 · 🔀 390 · 📦 210 · 📋 690 - 12% open · ⏱️ 12.01.2021):
git clone https://github.com/GPflow/GPflow
PyPi (📥 2.3K / month · 📦 17 · ⏱️ 01.12.2020):
Conda (📥 8K · ⏱️ 06.11.2018):
conda install -c conda-forge gpflow
pomegranate (🥉26 · ⭐ 2.6K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
GitHub (👨💻 61 · 🔀 460 · 📦 380 · 📋 570 - 5% open · ⏱️ 09.01.2021):
git clone https://github.com/jmschrei/pomegranate
PyPi (📥 22K / month · 📦 56 · ⏱️ 09.01.2021):
Conda (📥 42K · ⏱️ 01.11.2020):
conda install -c conda-forge pomegranate
pgmpy (🥉24 · ⭐ 1.7K) - Python Library for learning (Structure and Parameter) and inference.. MIT
SALib (🥉24 · ⭐ 420 · ➕) - Sensitivity Analysis Library in Python (Numpy). Contains Sobol, Morris,.. MIT
GitHub (👨💻 23 · 🔀 140 · 📋 220 - 18% open · ⏱️ 12.11.2020):
git clone https://github.com/SALib/SALib
PyPi (📥 13K / month · 📦 44 · ⏱️ 19.11.2020):
Conda (📥 56K · ⏱️ 24.10.2020):
conda install -c conda-forge salib
scikit-posthocs (🥉21 · ⭐ 170 · ➕) - Pairwise Multiple Comparisons (Post Hoc) Tests in.. MIT
bambi (🥉20 · ⭐ 540 · ➕) - BAyesian Model-Building Interface (Bambi) in Python. MIT
pyhsmm (🥉18 · ⭐ 480) - Bayesian inference in HSMMs and HMMs. MIT
Funsor (🥉18 · ⭐ 160) - Functional tensors for probabilistic programming. Apache-2
Baal (🥉17 · ⭐ 310) - Using approximate bayesian posteriors in deep nets for active learning. Apache-2
Orbit (🥉17 · ⭐ 300) - Bayesian forecasting with object-oriented design and probabilistic.. Apache-2
Show 5 hidden projects...
PyStan (🥈27 · ⭐ 910) - PyStan, the Python interface to Stan. ❗️GPL-3.0
patsy (🥈27 · ⭐ 730 · 💀) - Describing statistical models in Python using symbolic formulas. BSD-2
Edward (🥉24 · ⭐ 4.6K · 💀) - A probabilistic programming language in TensorFlow. Deep.. Apache-2
pingouin (🥉22 · ⭐ 610 · ➕) - Statistical package in Python based on Pandas. ❗️GPL-3.0
ZhuSuan (🥉14 · ⭐ 2K · 💀) - A probabilistic programming library for Bayesian deep learning,.. MIT
Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
CleverHans (🥇25 · ⭐ 4.9K) - An adversarial example library for constructing attacks,.. MIT
Foolbox (🥇25 · ⭐ 1.8K · 📉) - A Python toolbox to create adversarial examples that fool neural.. MIT
ART (🥈23 · ⭐ 1.9K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
TextAttack (🥈23 · ⭐ 1.2K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
robustness (🥉18 · ⭐ 450 · ➕) - A library for experimenting with, training and evaluating neural.. MIT
AdvBox (🥉16 · ⭐ 1K) - Advbox is a toolbox to generate adversarial examples that fool neural.. Apache-2
Show 2 hidden projects...
advertorch (🥉17 · ⭐ 790 · 💤) - A Toolbox for Adversarial Robustness Research. ❗️GPL-3.0
Adversary (🥉13 · ⭐ 340 · 💀) - Tool to generate adversarial text examples and test machine.. MIT
Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.
CuPy (🥇30 · ⭐ 4.8K) - A NumPy-compatible array library accelerated by CUDA. MIT
GitHub (👨💻 250 · 🔀 430 · 📥 5.3K · 📦 630 · 📋 1.2K - 29% open · ⏱️ 12.01.2021):
git clone https://github.com/cupy/cupy
PyPi (📥 7.8K / month · 📦 190 · ⏱️ 25.12.2020):
Conda (📥 330K · ⏱️ 30.12.2020):
conda install -c conda-forge cupy
Docker Hub (📥 48K · ⭐ 6 · ⏱️ 12.01.2021):
PyCUDA (🥇27 · ⭐ 1.1K) - CUDA integration for Python, plus shiny features. MIT
gpustat (🥈26 · ⭐ 2.2K · 💤) - A simple command-line utility for querying and monitoring GPU status. MIT
GitHub (👨💻 10 · 🔀 180 · 📦 560 · 📋 66 - 30% open · ⏱️ 19.05.2020):
git clone https://github.com/wookayin/gpustat
PyPi (📥 130K / month · 📦 58 · ⏱️ 02.01.2021):
Conda (📥 7.6K · ⏱️ 24.11.2020):
conda install -c conda-forge gpustat
Apex (🥈23 · ⭐ 4.9K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
ArrayFire (🥈22 · ⭐ 3.3K) - ArrayFire: a general purpose GPU library. BSD-3
py3nvml (🥈22 · ⭐ 160 · 💤) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside.. BSD-3
GitHub (👨💻 6 · 🔀 24 · 📦 220 · 📋 11 - 18% open · ⏱️ 23.04.2020):
git clone https://github.com/fbcotter/py3nvml
PyPi (📥 99K / month · 📦 20 · ⏱️ 06.04.2020):
Conda (📥 11K · ⏱️ 10.10.2020):
conda install -c conda-forge py3nvml
scikit-cuda (🥉21 · ⭐ 790) - Python interface to GPU-powered libraries. BSD-3
cuDF (🥉20 · ⭐ 3.6K) - cuDF - GPU DataFrame Library. Apache-2
DALI (🥉20 · ⭐ 3K) - A library containing both highly optimized building blocks and an.. Apache-2
cuML (🥉19 · ⭐ 1.9K) - cuML - RAPIDS Machine Learning Library. Apache-2
BlazingSQL (🥉17 · ⭐ 1.4K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
cuGraph (🥉16 · ⭐ 620) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
SpeedTorch (🥉16 · ⭐ 590 · 💤) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
cuSignal (🥉15 · ⭐ 430) - GPU accelerated signal processing. Apache-2
Show 3 hidden projects...
GPUtil (🥈22 · ⭐ 660 · 💀) - A Python module for getting the GPU status from NVIDA GPUs using.. MIT
nvidia-ml-py3 (🥉17 · ⭐ 60 · 💀) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
ipyexperiments (🥉16 · ⭐ 120) - jupyter/ipython experiment containers for GPU and.. Apache-2
Libraries that extend TensorFlow with additional capabilities.
tensor2tensor (🥇32 · ⭐ 11K) - Library of deep learning models and datasets designed to.. Apache-2
tensorflow-hub (🥇32 · ⭐ 2.7K) - A library for transfer learning by reusing parts of.. Apache-2
GitHub (👨💻 66 · 🔀 1.4K · 📦 5K · 📋 540 - 8% open · ⏱️ 12.01.2021):
git clone https://github.com/tensorflow/hub
PyPi (📥 840K / month · 📦 310 · ⏱️ 06.01.2021):
pip install tensorflow-hub
Conda (📥 48K · ⏱️ 24.08.2020):
conda install -c conda-forge tensorflow-hub
TF Addons (🥈30 · ⭐ 1.2K) - Useful extra functionality for TensorFlow 2.x maintained by.. Apache-2
TensorFlow Transform (🥈29 · ⭐ 850) - Input pipeline framework. Apache-2
efficientnet (🥈26 · ⭐ 1.7K) - Implementation of EfficientNet model. Keras and.. Apache-2
TF Model Optimization (🥈26 · ⭐ 940 · 📉) - A toolkit to optimize ML models for deployment for.. Apache-2
TensorFlow I/O (🥉25 · ⭐ 400) - Dataset, streaming, and file system extensions.. Apache-2
TensorFlow Cloud (🥉23 · ⭐ 210) - The TensorFlow Cloud repository provides APIs that.. Apache-2
Neural Structured Learning (🥉22 · ⭐ 760) - Training neural models with structured signals. Apache-2
TensorNets (🥉20 · ⭐ 970) - High level network definitions with pre-trained weights in.. MIT
tffm (🥉19 · ⭐ 760 · 💤) - TensorFlow implementation of an arbitrary order Factorization Machine. MIT
Saliency (🥉16 · ⭐ 620) - TensorFlow implementation for SmoothGrad, Grad-CAM, Guided.. Apache-2
TF Compression (🥉16 · ⭐ 410) - Data compression in TensorFlow. Apache-2
Libraries that extend scikit-learn with additional capabilities.
imbalanced-learn (🥇30 · ⭐ 5K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
GitHub (👨💻 51 · 🔀 1.1K · 📦 3.8K · 📋 450 - 10% open · ⏱️ 03.11.2020):
git clone https://github.com/scikit-learn-contrib/imbalanced-learn
PyPi (📥 750K / month · 📦 280 · ⏱️ 09.06.2020):
pip install imbalanced-learn
Conda (📥 100K · ⏱️ 14.06.2020):
conda install -c conda-forge imbalanced-learn
MLxtend (🥇30 · ⭐ 3.3K) - A library of extension and helper modules for Python's data.. BSD-3
GitHub (👨💻 79 · 🔀 680 · 📦 2.3K · 📋 360 - 27% open · ⏱️ 09.01.2021):
git clone https://github.com/rasbt/mlxtend
PyPi (📥 200K / month · 📦 95 · ⏱️ 26.11.2020):
Conda (📥 150K · ⏱️ 26.11.2020):
conda install -c conda-forge mlxtend
category_encoders (🥈25 · ⭐ 1.5K) - A library of sklearn compatible categorical variable.. BSD-3
GitHub (👨💻 34 · 🔀 290 · 📋 200 - 32% open · ⏱️ 31.07.2020):
git clone https://github.com/scikit-learn-contrib/category_encoders
PyPi (📥 220K / month · 📦 23 · ⏱️ 14.10.2018):
pip install category_encoders
Conda (📥 91K · ⏱️ 29.04.2020):
conda install -c conda-forge category_encoders
combo (🥈24 · ⭐ 470 · ➕) - A Python Toolbox for Machine Learning Model Combination. BSD-2
xgboost
sklearn-contrib-lightning (🥈23 · ⭐ 1.4K) - Large-scale linear classification, regression and.. BSD-3
GitHub (👨💻 16 · 🔀 190 · 📦 72 · 📋 85 - 57% open · ⏱️ 04.01.2021):
git clone https://github.com/scikit-learn-contrib/lightning
PyPi (📥 520 / month · 📦 5 · ⏱️ 16.12.2020):
pip install sklearn-contrib-lightning
Conda (📥 130K · ⏱️ 20.12.2020):
conda install -c conda-forge sklearn-contrib-lightning
scikit-opt (🥉22 · ⭐ 1.7K · ➕) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
fancyimpute (🥉22 · ⭐ 910 · ➕) - Multivariate imputation and matrix completion.. Apache-2
scikit-lego (🥉22 · ⭐ 370 · ➕) - Extra blocks for scikit-learn pipelines. MIT
GitHub (👨💻 38 · 🔀 63 · 📦 15 · 📋 200 - 10% open · ⏱️ 04.01.2021):
git clone https://github.com/koaning/scikit-lego
PyPi (📥 860 / month · ⏱️ 04.01.2021):
Conda (📥 7.5K · ⏱️ 02.11.2020):
conda install -c conda-forge scikit-lego
iterative-stratification (🥉19 · ⭐ 490 · ➕) - scikit-learn cross validators for iterative.. BSD-3
scikit-tda (🥉19 · ⭐ 260 · ➕) - Topological Data Analysis for Python. MIT
DESlib (🥉18 · ⭐ 290 · ➕) - A Python library for dynamic classifier and ensemble selection. BSD-3
skggm (🥉17 · ⭐ 180 · ➕) - Scikit-learn compatible estimation of general graphical models. MIT
Show 4 hidden projects...
Libraries that extend Pytorch with additional capabilities.
pretrainedmodels (🥇27 · ⭐ 7.6K · 💤) - Pretrained ConvNets for pytorch: NASNet, ResNeXt,.. BSD-3
pytorch-optimizer (🥇25 · ⭐ 1.6K · ➕) - torch-optimizer -- collection of optimizers for.. Apache-2
torchdiffeq (🥇24 · ⭐ 3.3K) - Differentiable ODE solvers with full GPU support and.. MIT
pytorch-summary (🥇24 · ⭐ 2.9K) - Model summary in PyTorch similar to `model.summary()` in.. MIT
PML (🥇24 · ⭐ 2.6K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
GitHub (👨💻 12 · 🔀 360 · 📦 45 · 📋 200 - 16% open · ⏱️ 12.01.2021):
git clone https://github.com/KevinMusgrave/pytorch-metric-learning
PyPi (📥 3.6K / month · ⏱️ 27.12.2019):
pip install pytorch-metric-learning
Conda (📥 1.1K · ⏱️ 12.01.2021):
conda install -c metric-learning pytorch-metric-learning
SRU (🥇24 · ⭐ 1.9K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755 ). MIT
EfficientNet-PyTorch (🥈23 · ⭐ 5.3K) - A PyTorch implementation of EfficientNet. Apache-2
EfficientNets (🥈22 · ⭐ 1.2K) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
Torchmeta (🥈21 · ⭐ 1.2K) - A collection of extensions and data-loaders for few-shot learning.. MIT
PyTorch Sparse (🥈21 · ⭐ 340) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
reformer-pytorch (🥈20 · ⭐ 1.3K) - Reformer, the efficient Transformer, in Pytorch. MIT
torch-scatter (🥈20 · ⭐ 570) - PyTorch Extension Library of Optimized Scatter Operations. MIT
Pytorch Toolbelt (🥉19 · ⭐ 870) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
TabNet (🥉19 · ⭐ 720) - PyTorch implementation of TabNet paper :.. MIT
Higher (🥉18 · ⭐ 1K) - higher is a pytorch library allowing users to obtain higher order.. Apache-2
Performer Pytorch (🥉17 · ⭐ 440 · 🐣) - An implementation of Performer, a linear attention-.. MIT
Lambda Networks (🥉16 · ⭐ 1.3K · 🐣) - Implementation of LambdaNetworks, a new approach to.. MIT
Tensor Sensor (🥉16 · ⭐ 490 · 🐣) - The goal of this library is to generate more helpful.. MIT
Pywick (🥉16 · ⭐ 310) - High-level batteries-included neural network training library for.. MIT
tinygrad (🥉15 · ⭐ 3.9K · 🐣) - You like pytorch? You like micrograd? You love tinygrad!. MIT
torchsde (🥉15 · ⭐ 630 · 🐣) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
micrograd (🥉14 · ⭐ 1.5K · 💤) - A tiny scalar-valued autograd engine and a neural net library.. MIT
Tez (🥉14 · ⭐ 390 · 🐣) - Tez is a super-simple and lightweight Trainer for PyTorch. It.. Apache-2
Torch-Struct (🥉13 · ⭐ 870) - Fast, general, and tested differentiable structured prediction.. MIT
Show 3 hidden projects...
Libraries for connecting to, operating, and querying databases.
🔗 best-of-python - DB Clients ( ⭐ 2) - Collection of database clients for python.
scipy (🥇40 · ⭐ 7.8K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
GitHub (👨💻 1.1K · 🔀 3.5K · 📥 300K · 📦 290K · 📋 7.2K - 21% open · ⏱️ 11.01.2021):
git clone https://github.com/scipy/scipy
PyPi (📥 14M / month · 📦 87K · ⏱️ 22.12.2020):
Conda (📥 12M · ⏱️ 12.01.2021):
conda install -c conda-forge scipy
SymPy (🥇36 · ⭐ 7.7K) - A computer algebra system written in pure Python. BSD-3
GitHub (👨💻 1K · 🔀 3.2K · 📥 410K · 📦 28K · 📋 11K - 35% open · ⏱️ 12.01.2021):
git clone https://github.com/sympy/sympy
PyPi (📥 550K / month · 📦 6.4K · ⏱️ 12.12.2020):
Conda (📥 1.2M · ⏱️ 08.01.2021):
conda install -c conda-forge sympy
Keras-Preprocessing (🥇29 · ⭐ 900 · ➕) - Utilities for working with image data, text data, and.. MIT
GitHub (👨💻 48 · 🔀 390 · 📋 190 - 48% open · ⏱️ 11.12.2020):
git clone https://github.com/keras-team/keras-preprocessing
PyPi (📥 3.7M / month · 📦 2.7K · ⏱️ 14.05.2020):
pip install keras-preprocessing
Conda (📥 760K · ⏱️ 25.08.2019):
conda install -c conda-forge keras-preprocessing
PyOD (🥇28 · ⭐ 4K) - A Python Toolbox for Scalable Outlier Detection (Anomaly Detection). BSD-2
Cython BLIS (🥇28 · ⭐ 160) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
GitHub (👨💻 9 · 🔀 22 · 📦 7.7K · 📋 21 - 28% open · ⏱️ 07.12.2020):
git clone https://github.com/explosion/cython-blis
PyPi (📥 790K / month · 📦 390 · ⏱️ 07.12.2020):
Conda (📥 380K · ⏱️ 07.12.2020):
conda install -c conda-forge cython-blis
hdbscan (🥈27 · ⭐ 1.8K) - A high performance implementation of HDBSCAN clustering. BSD-3
GitHub (👨💻 64 · 🔀 330 · 📦 700 · 📋 360 - 60% open · ⏱️ 06.01.2021):
git clone https://github.com/scikit-learn-contrib/hdbscan
PyPi (📥 120K / month · 📦 120 · ⏱️ 19.03.2020):
Conda (📥 510K · ⏱️ 02.11.2020):
conda install -c conda-forge hdbscan
pyopencl (🥈27 · ⭐ 760) - OpenCL integration for Python, plus shiny features. MIT
GitHub (👨💻 82 · 🔀 200 · 📦 440 · 📋 260 - 19% open · ⏱️ 04.01.2021):
git clone https://github.com/inducer/pyopencl
PyPi (📥 5K / month · 📦 240 · ⏱️ 20.11.2020):
Conda (📥 300K · ⏱️ 20.11.2020):
conda install -c conda-forge pyopencl
Streamlit (🥈26 · ⭐ 13K) - Streamlit The fastest way to build data apps in Python. Apache-2
carla (🥈26 · ⭐ 5.5K) - Open-source simulator for autonomous driving research. MIT
Datasette (🥈26 · ⭐ 4.5K) - An open source multi-tool for exploring and publishing data. Apache-2
agate (🥈26 · ⭐ 1K · 💤) - A Python data analysis library that is optimized for humans instead of.. MIT
GitHub (👨💻 47 · 🔀 130 · 📦 480 · 📋 640 - 8% open · ⏱️ 01.04.2020):
git clone https://github.com/wireservice/agate
PyPi (📥 150K / month · 📦 92 · ⏱️ 11.03.2018):
Conda (📥 59K · ⏱️ 19.08.2018):
conda install -c conda-forge agate
pyclustering (🥈26 · ⭐ 780 · ➕) - pyclustring is a Python, C++ data mining library. BSD-3
GitHub (👨💻 26 · 🔀 180 · 📥 280 · 📦 170 · 📋 640 - 8% open · ⏱️ 03.12.2020):
git clone https://github.com/annoviko/pyclustering
PyPi (📥 18K / month · 📦 36 · ⏱️ 25.11.2020):
Conda (📥 12K · ⏱️ 30.11.2020):
conda install -c conda-forge pyclustering
Trax (🥈25 · ⭐ 5.5K) - Trax Deep Learning with Clear Code and Speed. Apache-2
Pythran (🥈25 · ⭐ 1.5K) - Ahead of Time compiler for numeric kernels. BSD-3
GitHub (👨💻 47 · 🔀 130 · 📦 48 · 📋 640 - 15% open · ⏱️ 06.01.2021):
git clone https://github.com/serge-sans-paille/pythran
PyPi (📥 3.9K / month · 📦 13 · ⏱️ 11.12.2020):
Conda (📥 120K · ⏱️ 15.12.2020):
conda install -c conda-forge pythran
DeepChem (🥈24 · ⭐ 2.7K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry,.. MIT
causalml (🥈24 · ⭐ 1.5K · ➕) - Uplift modeling and causal inference with machine learning.. Apache-2
kmodes (🥈24 · ⭐ 800) - Python implementations of the k-modes and k-prototypes clustering.. MIT
PennyLane (🥈24 · ⭐ 690) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
pyjanitor (🥈24 · ⭐ 610) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
GitHub (👨💻 87 · 🔀 120 · 📦 72 · 📋 370 - 24% open · ⏱️ 31.12.2020):
git clone https://github.com/ericmjl/pyjanitor
PyPi (📥 1.5K / month · 📦 2 · ⏱️ 03.10.2020):
Conda (📥 76K · ⏱️ 04.10.2020):
conda install -c conda-forge pyjanitor
findspark (🥈24 · ⭐ 380 · 💤) - Find pyspark to make it importable. BSD-3
GitHub (👨💻 14 · 🔀 63 · 📦 1.4K · 📋 19 - 57% open · ⏱️ 08.06.2020):
git clone https://github.com/minrk/findspark
PyPi (📥 560K / month · 📦 200 · ⏱️ 08.06.2020):
Conda (📥 480K · ⏱️ 06.07.2018):
conda install -c conda-forge findspark
datalad (🥈24 · ⭐ 220) - Keep code, data, containers under control with git and git-annex. MIT
GitHub (👨💻 40 · 🔀 68 · 📋 2.9K - 24% open · ⏱️ 12.01.2021):
git clone https://github.com/datalad/datalad
PyPi (📥 1.2K / month · 📦 26 · ⏱️ 14.12.2020):
Conda (📥 76K · ⏱️ 04.01.2021):
conda install -c conda-forge datalad
PaddleHub (🥉23 · ⭐ 4.4K) - Awesome pre-trained models toolkit based on.. Apache-2
metric-learn (🥉23 · ⭐ 1.1K) - Metric learning algorithms in Python. MIT
pycm (🥉23 · ⭐ 1K · ➕) - Multi-class confusion matrix library in Python. MIT
TabPy (🥉23 · ⭐ 1K · ➕) - Execute Python code on the fly and display results in Tableau.. MIT
modAL (🥉23 · ⭐ 1K) - A modular active learning framework for Python. MIT
tensorly (🥉23 · ⭐ 940) - TensorLy: Tensor Learning in Python. BSD-2
GitHub (👨💻 38 · 🔀 190 · 📋 120 - 20% open · ⏱️ 03.01.2021):
git clone https://github.com/tensorly/tensorly
PyPi (📥 2.7K / month · 📦 20 · ⏱️ 07.12.2020):
Conda (📥 110K · ⏱️ 07.12.2020):
conda install -c conda-forge tensorly
PySwarms (🥉23 · ⭐ 710) - A research toolkit for particle swarm optimization in Python. MIT
Mars (🥉22 · ⭐ 2K) - Mars is a tensor-based unified framework for large-scale data computation.. Apache-2
Prince (🥉22 · ⭐ 550) - Python factor analysis library (PCA, CA, MCA, MFA, FAMD). MIT
SUOD (🥉22 · ⭐ 230) - An Acceleration System for Large-scale Outlier Detection (Anomaly Detection). BSD-2
cleanlab (🥉21 · ⭐ 1.4K) - The standard package for machine learning with noisy labels and finding.. MIT
AstroML (🥉21 · ⭐ 710) - Machine learning, statistics, and data mining for astronomy and.. BSD-2
GitHub (👨💻 29 · 🔀 250 · 📦 160 · 📋 130 - 37% open · ⏱️ 09.09.2020):
git clone https://github.com/astroML/astroML
PyPi (📥 520 / month · 📦 29 · ⏱️ 23.03.2020):
Conda (📥 21K · ⏱️ 16.02.2020):
conda install -c conda-forge astroml
BioPandas (🥉21 · ⭐ 320) - Working with molecular structures in pandas DataFrames. BSD-3
GitHub (👨💻 7 · 🔀 76 · 📦 47 · 📋 34 - 47% open · ⏱️ 01.01.2021):
git clone https://github.com/rasbt/biopandas
PyPi (📥 200 / month · 📦 6 · ⏱️ 04.08.2020):
Conda (📥 61K · ⏱️ 08.08.2020):
conda install -c conda-forge biopandas
StreamAlert (🥉20 · ⭐ 2.4K) - StreamAlert is a serverless, realtime data analysis framework.. Apache-2
alibi-detect (🥉20 · ⭐ 520) - Algorithms for outlier and adversarial instance detection,.. Apache-2
scikit-rebate (🥉20 · ⭐ 300 · ➕) - A scikit-learn-compatible Python implementation of.. MIT
gplearn (🥉19 · ⭐ 900 · 💤) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
mlens (🥉19 · ⭐ 670 · 💤) - ML-Ensemble high performance ensemble learning. MIT
baikal (🥉19 · ⭐ 570) - A graph-based functional API for building complex scikit-learn pipelines. BSD-3
GitHub (👨💻 1 · 🔀 28 · 📦 3 · 📋 15 - 26% open · ⏱️ 15.11.2020):
git clone https://github.com/alegonz/baikal
PyPi (📥 31 / month · ⏱️ 15.11.2020):
Conda (📥 380K · ⏱️ 07.12.2020):
conda install -c conda-forge cython-blis
Feature Engine (🥉19 · ⭐ 420) - Feature engineering package with sklearn like functionality. BSD-3
GitHub (👨💻 19 · 🔀 120 · 📋 99 - 29% open · ⏱️ 12.01.2021):
git clone https://github.com/solegalli/feature_engine
PyPi (📥 13K / month · 📦 2 · ⏱️ 11.01.2021):
pip install feature_engine
Conda (📥 1.2K · ⏱️ 11.01.2021):
conda install -c conda-forge feature_engine
rrcf (🥉19 · ⭐ 270 · 💤) - Implementation of the Robust Random Cut Forest algorithm for anomaly.. MIT
apricot (🥉18 · ⭐ 300) - apricot implements submodular optimization for the purpose of selecting.. MIT
River (🥉17 · ⭐ 1.3K) - Online machine learning in Python. BSD-3
Show 7 hidden projects...
Autograd (🥇29 · ⭐ 5.1K · 💀) - Efficiently computes derivatives of numpy code. MIT
pysc2 (🥈24 · ⭐ 7.1K · 💀) - StarCraft II Learning Environment. Apache-2
minisom (🥉22 · ⭐ 750 · ➕) - MiniSom is a minimalistic implementation of the Self.. ❗️CC-BY-3.0
impyute (🥉20 · ⭐ 260 · 💀) - Data imputations library to preprocess datasets with missing data. MIT
vecstack (🥉18 · ⭐ 570 · 💀) - Python package for stacking (machine learning technique). MIT
pandas-ml (🥉17 · ⭐ 260 · 💀) - pandas, scikit-learn, xgboost and seaborn integration. BSD-3
traingenerator (🥉9 · ⭐ 840 · 🐣) - A web app to generate template code for machine learning. MIT
Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:
Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI .
If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide .
For more information on how to add or update projects, please read the contribution guidelines . By participating in this project, you agree to abide by its Code of Conduct .