🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 880 awesome open-source projects with a total of 3M stars grouped into 33 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
🧙♂️ Discover other best-of lists or create your own.
📫 Subscribe to our newsletter for updates and trending projects.
- Machine Learning Frameworks 55 projects
- Data Visualization 50 projects
- Text Data & NLP 90 projects
- Image Data 56 projects
- Graph Data 32 projects
- Audio Data 27 projects
- Geospatial Data 21 projects
- Financial Data 23 projects
- Time Series Data 22 projects
- Medical Data 19 projects
- Tabular Data 3 projects
- Optical Character Recognition 11 projects
- Data Containers & Structures 29 projects
- Data Loading & Extraction 1 projects
- Web Scraping & Crawling 1 projects
- Data Pipelines & Streaming 41 projects
- Distributed Machine Learning 29 projects
- Hyperparameter Optimization & AutoML 47 projects
- Reinforcement Learning 21 projects
- Recommender Systems 15 projects
- Privacy Machine Learning 6 projects
- Workflow & Experiment Tracking 36 projects
- Model Serialization & Deployment 14 projects
- Model Interpretability 50 projects
- Vector Similarity Search (ANN) 12 projects
- Probabilistics & Statistics 23 projects
- Adversarial Robustness 9 projects
- GPU Utilities 18 projects
- Tensorflow Utilities 15 projects
- Sklearn Utilities 17 projects
- Pytorch Utilities 31 projects
- Database Clients 1 projects
- Others 57 projects
- 🥇🥈🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
- Tensorflow related project
- Sklearn related project
- PyTorch related project
- MxNet related project
- Apache Spark related project
- Jupyter related project
- PaddlePaddle related project
- Pandas related project
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇44 · ⭐ 160K) - An Open Source Machine Learning Framework for Everyone. Apache-2
-
GitHub (👨💻 3.7K · 🔀 85K · 📦 150K · 📋 32K - 11% open · ⏱️ 15.07.2021):
git clone https://github.com/tensorflow/tensorflow
-
PyPi (📥 10M / month · 📦 23K · ⏱️ 13.07.2021):
pip install tensorflow
-
Conda (📥 2.7M · ⏱️ 30.04.2021):
conda install -c conda-forge tensorflow
-
Docker Hub (📥 56M · ⭐ 1.9K · ⏱️ 15.07.2021):
docker pull tensorflow/tensorflow
scikit-learn (🥇38 · ⭐ 46K) - scikit-learn: machine learning in Python. BSD-3
XGBoost (🥇37 · ⭐ 21K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
LightGBM (🥇36 · ⭐ 13K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
pytorch-lightning (🥈34 · ⭐ 15K) - The lightweight PyTorch wrapper for high-performance.. Apache-2
Theano (🥈34 · ⭐ 9.4K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
StatsModels (🥈33 · ⭐ 6.4K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
Thinc (🥈32 · ⭐ 2.3K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
PaddlePaddle (🥈31 · ⭐ 16K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
Vowpal Wabbit (🥈30 · ⭐ 7.6K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
tensorpack (🥉29 · ⭐ 6K) - A Neural Net Training Interface on TensorFlow, with focus on.. Apache-2
Jina (🥉28 · ⭐ 7.6K) - Cloud-native neural search framework for kind of data. Apache-2
-
GitHub (👨💻 110 · 🔀 1K · 📦 110 · 📋 900 - 6% open · ⏱️ 15.07.2021):
git clone https://github.com/jina-ai/jina
-
PyPi (📥 30K / month · ⏱️ 15.07.2021):
pip install jina
-
Docker Hub (📥 740K · ⭐ 4 · ⏱️ 15.07.2021):
docker pull jinaai/jina
Flax (🥉27 · ⭐ 1.9K · 📉) - Flax is a neural network library for JAX that is designed for.. Apache-2
jax
Turi Create (🥉26 · ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
Neural Network Libraries (🥉25 · ⭐ 2.5K) - Neural Network Libraries. Apache-2
tensorflow-upstream (🥉25 · ⭐ 560) - TensorFlow ROCm port. Apache-2
SHOGUN (🥉22 · ⭐ 2.8K · 💤) - Unified and efficient Machine Learning. BSD-3
-
GitHub (👨💻 250 · 🔀 1K · 📋 1.5K - 29% open · ⏱️ 08.12.2020):
git clone https://github.com/shogun-toolbox/shogun
-
Conda (📥 100K · ⏱️ 25.06.2018):
conda install -c conda-forge shogun
-
Docker Hub (📥 1.5K · ⭐ 1 · ⏱️ 31.01.2019):
docker pull shogun/shogun
mace (🥉21 · ⭐ 4.4K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
-
GitHub (👨💻 63 · 🔀 770 · 📥 1.4K · 📋 650 - 6% open · ⏱️ 15.07.2021):
git clone https://github.com/XiaoMi/mace
Neural Tangents (🥉19 · ⭐ 1.5K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
ThunderSVM (🥉19 · ⭐ 1.3K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
Haiku (🥉19 · ⭐ 1.2K) - JAX-based neural network library. Apache-2
-
GitHub (👨💻 43 · 🔀 86 · 📦 120 · 📋 96 - 23% open · ⏱️ 15.07.2021):
git clone https://github.com/deepmind/dm-haiku
Torchbearer (🥉19 · ⭐ 600) - torchbearer: A model fitting library for PyTorch. MIT
NeoML (🥉16 · ⭐ 630) - Machine learning framework for both deep learning and traditional.. Apache-2
-
GitHub (👨💻 21 · 🔀 86 · 📋 51 - 60% open · ⏱️ 14.07.2021):
git clone https://github.com/neoml-lib/neoml
ThunderGBM (🥉15 · ⭐ 600) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Show 11 hidden projects...
- dlib (🥈32 · ⭐ 10K) - A toolkit for making real world machine learning and data analysis..
❗️BSL-1.0
- CNTK (🥉26 · ⭐ 17K · 💀) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit.
MIT
- NuPIC (🥉24 · ⭐ 6.3K · 💀) - Numenta Platform for Intelligent Computing is an implementation..
❗️AGPL-3.0
- Lasagne (🥉24 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano.
MIT
- xLearn (🥉24 · ⭐ 2.9K · 💀) - High performance, easy-to-use, and scalable machine learning (ML)..
Apache-2
- neon (🥉23 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best..
Apache-2
- NeuPy (🥉23 · ⭐ 690 · 💀) - NeuPy is a Tensorflow based python library for prototyping and building..
MIT
- MindsDB (🥉20 · ⭐ 3.8K) - Predictive AI layer for existing databases.
❗️GPL-3.0
- chefboost (🥉20 · ⭐ 260) - A Lightweight Decision Tree Framework supporting regular algorithms:..
MIT
- elegy (🥉17 · ⭐ 230) - Elegy is a framework-agnostic Trainer interface for the Jax..
Apache-2
jax
- StarSpace (🥉13 · ⭐ 3.6K · 💀) - Learning embeddings for classification, retrieval and ranking.
MIT
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇42 · ⭐ 14K) - matplotlib: plotting with Python. Python-2.0
Plotly (🥇36 · ⭐ 9.8K) - The interactive graphing library for Python (includes Plotly Express). MIT
-
GitHub (👨💻 180 · 🔀 1.8K · 📦 5 · 📋 2K - 45% open · ⏱️ 28.06.2021):
git clone https://github.com/plotly/plotly.py
-
PyPi (📥 5.7M / month · 📦 5K · ⏱️ 28.06.2021):
pip install plotly
-
Conda (📥 1.6M · ⏱️ 28.06.2021):
conda install -c conda-forge plotly
-
NPM (📥 45K / month · 📦 4 · ⏱️ 12.01.2021):
npm install plotlywidget
dash (🥇33 · ⭐ 15K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
pandas-profiling (🥈32 · ⭐ 7.6K) - Create HTML profiling reports from pandas DataFrame.. MIT
HoloViews (🥈29 · ⭐ 1.9K) - With Holoviews, your data visualizes itself. BSD-3
-
GitHub (👨💻 110 · 🔀 320 · 📋 2.6K - 28% open · ⏱️ 13.07.2021):
git clone https://github.com/holoviz/holoviews
-
PyPi (📥 130K / month · 📦 170 · ⏱️ 22.05.2021):
pip install holoviews
-
Conda (📥 530K · ⏱️ 23.05.2021):
conda install -c conda-forge holoviews
-
NPM (📥 5K / month · ⏱️ 24.05.2020):
npm install @pyviz/jupyterlab_pyviz
bqplot (🥈28 · ⭐ 3.1K) - Plotting library for IPython/Jupyter notebooks. Apache-2
-
GitHub (👨💻 53 · 🔀 410 · 📦 26 · 📋 540 - 35% open · ⏱️ 15.07.2021):
git clone https://github.com/bqplot/bqplot
-
PyPi (📥 64K / month · 📦 110 · ⏱️ 08.06.2021):
pip install bqplot
-
Conda (📥 680K · ⏱️ 08.06.2021):
conda install -c conda-forge bqplot
-
NPM (📥 12K / month · 📦 10 · ⏱️ 08.06.2021):
npm install bqplot
datashader (🥈28 · ⭐ 2.5K) - Quickly and accurately render even the largest data. BSD-3
data-validation (🥈28 · ⭐ 560) - Library for exploring and validating machine learning.. Apache-2
Perspective (🥈27 · ⭐ 3.4K) - Streaming pivot visualization via WebAssembly. Apache-2
Facets Overview (🥉26 · ⭐ 6.6K) - Visualizations for machine learning datasets. Apache-2
HyperTools (🥉26 · ⭐ 1.7K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
D-Tale (🥉25 · ⭐ 2.5K) - Visualizer for pandas data structures. ❗️LGPL-2.1
pythreejs (🥉25 · ⭐ 740) - A Jupyter - Three.js bridge. BSD-3
-
GitHub (👨💻 27 · 🔀 160 · 📦 17 · 📋 200 - 30% open · ⏱️ 26.02.2021):
git clone https://github.com/jupyter-widgets/pythreejs
-
PyPi (📥 36K / month · 📦 26 · ⏱️ 26.02.2021):
pip install pythreejs
-
Conda (📥 320K · ⏱️ 02.03.2021):
conda install -c conda-forge pythreejs
-
NPM (📥 7K / month · 📦 8 · ⏱️ 26.02.2021):
npm install jupyter-threejs
hvPlot (🥉25 · ⭐ 410) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
python-ternary (🥉24 · ⭐ 440) - Ternary plotting library for python with matplotlib. MIT
Chartify (🥉23 · ⭐ 2.9K) - Python library that makes it easy for data scientists to create.. Apache-2
Multicore-TSNE (🥉23 · ⭐ 1.6K · 💤) - Parallel t-SNE implementation with Python and Torch.. BSD-3
Pandas-Bokeh (🥉23 · ⭐ 700) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
Sweetviz (🥉21 · ⭐ 1.7K) - Visualize and compare datasets, target values and associations, with one.. MIT
AutoViz (🥉21 · ⭐ 390) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
animatplot (🥉18 · ⭐ 380 · 💤) - A python package for animating plots build on matplotlib. MIT
Show 8 hidden projects...
- plotnine (🥈28 · ⭐ 2.7K) - A grammar of graphics for Python.
❗️GPL-2.0
- cartopy (🥈27 · ⭐ 890) - Cartopy - a cartographic python library with matplotlib support.
❗️LGPL-3.0
- pivottablejs (🥉21 · ⭐ 440 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython..
MIT
- ivis (🥉20 · ⭐ 240) - Dimensionality reduction in very large datasets using Siamese..
Apache-2
- pdvega (🥉16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite.
MIT
- nx-altair (🥉16 · ⭐ 170 · 💀) - Draw interactive NetworkX graphs with Altair.
MIT
- data-describe (🥉15 · ⭐ 280) - datadescribe: Pythonic EDA Accelerator for Data Science.
Apache-2
- nptsne (🥉14 · ⭐ 25) - nptsne is a numpy compatible python binary package that offers a number..
Apache-2
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇36 · ⭐ 49K) - Transformers: State-of-the-art Natural Language.. Apache-2
gensim (🥇36 · ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
nltk (🥇35 · ⭐ 10K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
flair (🥇32 · ⭐ 11K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
ChatterBot (🥇31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
sentencepiece (🥇31 · ⭐ 5.2K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
TextBlob (🥈30 · ⭐ 7.7K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
sentence-transformers (🥈30 · ⭐ 5.6K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
snowballstemmer (🥈30 · ⭐ 500) - Snowball compiler and stemming algorithms. BSD-3
DeepPavlov (🥈28 · ⭐ 5.3K) - An open source library for deep learning end-to-end dialog.. Apache-2
Tokenizers (🥈28 · ⭐ 4.7K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
TensorFlow Text (🥈27 · ⭐ 770) - Making text a first-class citizen in TensorFlow. Apache-2
vaderSentiment (🥈26 · ⭐ 3.1K) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
haystack (🥈26 · ⭐ 2.1K) - End-to-end Python framework for building natural language search.. Apache-2
textgenrnn (🥈25 · ⭐ 4.5K · 💤) - Easily train your own text-generating neural network of any.. MIT
neuralcoref (🥈25 · ⭐ 2.3K) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
TextDistance (🥈25 · ⭐ 2K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
scattertext (🥈25 · ⭐ 1.6K) - Beautiful visualizations of how language differs among document.. Apache-2
spacy-transformers (🥈25 · ⭐ 980) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT
spacy
Ciphey (🥉24 · ⭐ 7.4K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
-
GitHub (👨💻 45 · 🔀 420 · 📋 260 - 22% open · ⏱️ 14.07.2021):
git clone https://github.com/Ciphey/Ciphey
-
PyPi (📥 8.3K / month · ⏱️ 06.06.2021):
pip install ciphey
-
Docker Hub (📥 11K · ⭐ 4 · ⏱️ 06.06.2021):
docker pull remnux/ciphey
fastNLP (🥉24 · ⭐ 2.2K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
pytorch-nlp (🥉24 · ⭐ 1.9K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
DeepMatcher (🥉23 · ⭐ 3.7K) - Python package for performing Entity and Text Matching using Deep.. BSD-3
PyTextRank (🥉23 · ⭐ 1.6K) - Python implementation of TextRank for phrase extraction and.. MIT
SciSpacy (🥉23 · ⭐ 960) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
english-words (🥉22 · ⭐ 5.3K · 💤) - A text file containing 479k English words for all your.. Unlicense
gpt-2-simple (🥉22 · ⭐ 2.7K) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT
Texar (🥉22 · ⭐ 2.2K · 💤) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
pySBD (🥉22 · ⭐ 330) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
Texthero (🥉21 · ⭐ 2.3K) - Text preprocessing, representation and visualization from zero to hero. MIT
NLP Architect (🥉20 · ⭐ 2.7K) - A model library for exploring state-of-the-art deep learning.. Apache-2
DELTA (🥉20 · ⭐ 1.4K · 💤) - DELTA is a deep learning based natural language and speech.. Apache-2
-
GitHub (👨💻 41 · 🔀 290 · 📋 76 - 6% open · ⏱️ 17.12.2020):
git clone https://github.com/Delta-ML/delta
-
PyPi (📥 27 / month · ⏱️ 27.03.2020):
pip install delta-nlp
-
Docker Hub (📥 13K · ⏱️ 14.07.2021):
docker pull zh794390558/delta
YouTokenToMe (🥉20 · ⭐ 750) - Unsupervised text tokenizer focused on computational efficiency. MIT
lightseq (🥉18 · ⭐ 1.3K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
nboost (🥉18 · ⭐ 580 · 💤) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
OpenNRE (🥉15 · ⭐ 3.2K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
-
GitHub (👨💻 9 · 🔀 860 · 📋 330 - 6% open · ⏱️ 31.05.2021):
git clone https://github.com/thunlp/OpenNRE
BLINK (🥉12 · ⭐ 670) - Entity Linker solution. MIT
-
GitHub (👨💻 16 · 🔀 110 · 📋 62 - 54% open · ⏱️ 02.04.2021):
git clone https://github.com/facebookresearch/BLINK
Show 19 hidden projects...
- fuzzywuzzy (🥇31 · ⭐ 8.3K) - Fuzzy String Matching in Python.
❗️GPL-2.0
- langid (🥈26 · ⭐ 1.8K · 💀) - Stand-alone language identification system.
BSD-3
- polyglot (🥈25 · ⭐ 1.9K · 💤) - Multilingual text (NLP) processing toolkit.
❗️GPL-3.0
- flashtext (🥉23 · ⭐ 4.9K · 💀) - Extract Keywords from sentence or Replace keywords in sentences.
MIT
- stop-words (🥉22 · ⭐ 130 · 💀) - Get list of common stop words in various languages in Python.
BSD-3
- NeuroNER (🥉19 · ⭐ 1.6K · 💀) - Named-entity recognition using neural networks. Easy-to-use and..
MIT
- pyfasttext (🥉19 · ⭐ 230 · 💀) - Yet another Python binding for fastText.
❗️GPL-3.0
- textpipe (🥉18 · ⭐ 290) - Textpipe: clean and extract metadata from text.
MIT
- textaugment (🥉18 · ⭐ 150) - TextAugment: Text Augmentation Library.
MIT
- TextBox (🥉16 · ⭐ 280) - TextBox is an open-source library for building text generation system.
MIT
- Headliner (🥉16 · ⭐ 230 · 💀) - Easy training and deployment of seq2seq models.
MIT
- skift (🥉16 · ⭐ 220) - scikit-learn wrappers for Python fastText.
MIT
- TransferNLP (🥉15 · ⭐ 290 · 💀) - NLP library designed for reproducible experimentation..
MIT
- NeuralQA (🥉15 · ⭐ 200 · 💤) - NeuralQA: A Usable Library for Question Answering on Large Datasets..
MIT
- ONNX-T5 (🥉15 · ⭐ 170) - Summarization, translation, sentiment-analysis, text-generation and..
Apache-2
- textvec (🥉14 · ⭐ 170 · 💤) - Text vectorization tool to outperform TFIDF for classification..
MIT
- fastT5 (🥉13 · ⭐ 180 · 🐣) - boost inference speed of T5 models by 5x & reduce the model size..
Apache-2
- numerizer (🥉13 · ⭐ 120) - A Python module to convert natural language numerics into ints and..
MIT
- spacy-dbpedia-spotlight (🥉12 · ⭐ 37) - A spaCy wrapper for DBpedia Spotlight.
MIT
spacy
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
torchvision (🥇36 · ⭐ 9.4K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
scikit-image (🥇33 · ⭐ 4.4K) - Image processing in Python. BSD-2
Albumentations (🥇31 · ⭐ 8.4K) - Fast image augmentation library and an easy-to-use wrapper.. MIT
opencv-python (🥇31 · ⭐ 2.1K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Face Recognition (🥈29 · ⭐ 41K · 📉) - The world's simplest facial recognition api for.. MIT
detectron2 (🥈29 · ⭐ 17K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2
PyTorch Image Models (🥈29 · ⭐ 12K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
-
GitHub (👨💻 52 · 🔀 1.8K · 📥 480K · 📦 790 · 📋 350 - 11% open · ⏱️ 13.07.2021):
git clone https://github.com/rwightman/pytorch-image-models
InsightFace (🥈29 · ⭐ 9.6K) - Face Analysis Project on PyTorch and MXNet. MIT
imageai (🥈27 · ⭐ 6.3K) - A python library built to empower developers to build applications and.. MIT
MMDetection (🥈26 · ⭐ 16K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
-
GitHub (👨💻 240 · 🔀 5.5K · 📦 77 · 📋 4.1K - 8% open · ⏱️ 13.07.2021):
git clone https://github.com/open-mmlab/mmdetection
facenet-pytorch (🥈26 · ⭐ 2.2K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
Face Alignment (🥉24 · ⭐ 5.1K) - 2D and 3D Face alignment library build using pytorch. BSD-3
CellProfiler (🥉24 · ⭐ 590) - An open-source application for biological image analysis. BSD-3
Image Super-Resolution (🥉23 · ⭐ 2.9K) - Super-scale your images and run experiments with.. Apache-2
-
GitHub (👨💻 10 · 🔀 550 · 📦 58 · 📋 180 - 41% open · ⏱️ 02.06.2021):
git clone https://github.com/idealo/image-super-resolution
-
PyPi (📥 5K / month · 📦 8 · ⏱️ 08.01.2020):
pip install ISR
-
Docker Hub (📥 150 · ⏱️ 01.04.2019):
docker pull idealo/image-super-resolution-gpu
Torch Points 3D (🥉23 · ⭐ 1.4K) - Pytorch framework for doing deep learning on point clouds. BSD-3
Image Deduplicator (🥉22 · ⭐ 3.7K · 💤) - Finding duplicate images made easy!. Apache-2
tensorflow-graphics (🥉22 · ⭐ 2.5K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
layout-parser (🥉22 · ⭐ 2.2K) - A unified toolkit for Deep Learning Based Document Image.. Apache-2
vidgear (🥉22 · ⭐ 1.8K) - High-performance cross-platform Video Processing Python framework.. Apache-2
Classy Vision (🥉22 · ⭐ 1.3K) - An end-to-end PyTorch framework for image and video.. MIT
vit-pytorch (🥉21 · ⭐ 5K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
image-match (🥉19 · ⭐ 2.6K) - Quickly search over billions of images. Apache-2
Norfair (🥉19 · ⭐ 1.1K) - Lightweight Python library for adding real-time 2D object tracking to.. BSD-3
Caer (🥉19 · ⭐ 520 · 📉) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
PaddleDetection (🥉18 · ⭐ 4.3K) - Object detection and instance segmentation toolkit.. Apache-2
-
GitHub (👨💻 63 · 🔀 1.1K · 📦 5 · 📋 2.1K - 21% open · ⏱️ 14.07.2021):
git clone https://github.com/PaddlePaddle/PaddleDetection
pytorchvideo (🥉18 · ⭐ 1.7K · 🐣) - A deep learning library for video understanding.. Apache-2
pycls (🥉17 · ⭐ 1.6K) - Codebase for Image Classification Research, written in PyTorch. MIT
-
GitHub (👨💻 13 · 🔀 180 · 📦 3 · 📋 67 - 25% open · ⏱️ 09.07.2021):
git clone https://github.com/facebookresearch/pycls
DE⫶TR (🥉16 · ⭐ 7.2K) - End-to-End Object Detection with Transformers. Apache-2
-
GitHub (👨💻 21 · 🔀 1.2K · 📋 350 - 27% open · ⏱️ 30.06.2021):
git clone https://github.com/facebookresearch/detr
PySlowFast (🥉16 · ⭐ 3.9K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
-
GitHub (👨💻 24 · 🔀 760 · 📦 4 · 📋 430 - 48% open · ⏱️ 08.07.2021):
git clone https://github.com/facebookresearch/SlowFast
Show 8 hidden projects...
- imgaug (🥇32 · ⭐ 11K · 💀) - Image augmentation for machine learning experiments.
MIT
- glfw (🥈30 · ⭐ 7.8K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input.
❗️Zlib
- Augmentor (🥉25 · ⭐ 4.5K · 💀) - Image augmentation library in Python for machine learning.
MIT
- chainercv (🥉25 · ⭐ 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision.
MIT
- Pillow-SIMD (🥉24 · ⭐ 1.6K · 💀) - The friendly PIL fork.
❗️PIL
- segmentation_models (🥉23 · ⭐ 3.3K · 💀) - Segmentation models with pretrained backbones. Keras..
MIT
- Luminoth (🥉22 · ⭐ 2.4K · 💀) - Deep Learning toolkit for Computer Vision.
BSD-3
- solt (🥉17 · ⭐ 250 · 💀) - Streaming over lightweight data transformations.
MIT
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (🥇29 · ⭐ 12K) - Geometric Deep Learning Extension Library for PyTorch. MIT
dgl (🥇28 · ⭐ 7.6K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
StellarGraph (🥈26 · ⭐ 2K) - StellarGraph - Machine Learning on Graphs. Apache-2
AmpliGraph (🥈23 · ⭐ 1.6K) - Python library for Representation Learning on Knowledge.. Apache-2
pygraphistry (🥈23 · ⭐ 1.4K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
PyTorch-BigGraph (🥈22 · ⭐ 2.8K) - Generate embeddings from large-scale graph-structured.. BSD-3
torch-cluster (🥈22 · ⭐ 390) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
Paddle Graph Learning (🥉19 · ⭐ 1.1K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
pytorch_geometric_temporal (🥉19 · ⭐ 870) - A Temporal Extension Library for PyTorch Geometric. MIT
graph-nets (🥉18 · ⭐ 4.9K · 💤) - Build Graph Nets in Tensorflow. Apache-2
GraphEmbedding (🥉16 · ⭐ 2.1K · 💤) - Implementation and experiments of graph embedding.. MIT
-
GitHub (👨💻 8 · 🔀 640 · 📦 12 · 📋 49 - 71% open · ⏱️ 18.10.2020):
git clone https://github.com/shenweichen/GraphEmbedding
OpenKE (🥉13 · ⭐ 2.6K) - An Open-Source Package for Knowledge Embedding (KE). MIT
-
GitHub (👨💻 10 · 🔀 800 · 📋 300 - 22% open · ⏱️ 06.04.2021):
git clone https://github.com/thunlp/OpenKE
GraphVite (🥉12 · ⭐ 920) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
Show 10 hidden projects...
- igraph (🥇28 · ⭐ 840) - Python interface for igraph.
❗️GPL-2.0
- pygal (🥈26 · ⭐ 2.4K) - PYthon svg GrAph plotting Library.
❗️LGPL-3.0
- Karate Club (🥈23 · ⭐ 1.3K) - Karate Club: An API Oriented Open-source Python Framework for..
❗️GPL-3.0
- DeepWalk (🥉21 · ⭐ 2.3K · 💀) - DeepWalk - Deep Learning for Graphs.
❗️GPL-3.0
- DIG (🥉17 · ⭐ 780) - A library for graph deep learning research.
❗️GPL-3.0
- Sematch (🥉17 · ⭐ 360 · 💀) - semantic similarity framework for knowledge graph.
Apache-2
- DeepGraph (🥉17 · ⭐ 240) - Analyze Data with Pandas-based Networks. Documentation:.
BSD-3
- pyRDF2Vec (🥉17 · ⭐ 120) - Python Implementation and Extension of RDF2Vec.
MIT
- GraphSAGE (🥉14 · ⭐ 2.4K · 💀) - Representation learning on large graphs using stochastic..
MIT
- OpenNE (🥉13 · ⭐ 1.5K · 💀) - An Open-Source Package for Network Embedding (NE).
MIT
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
DeepSpeech (🥇30 · ⭐ 18K · 📈) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0
torchaudio (🥇30 · ⭐ 1.4K) - Data manipulation and transformation for audio signal.. BSD-2
pyAudioAnalysis (🥈27 · ⭐ 4K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
python-soundfile (🥈25 · ⭐ 390 · 💤) - SoundFile is an audio library based on libsndfile, CFFI,.. BSD-3
python_speech_features (🥉24 · ⭐ 1.9K · 💤) - This library provides common speech features for ASR.. MIT
speechbrain (🥉22 · ⭐ 2.6K) - A PyTorch-based Speech Toolkit. Apache-2
audiomentations (🥉22 · ⭐ 620) - A Python library for audio data augmentation. Inspired by.. MIT
tinytag (🥉22 · ⭐ 470) - Read music meta data and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and.. MIT
TTS (🥉19 · ⭐ 4.9K) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
-
GitHub (👨💻 56 · 🔀 800 · 📥 820 · 📋 510 - 3% open · ⏱️ 12.02.2021):
git clone https://github.com/mozilla/TTS
Show 7 hidden projects...
- SpeechRecognition (🥇30 · ⭐ 5.7K · 💀) - Speech recognition module for Python, supporting..
BSD-3
- aubio (🥈26 · ⭐ 2.2K) - a library for audio and music analysis.
❗️GPL-3.0
- Essentia (🥉24 · ⭐ 1.9K) - C++ library for audio and music analysis, description and..
❗️AGPL-3.0
- Madmom (🥉22 · ⭐ 770 · 💀) - Python audio and music signal processing library.
BSD-3
- Dejavu (🥉21 · ⭐ 5.5K · 💀) - Audio fingerprinting and recognition in Python.
MIT
- Muda (🥉19 · ⭐ 190) - A library for augmenting annotated audio data.
ISC
- Julius (🥉17 · ⭐ 200) - Fast PyTorch based DSP for audio and 1D signals.
MIT
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇34 · ⭐ 8.8K) - WebGL2 powered geospatial visualization layers. MIT
-
GitHub (👨💻 170 · 🔀 1.6K · 📦 1.6K · 📋 2.2K - 4% open · ⏱️ 13.07.2021):
git clone https://github.com/visgl/deck.gl
-
PyPi (📥 350K / month · 📦 2 · ⏱️ 13.04.2021):
pip install pydeck
-
Conda (📥 41K · ⏱️ 13.04.2021):
conda install -c conda-forge pydeck
-
NPM (📥 210K / month · 📦 560 · ⏱️ 06.07.2021):
npm install deck.gl
ipyleaflet (🥉29 · ⭐ 1.2K) - A Jupyter - Leaflet.js bridge. MIT
-
GitHub (👨💻 68 · 🔀 300 · 📦 940 · 📋 430 - 38% open · ⏱️ 15.07.2021):
git clone https://github.com/jupyter-widgets/ipyleaflet
-
PyPi (📥 48K / month · 📦 98 · ⏱️ 17.06.2021):
pip install ipyleaflet
-
Conda (📥 720K · ⏱️ 17.06.2021):
conda install -c conda-forge ipyleaflet
-
NPM (📥 25K / month · 📦 2 · ⏱️ 17.06.2021):
npm install jupyter-leaflet
ArcGIS API (🥉24 · ⭐ 1.1K) - Documentation and samples for ArcGIS API for Python. Apache-2
-
GitHub (👨💻 70 · 🔀 780 · 📋 370 - 34% open · ⏱️ 06.07.2021):
git clone https://github.com/Esri/arcgis-python-api
-
PyPi (📥 36K / month · 📦 20 · ⏱️ 08.07.2021):
pip install arcgis
-
Docker Hub (📥 4.4K · ⭐ 33 · ⏱️ 06.03.2020):
docker pull esridocker/arcgis-api-python-notebook
Show 7 hidden projects...
- Geocoder (🥈30 · ⭐ 1.4K · 💀) - Python Geocoder.
MIT
- Sentinelsat (🥉23 · ⭐ 640) - Search and download Copernicus Sentinel satellite images.
❗️GPL-3.0
- gmaps (🥉22 · ⭐ 720 · 💀) - Google maps for Jupyter notebooks.
BSD-3
- geoplotlib (🥉21 · ⭐ 920 · 💀) - python toolbox for visualizing geographical data and making maps.
MIT
- Satpy (🥉21 · ⭐ 730) - Python package for earth-observing satellite data processing.
❗️GPL-3.0
- EarthPy (🥉20 · ⭐ 270) - A package built to support working with spatial data using open source..
BSD-3
- pymap3d (🥉19 · ⭐ 200) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef..
BSD-2
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
yfinance (🥇29 · ⭐ 5.3K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
Alpha Vantage (🥈26 · ⭐ 3.4K) - A python wrapper for Alpha Vantage API for financial data. MIT
empyrical (🥈25 · ⭐ 810 · 💤) - Common financial risk and performance metrics. Used by zipline.. Apache-2
TensorTrade (🥉23 · ⭐ 3.3K · 📉) - An open source reinforcement learning framework for.. Apache-2
Enigma Catalyst (🥉23 · ⭐ 2.2K · 💤) - An Algorithmic Trading Library for Crypto-Assets in.. Apache-2
stockstats (🥉23 · ⭐ 810 · 💤) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
finmarketpy (🥉21 · ⭐ 2.6K) - Python library for backtesting trading strategies & analyzing.. Apache-2
tf-quant-finance (🥉20 · ⭐ 2.7K) - High-performance TensorFlow library for quantitative.. Apache-2
Crypto Signals (🥉19 · ⭐ 3.2K) - Github.com/CryptoSignal - #1 Quant Trading & Technical Analysis.. MIT
-
GitHub (👨💻 28 · 🔀 840 · 📋 240 - 17% open · ⏱️ 28.06.2021):
git clone https://github.com/CryptoSignal/crypto-signal
-
Docker Hub (📥 140K · ⭐ 7 · ⏱️ 03.09.2020):
docker pull shadowreaver/crypto-signal
Show 7 hidden projects...
- backtrader (🥈26 · ⭐ 6.9K) - Python Backtesting library for trading strategies.
❗️GPL-3.0
- Alphalens (🥉24 · ⭐ 2K · 💀) - Performance analysis of predictive (alpha) stock factors.
Apache-2
- PyAlgoTrade (🥉23 · ⭐ 3.4K · 💀) - Python Algorithmic Trading Library.
Apache-2
- FinTA (🥉23 · ⭐ 1.2K) - Common financial technical indicators implemented in Pandas.
❗️LGPL-3.0
- arch (🥉23 · ⭐ 750) - ARCH models in Python.
❗️NCSA
- Backtesting.py (🥉18 · ⭐ 1.5K) - Backtest trading strategies in Python.
❗️AGPL-3.0
- surpriver (🥉12 · ⭐ 1.3K · 💤) - Find big moving stocks before they move using machine..
❗️GPL-3.0
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇29 · ⭐ 13K) - Tool for producing high quality forecasts for time series data that has.. MIT
pmdarima (🥇28 · ⭐ 930) - A statistical library designed to fill the void in Python's time series.. MIT
STUMPY (🥈22 · ⭐ 1.8K) - STUMPY is a powerful and scalable Python library for computing a Matrix.. BSD-3
Darts (🥈22 · ⭐ 1.2K) - A python library for easy manipulation and forecasting of time series. Apache-2
-
GitHub (👨💻 32 · 🔀 140 · 📦 6 · 📋 110 - 20% open · ⏱️ 09.07.2021):
git clone https://github.com/unit8co/darts
-
PyPi (📥 4.3K / month · ⏱️ 09.07.2021):
pip install u8darts
-
Docker Hub (📥 130 · ⏱️ 22.05.2021):
docker pull unit8/darts
pytorch-forecasting (🥉21 · ⭐ 1.2K) - Time series forecasting with PyTorch. MIT
Show 8 hidden projects...
- PyFlux (🥈23 · ⭐ 1.9K · 💀) - Open source time series library for Python.
BSD-3
- luminol (🥉21 · ⭐ 940 · 💀) - Anomaly Detection and Correlation library.
Apache-2
- pydlm (🥉20 · ⭐ 400 · 💀) - A python library for Bayesian time series modeling.
BSD-3
- matrixprofile-ts (🥉19 · ⭐ 640 · 💀) - A Python library for detecting patterns and anomalies..
Apache-2
- Auto TS (🥉19 · ⭐ 260) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost..
Apache-2
- ADTK (🥉18 · ⭐ 680 · 💀) - A Python toolkit for rule-based/unsupervised anomaly detection in time..
MPL-2.0
- tick (🥉18 · ⭐ 350 · 💀) - Module for statistical learning, with a particular emphasis on time-..
BSD-3
- tsaug (🥉14 · ⭐ 200 · 💀) - A Python package for time series augmentation.
Apache-2
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (🥈27 · ⭐ 1.6K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
DeepVariant (🥉20 · ⭐ 2.3K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
MedicalTorch (🥉15 · ⭐ 740) - A medical imaging framework for Pytorch. Apache-2
Medical Detection Toolkit (🥉13 · ⭐ 1K) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
-
GitHub (👨💻 3 · 🔀 260 · 📋 120 - 30% open · ⏱️ 31.05.2021):
git clone https://github.com/MIC-DKFZ/medicaldetectiontoolkit
MedicalNet (🥉12 · ⭐ 1.1K · 💤) - Many studies have shown that the performance on deep learning is.. MIT
-
GitHub (👨💻 1 · 🔀 320 · 📋 61 - 77% open · ⏱️ 27.08.2020):
git clone https://github.com/Tencent/MedicalNet
Show 6 hidden projects...
- NiftyNet (🥉22 · ⭐ 1.3K · 💀) - [unmaintained] An open-source convolutional neural..
Apache-2
- MedPy (🥉21 · ⭐ 360 · 💀) - Medical image processing in Python.
❗️GPL-3.0
- DLTK (🥉20 · ⭐ 1.3K · 💀) - Deep Learning Toolkit for Medical Image Analysis.
Apache-2
- Brainiak (🥉20 · ⭐ 250) - Brain Imaging Analysis Kit.
Apache-2
- Glow (🥉20 · ⭐ 170) - An open-source toolkit for large-scale genomic analysis.
Apache-2
- DeepNeuro (🥉14 · ⭐ 100 · 💀) - A deep learning python package for neuroimaging data. Made by:.
MIT
Libraries for processing tabular and structured data.
carefree-learn (🥇17 · ⭐ 340) - Tabular Datasets PyTorch. MIT
pytorch_tabular (🥉15 · ⭐ 360) - A standard framework for modelling Deep Learning Models.. MIT
Libraries for optical character recognition (OCR) and text extraction from images or videos.
Tesseract (🥇30 · ⭐ 3.7K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
EasyOCR (🥇29 · ⭐ 12K) - Ready-to-use OCR with 80+ supported languages and all popular writing.. Apache-2
OCRmyPDF (🥈26 · ⭐ 4.6K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
attention-ocr (🥉21 · ⭐ 870 · 💤) - A Tensorflow model for text recognition (CNN + seq2seq.. MIT
Show 2 hidden projects...
- pdftabextract (🥉19 · ⭐ 1.9K · 💀) - A set of tools for extracting tables from PDF files..
Apache-2
- Mozart (🥉11 · ⭐ 280) - An optical music recognition (OMR) system. Converts sheet music..
Apache-2
General-purpose data containers & structures as well as utilities & extensions for pandas.
h5py (🥈36 · ⭐ 1.5K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5.. BSD-3
numexpr (🥈31 · ⭐ 1.6K) - Fast numerical array expression evaluator for Python, NumPy, PyTables,.. MIT
Bottleneck (🥈30 · ⭐ 630) - Fast NumPy array functions written in C. BSD-2
Modin (🥈29 · ⭐ 6.2K) - Modin: Speed up your Pandas workflows by changing a single line of.. Apache-2
datasketch (🥉26 · ⭐ 1.5K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog,.. MIT
Vaex (🥉25 · ⭐ 6.4K) - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and.. MIT
Pandaral·lel (🥉25 · ⭐ 1.6K) - A simple and efficient tool to parallelize Pandas.. BSD-3
Arctic (🥉23 · ⭐ 2.3K) - Arctic is a high performance datastore for numeric data. ❗️LGPL-2.1
Show 7 hidden projects...
- Blaze (🥈28 · ⭐ 3K · 💀) - NumPy and Pandas interface to Big Data.
BSD-3
- sklearn-pandas (🥈28 · ⭐ 2.5K) - Pandas integration with sklearn.
❗️Zlib
- pandasql (🥉24 · ⭐ 1K · 💀) - sqldf for pandas.
MIT
- pickleDB (🥉21 · ⭐ 570 · 💀) - pickleDB is an open source key-value store using Python's json..
BSD-3
- Pandas Summary (🥉21 · ⭐ 360 · 💀) - An extension to pandas dataframes describe function.
MIT
- StaticFrame (🥉21 · ⭐ 230) - Immutable and grow-only Pandas-like DataFrames with a more explicit..
MIT
- fletcher (🥉18 · ⭐ 210) - Pandas ExtensionDType/Array backed by Apache Arrow.
MIT
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
🔗 best-of-python - Data Extraction ( ⭐ 1.7K · 🐣) - Collection of data-loading and -extraction libraries.
Libraries for web scraping, crawling, downloading, and mining as well as libraries.
🔗 best-of-web-python - Web Scraping ( ⭐ 1.3K · 🐣) - Collection of web-scraping and crawling libraries.
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
Celery (🥇40 · ⭐ 18K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
luigi (🥇34 · ⭐ 15K) - Luigi is a Python module that helps you build complex pipelines of batch.. Apache-2
Airflow (🥇33 · ⭐ 23K · 📈) - Platform to programmatically author, schedule, and monitor.. Apache-2
-
GitHub (👨💻 1.9K · 🔀 8.8K · 📥 150K · 📋 3.8K - 26% open · ⏱️ 15.07.2021):
git clone https://github.com/apache/airflow
-
PyPi (📥 3M / month · 📦 290 · ⏱️ 14.07.2021):
pip install apache-airflow
-
Conda (📥 360K · ⏱️ 04.07.2021):
conda install -c conda-forge airflow
-
Docker Hub (📥 30M · ⭐ 270 · ⏱️ 15.07.2021):
docker pull apache/airflow
dbt (🥈31 · ⭐ 3.2K) - dbt (data build tool) enables data analysts and engineers to transform.. Apache-2
Kedro (🥈29 · ⭐ 4.1K) - A Python framework for creating reproducible, maintainable and modular.. Apache-2
PyFunctional (🥈26 · ⭐ 1.9K) - Python library for creating data pipelines with chain functional.. MIT
Activeloop (🥉25 · ⭐ 3.3K) - Fastest dataset optimization and management for machine and deep.. MPL-2.0
Great Expectations (🥉24 · ⭐ 4.7K · 📉) - Always know what to expect from your data. Apache-2
streamparse (🥉24 · ⭐ 1.4K · 💤) - Run Python in Apache Storm topologies. Pythonic API, CLI.. Apache-2
ploomber (🥉21 · ⭐ 330) - Write maintainable, production-ready pipelines using Jupyter or your.. Apache-2
mrq (🥉20 · ⭐ 840 · 💤) - Mr. Queue - A distributed worker task queue in Python using Redis & gevent. MIT
Databolt Flow (🥉19 · ⭐ 920) - Python library for building highly effective data science workflows. MIT
spark-deep-learning (🥉18 · ⭐ 1.9K) - Deep Learning Pipelines for Apache Spark. Apache-2
-
GitHub (👨💻 15 · 🔀 450 · 📦 17 · 📋 100 - 73% open · ⏱️ 20.01.2021):
git clone https://github.com/databricks/spark-deep-learning
Mara Pipelines (🥉17 · ⭐ 1.7K) - A lightweight opinionated ETL framework, halfway between plain.. MIT
Show 6 hidden projects...
- dbnd (🥉24 · ⭐ 190) - DBND is an agile pipeline framework that helps data engineering teams..
Apache-2
- pysparkling (🥉22 · ⭐ 240) - A pure Python implementation of Apache Spark's RDD and DStream..
MIT
- BatchFlow (🥉19 · ⭐ 160) - BatchFlow helps you conveniently work with random or sequential..
Apache-2
- flupy (🥉19 · ⭐ 160) - Fluent data pipelines for python and your shell.
MIT
- bodywork-core (🥉17 · ⭐ 260) - MLOps tool for deploying machine learning projects to..
❗️AGPL-3.0
- Botflow (🥉16 · ⭐ 1.2K · 💀) - Python Fast Dataflow programming framework for Data pipeline work(..
BSD-3
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
dask.distributed (🥇32 · ⭐ 1.2K) - A distributed task scheduler for Dask. BSD-3
horovod (🥈29 · ⭐ 11K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
DeepSpeed (🥈26 · ⭐ 5.2K) - DeepSpeed is a deep learning optimization library that makes.. MIT
-
GitHub (👨💻 58 · 🔀 530 · 📦 56 · 📋 570 - 50% open · ⏱️ 14.07.2021):
git clone https://github.com/microsoft/DeepSpeed
-
PyPi (📥 54K / month · ⏱️ 12.07.2021):
pip install deepspeed
-
Docker Hub (📥 8.9K · ⭐ 2 · ⏱️ 05.05.2021):
docker pull deepspeed/deepspeed
ipyparallel (🥈26 · ⭐ 2K) - Interactive Parallel Computing in Python. BSD-3
petastorm (🥈26 · ⭐ 1.2K) - Petastorm library enables single machine or distributed training.. Apache-2
BigDL (🥈25 · ⭐ 3.7K) - BigDL: Distributed Deep Learning Framework for Apache Spark. Apache-2
-
GitHub (👨💻 74 · 🔀 920 · 📦 26 · 📋 920 - 20% open · ⏱️ 12.07.2021):
git clone https://github.com/intel-analytics/BigDL
-
PyPi (📥 3.4K / month · 📦 6 · ⏱️ 09.07.2021):
pip install bigdl
-
Maven (⏱️ 20.04.2021):
<dependency> <groupId>com.intel.analytics.bigdl</groupId> <artifactId>bigdl-SPARK_2.4</artifactId> <version>[VERSION]</version> </dependency>
TensorFlowOnSpark (🥈25 · ⭐ 3.7K) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
analytics-zoo (🥉23 · ⭐ 2.3K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
BytePS (🥉19 · ⭐ 2.9K) - A high performance and generic framework for distributed DNN training. Apache-2
-
GitHub (👨💻 19 · 🔀 400 · 📋 240 - 37% open · ⏱️ 26.06.2021):
git clone https://github.com/bytedance/byteps
-
PyPi (📥 270 / month · ⏱️ 04.11.2020):
pip install byteps
-
Docker Hub (📥 1.1K · ⏱️ 03.03.2020):
docker pull bytepsimage/tensorflow
Apache Singa (🥉19 · ⭐ 2.3K) - a distributed deep learning platform. Apache-2
-
GitHub (👨💻 76 · 🔀 640 · 📦 1 · 📋 78 - 51% open · ⏱️ 11.07.2021):
git clone https://github.com/apache/singa
-
Conda (📥 270 · ⏱️ 11.07.2021):
conda install -c nusdbsystem singa
-
Docker Hub (📥 170 · ⭐ 2 · ⏱️ 04.06.2019):
docker pull apache/singa
Show 7 hidden projects...
- DEAP (🥈28 · ⭐ 4.3K) - Distributed Evolutionary Algorithms in Python.
❗️LGPL-3.0
- TensorFrames (🥉20 · ⭐ 760 · 💀) - [DEPRECATED] Tensorflow wrapper for DataFrames on..
Apache-2
- sk-dist (🥉19 · ⭐ 270) - Distributed scikit-learn meta-estimators in PySpark.
Apache-2
- somoclu (🥉19 · ⭐ 230) - Massively parallel self-organizing maps: accelerate training on multicore..
MIT
- launchpad (🥉14 · ⭐ 200 · 🐣) - Launchpad is a library that simplifies writing..
Apache-2
- autodist (🥉12 · ⭐ 100) - Simple Distributed Deep Learning on TensorFlow.
Apache-2
- LazyCluster (🥉12 · ⭐ 40 · 💤) - Distributed machine learning made simple.
Apache-2
Libraries for hyperparameter optimization, automl and neural architecture search.
scikit-optimize (🥇30 · ⭐ 2.1K) - Sequential model-based optimization with a `scipy.optimize`.. BSD-3
Bayesian Optimization (🥇29 · ⭐ 5.2K · 💤) - A Python implementation of global optimization with.. MIT
Keras Tuner (🥇29 · ⭐ 2.3K) - Hyperparameter tuning for humans. Apache-2
auto-sklearn (🥈28 · ⭐ 5.5K) - Automated Machine Learning with scikit-learn. BSD-3
featuretools (🥈27 · ⭐ 5.6K) - An open source python library for automated feature engineering. BSD-3
mljar-supervised (🥈23 · ⭐ 1.4K) - Automated Machine Learning Pipeline with Feature Engineering.. MIT
lazypredict (🥉22 · ⭐ 380) - Lazy Predict help build a lot of basic models without much code.. MIT
Neuraxle (🥉21 · ⭐ 430) - A Sklearn-like Framework for Hyperparameter Tuning and AutoML in.. Apache-2
AlphaPy (🥉17 · ⭐ 610) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
HyperparameterHunter (🥉15 · ⭐ 660) - Easy hyperparameter optimization and automatic result.. MIT
model_search (🥉11 · ⭐ 3.1K · 🐣) - AutoML algorithms for model architecture search at scale. Apache-2
-
GitHub (👨💻 1 · 🔀 300 · 📋 47 - 72% open · ⏱️ 17.03.2021):
git clone https://github.com/google/model_search
Devol (🥉11 · ⭐ 930 · 💤) - Genetic neural architecture search with Keras. MIT
-
GitHub (👨💻 18 · 🔀 110 · 📋 27 - 25% open · ⏱️ 05.07.2020):
git clone https://github.com/joeddav/devol
Show 21 hidden projects...
- TPOT (🥇29 · ⭐ 8.1K) - A Python Automated Machine Learning tool that optimizes machine..
❗️LGPL-3.0
- Orion (🥈24 · ⭐ 190) - Asynchronous Distributed Hyperparameter Optimization.
BSD-3
- MLBox (🥉22 · ⭐ 1.2K · 💤) - MLBox is a powerful Automated Machine Learning python library.
❗️BSD-1-Clause
- optunity (🥉21 · ⭐ 370 · 💀) - optimization routines for hyperparameter tuning.
BSD-3
- Hyperactive (🥉21 · ⭐ 280) - A hyperparameter optimization and data collection toolbox for..
MIT
- Auto ViML (🥉21 · ⭐ 270) - Automatically Build Multiple ML Models with a Single Line of Code...
Apache-2
- auto_ml (🥉20 · ⭐ 1.5K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production.
MIT
- HpBandSter (🥉20 · ⭐ 480 · 💀) - a distributed Hyperband implementation on Steroids.
BSD-3
- Test Tube (🥉19 · ⭐ 690 · 💀) - Python library to easily log experiments and parallelize..
MIT
- sklearn-deap (🥉18 · ⭐ 640 · 💀) - Use evolutionary algorithms instead of gridsearch in..
MIT
- Sherpa (🥉18 · ⭐ 300 · 💤) - Hyperparameter optimization that enables researchers to..
❗️GPL-3.0
- Advisor (🥉17 · ⭐ 1.4K · 💀) - Open-source implementation of Google Vizier for hyper parameters..
Apache-2
- automl-gs (🥉16 · ⭐ 1.7K · 💀) - Provide an input CSV and a target field to predict, generate a..
MIT
- Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated..
Apache-2
- Auto Tune Models (🥉16 · ⭐ 510 · 💀) - Auto Tune Models - A multi-tenant, multi-data system for..
MIT
- Parfit (🥉16 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of..
MIT
- Auptimizer (🥉15 · ⭐ 170) - An automatic ML model optimization tool.
❗️GPL-3.0
- Hypermax (🥉14 · ⭐ 96 · 💤) - Better, faster hyper-parameter optimization.
BSD-3
- ENAS (🥉13 · ⭐ 2.5K · 💀) - PyTorch implementation of Efficient Neural Architecture Search via..
Apache-2
- featurewiz (🥉13 · ⭐ 67) - Use advanced feature engineering strategies and select the best..
Apache-2
- Hypertunity (🥉11 · ⭐ 120 · 💀) - A toolset for black-box hyperparameter optimisation.
Apache-2
Libraries for building and evaluating reinforcement learning & agent-based systems.
OpenAI Gym (🥇36 · ⭐ 25K) - A toolkit for developing and comparing reinforcement learning.. MIT
TensorLayer (🥈26 · ⭐ 6.7K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
Stable Baselines (🥈26 · ⭐ 3.2K) - A fork of OpenAI Baselines, implementations of reinforcement.. MIT
TensorForce (🥈26 · ⭐ 3K) - Tensorforce: a TensorFlow library for applied reinforcement.. Apache-2
FinRL (🥉22 · ⭐ 2.3K) - A Deep Reinforcement Learning Library for Automated Trading in Quantitative.. MIT
PARL (🥉21 · ⭐ 2.1K) - A high-performance distributed training framework for Reinforcement.. Apache-2
ReAgent (🥉17 · ⭐ 3K) - A platform for Reasoning systems (Reinforcement Learning, Contextual.. BSD-3
-
GitHub (👨💻 100 · 🔀 430 · 📋 120 - 35% open · ⏱️ 13.07.2021):
git clone https://github.com/facebookresearch/ReAgent
Show 5 hidden projects...
- baselines (🥈27 · ⭐ 12K · 💀) - OpenAI Baselines: high-quality implementations of reinforcement..
MIT
- keras-rl (🥈25 · ⭐ 5.1K · 💀) - Deep Reinforcement Learning for Keras.
MIT
- TRFL (🥉21 · ⭐ 3.1K · 💀) - TensorFlow Reinforcement Learning.
Apache-2
- DeepMind Lab (🥉16 · ⭐ 6.5K) - A customisable 3D platform for agent-based AI research.
❗️GPL-2.0
- Maze (🥉11 · ⭐ 140 · 🐣) - Maze Applied Reinforcement Learning Framework.
❗️Custom
Libraries for building and evaluating recommendation systems.
scikit-surprise (🥇27 · ⭐ 4.9K · 💤) - A Python scikit for building and analyzing recommender.. BSD-3
lightfm (🥈25 · ⭐ 3.7K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
TF Ranking (🥈23 · ⭐ 2.2K) - Learning to Rank in TensorFlow. Apache-2
Recommenders (🥉21 · ⭐ 11K) - Best Practices on Recommendation Systems. MIT
-
GitHub (👨💻 96 · 🔀 1.8K · 📥 39 · 📦 2 · 📋 600 - 24% open · ⏱️ 17.06.2021):
git clone https://github.com/microsoft/recommenders
TF Recommenders (🥉21 · ⭐ 920) - TensorFlow Recommenders is a library for building.. Apache-2
Case Recommender (🥉18 · ⭐ 350) - Case Recommender: A Flexible and Extensible Python.. MIT
Show 5 hidden projects...
- tensorrec (🥉21 · ⭐ 1.2K · 💀) - A TensorFlow recommendation algorithm and framework in..
Apache-2
- recmetrics (🥉18 · ⭐ 280 · 💤) - A library of metrics for evaluating recommender systems.
MIT
- Spotlight (🥉17 · ⭐ 2.5K · 💀) - Deep recommender models using PyTorch.
MIT
- lkpy (🥉17 · ⭐ 170) - Python recommendation toolkit.
MIT
- OpenRec (🥉16 · ⭐ 380 · 💀) - OpenRec is an open-source and modular library for neural network-..
Apache-2
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
TensorFlow Privacy (🥈23 · ⭐ 1.4K) - Library for training machine learning models with.. Apache-2
TFEncrypted (🥉21 · ⭐ 900 · 💤) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
FATE (🥉20 · ⭐ 3.3K) - An Industrial Grade Federated Learning Framework. Apache-2
-
GitHub (👨💻 58 · 🔀 940 · 📋 910 - 33% open · ⏱️ 07.07.2021):
git clone https://github.com/FederatedAI/FATE
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🥇36 · ⭐ 5.6K) - TensorFlow's Visualization Toolkit. Apache-2
DVC (🥇32 · ⭐ 8.3K) - Data Version Control | Git for Data & Models | ML Experiments Management. Apache-2
wandb client (🥇31 · ⭐ 3.1K) - A tool for visualizing and tracking your machine learning.. MIT
SageMaker SDK (🥇31 · ⭐ 1.4K) - A library for training and deploying machine learning.. Apache-2
tensorboardX (🥈30 · ⭐ 7K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
AzureML SDK (🥈30 · ⭐ 2.4K) - This is a publish-only repository. All pull requests are ignored... MIT
ClearML (🥈27 · ⭐ 2.6K) - ClearML - Auto-Magical Suite of tools to streamline your ML.. Apache-2
-
GitHub (👨💻 34 · 🔀 370 · 📥 370 · 📦 52 · 📋 340 - 31% open · ⏱️ 15.07.2021):
git clone https://github.com/allegroai/clearml
-
PyPi (📥 26K / month · ⏱️ 22.06.2021):
pip install clearml
-
Docker Hub (📥 30K · ⏱️ 05.10.2020):
docker pull allegroai/trains
ml-metadata (🥈25 · ⭐ 350) - For recording and retrieving metadata associated with ML.. Apache-2
livelossplot (🥉24 · ⭐ 1.1K) - Live training loss plot in Jupyter Notebook for Keras,.. MIT
TensorWatch (🥉22 · ⭐ 3.1K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
Labml (🥉21 · ⭐ 600) - Monitor deep learning model training and hardware usage from your mobile.. MIT
aim (🥉18 · ⭐ 1.4K) - Aim a super-easy way to record, search and compare 1000s of ML training.. Apache-2
Show 12 hidden projects...
- knockknock (🥉23 · ⭐ 2.2K · 💀) - Knock Knock: Get notified when your training ends with only two..
MIT
- lore (🥉22 · ⭐ 1.5K · 💀) - Lore makes machine learning approachable for Software Engineers and..
MIT
- TensorBoard Logger (🥉22 · ⭐ 610 · 💀) - Log TensorBoard events without touching TensorFlow.
MIT
- hiddenlayer (🥉21 · ⭐ 1.5K · 💀) - Neural network graphs and training metrics for..
MIT
- quinn (🥉20 · ⭐ 260) - pyspark methods to enhance developer productivity.
Apache-2
- MXBoard (🥉19 · ⭐ 330 · 💀) - Logging MXNet data for visualization in TensorBoard.
Apache-2
- gokart (🥉19 · ⭐ 200) - Gokart solves reproducibility, task dependencies, constraints of good code,..
MIT
- SKLL (🥉18 · ⭐ 520) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine..
❗️BSD-1-Clause
- datmo (🥉17 · ⭐ 330 · 💀) - Open source production model management tool for data scientists.
MIT
- steppy (🥉16 · ⭐ 130 · 💀) - Lightweight, Python library for fast and reproducible experimentation.
MIT
- ModelChimp (🥉15 · ⭐ 120) - Experiment tracking for machine and deep learning projects.
BSD-2
- traintool (🥉10 · ⭐ 9) - Train off-the-shelf machine learning models in one..
Apache-2
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
TorchServe (🥇27 · ⭐ 1.9K) - Model Serving on PyTorch. Apache-2
-
GitHub (👨💻 75 · 🔀 340 · 📥 420 · 📦 64 · 📋 640 - 17% open · ⏱️ 15.07.2021):
git clone https://github.com/pytorch/serve
-
PyPi (📥 9K / month · ⏱️ 20.05.2021):
pip install torchserve
-
Conda (📥 10K · ⏱️ 21.05.2021):
conda install -c pytorch torchserve
-
Docker Hub (📥 67K · ⭐ 3 · ⏱️ 20.05.2021):
docker pull pytorch/torchserve
Core ML Tools (🥈24 · ⭐ 2.3K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
mmdnn (🥈22 · ⭐ 5.4K · 💤) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
Hummingbird (🥈22 · ⭐ 2.5K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
m2cgen (🥈22 · ⭐ 1.8K) - Transform ML models into a native code (Java, C, Python, Go, JavaScript,.. MIT
pytorch2keras (🥉19 · ⭐ 720) - PyTorch to Keras model convertor. MIT
Show 4 hidden projects...
- huggingface_hub (🥉21 · ⭐ 160) - Client library to download and publish models and other..
Apache-2
- sklearn-porter (🥉17 · ⭐ 1.1K · 💀) - Transpile trained scikit-learn estimators to C, Java,..
MIT
- Larq Compute Engine (🥉17 · ⭐ 150) - Highly optimized inference engine for Binarized..
Apache-2
- backprop (🥉15 · ⭐ 190) - Backprop makes it simple to use, finetune, and deploy state-of-the-..
Apache-2
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🥇34 · ⭐ 13K) - A game theoretic approach to explain the output of any machine learning model. MIT
InterpretML (🥇27 · ⭐ 3.9K) - Fit interpretable models. Explain blackbox machine learning. MIT
Model Analysis (🥇27 · ⭐ 1.1K) - Model analysis tools for TensorFlow. Apache-2
Fairness 360 (🥈25 · ⭐ 1.4K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
yellowbrick (🥈24 · ⭐ 3.3K · 📉) - Visual analysis and diagnostic tools to facilitate.. Apache-2
dtreeviz (🥈24 · ⭐ 1.6K) - A python library for decision tree visualization and model interpretation. MIT
Explainability 360 (🥈22 · ⭐ 880) - Interpretability and explainability of data and machine.. Apache-2
tf-explain (🥈22 · ⭐ 840) - Interpretability Methods for tf.keras models with Tensorflow 2.x. MIT
TreeInterpreter (🥉21 · ⭐ 670) - Package for interpreting scikit-learn's decision tree.. BSD-3
explainerdashboard (🥉21 · ⭐ 590) - Quickly build Explainable AI dashboards that show the inner.. MIT
random-forest-importances (🥉21 · ⭐ 460) - Code to compute permutation and drop-column.. MIT
sklearn-evaluation (🥉21 · ⭐ 310) - Machine learning model evaluation made easy: plots,.. MIT
What-If Tool (🥉19 · ⭐ 550) - Source code/webpage/demos for the What-If Tool. Apache-2
LIT (🥉18 · ⭐ 2.6K) - The Language Interpretability Tool: Interactively analyze NLP models for.. Apache-2
iNNvestigate (🥉18 · ⭐ 850) - A toolbox to iNNvestigate neural networks' predictions!. BSD-2
FlashTorch (🥉16 · ⭐ 600) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
Show 14 hidden projects...
- eli5 (🥇27 · ⭐ 2.4K · 💀) - A library for debugging/inspecting machine learning classifiers and..
MIT
- scikit-plot (🥈25 · ⭐ 2.1K · 💀) - An intuitive library to add plotting functionality to..
MIT
- keras-vis (🥈24 · ⭐ 2.8K · 💀) - Neural network visualization toolkit for keras.
MIT
- DALEX (🥉20 · ⭐ 860) - moDel Agnostic Language for Exploration and eXplanation.
❗️GPL-3.0
- Skater (🥉19 · ⭐ 990 · 💀) - Python Library for Model Interpretation/Explanations.
❗️UPL-1.0
- imodels (🥉19 · ⭐ 230) - Interpretable ML package for concise, transparent, and accurate predictive..
MIT
- responsible-ai-widgets (🥉19 · ⭐ 220) - This project provides responsible AI user interfaces..
MIT
- fairness-indicators (🥉19 · ⭐ 210) - Tensorflow's Fairness Evaluation and Visualization..
Apache-2
- model-card-toolkit (🥉18 · ⭐ 200) - a tool that leverages rich metadata and lineage..
Apache-2
- ExplainX.ai (🥉17 · ⭐ 210) - Explainable AI framework for data scientists. Explain & debug any..
MIT
- interpret-text (🥉15 · ⭐ 260) - A library that incorporates state-of-the-art explainers for..
MIT
- contextual-ai (🥉14 · ⭐ 69) - Contextual AI adds explainability to different stages of..
Apache-2
- Attribution Priors (🥉13 · ⭐ 83) - Tools for training explainable models using..
MIT
- bias-detector (🥉12 · ⭐ 34 · 🐣) - Bias Detector is a python package for detecting bias in machine..
MIT
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.
🔗 ANN Benchmarks ( ⭐ 2.3K) - Benchmarks of approximate nearest neighbor libraries in Python.
Annoy (🥇30 · ⭐ 8.7K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
Faiss (🥇29 · ⭐ 14K) - A library for efficient similarity search and clustering of dense vectors. MIT
NMSLIB (🥇29 · ⭐ 2.5K) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
hnswlib (🥈26 · ⭐ 1.6K) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
PyNNDescent (🥈26 · ⭐ 420) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
Milvus (🥉25 · ⭐ 6.9K) - An open source vector database powered by Faiss, NMSLIB and Annoy. Apache-2
-
GitHub (👨💻 150 · 🔀 940 · 📋 2.8K - 11% open · ⏱️ 15.07.2021):
git clone https://github.com/milvus-io/milvus
-
PyPi (📥 19K / month · 📦 6 · ⏱️ 12.07.2021):
pip install pymilvus
-
Docker Hub (📥 410K · ⭐ 13 · ⏱️ 13.07.2021):
docker pull milvusdb/milvus
N2 (🥉20 · ⭐ 480) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast.. Apache-2
Show 2 hidden projects...
Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.
tensorflow-probability (🥇30 · ⭐ 3.4K) - Probabilistic reasoning and statistical analysis in.. Apache-2
pomegranate (🥈27 · ⭐ 2.7K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
Orbit (🥉20 · ⭐ 590) - A Python package for Bayesian forecasting with object-oriented design.. Apache-2
Baal (🥉18 · ⭐ 360) - Using approximate bayesian posteriors in deep nets for active learning. Apache-2
Show 8 hidden projects...
- patsy (🥇29 · ⭐ 760 · 💀) - Describing statistical models in Python using symbolic formulas.
BSD-2
- Edward (🥉25 · ⭐ 4.6K · 💀) - A probabilistic programming language in TensorFlow. Deep..
Apache-2
- pingouin (🥉25 · ⭐ 760) - Statistical package in Python based on Pandas.
❗️GPL-3.0
- PyStan (🥉23 · ⭐ 100) - PyStan, a Python interface to Stan, a platform for statistical modeling...
ISC
- scikit-posthocs (🥉21 · ⭐ 210) - Multiple Pairwise Comparisons (Post Hoc) Tests in Python.
MIT
- Funsor (🥉19 · ⭐ 180) - Functional tensors for probabilistic programming.
Apache-2
- ZhuSuan (🥉14 · ⭐ 2.1K · 💀) - A probabilistic programming library for Bayesian deep learning,..
MIT
- Lea (🥉10 · 💤) - Discrete probability distributions in Python.
❗️GPL-3.0
Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
CleverHans (🥇27 · ⭐ 5.2K) - An adversarial example library for constructing attacks,.. MIT
Foolbox (🥇27 · ⭐ 2K) - A Python toolbox to create adversarial examples that fool neural networks.. MIT
TextAttack (🥈25 · ⭐ 1.5K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
ART (🥈23 · ⭐ 2.4K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
robustness (🥉19 · ⭐ 560) - A library for experimenting with, training and evaluating neural.. MIT
AdvBox (🥉18 · ⭐ 1.1K) - Advbox is a toolbox to generate adversarial examples that fool neural.. Apache-2
Show 3 hidden projects...
Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.
CuPy (🥇32 · ⭐ 5.2K) - NumPy & SciPy for GPU. MIT
-
GitHub (👨💻 270 · 🔀 480 · 📥 17K · 📦 770 · 📋 1.4K - 24% open · ⏱️ 15.07.2021):
git clone https://github.com/cupy/cupy
-
PyPi (📥 80K / month · 📦 190 · ⏱️ 24.06.2021):
pip install cupy
-
Conda (📥 750K · ⏱️ 26.06.2021):
conda install -c conda-forge cupy
-
Docker Hub (📥 52K · ⭐ 6 · ⏱️ 15.07.2021):
docker pull cupy/cupy
scikit-cuda (🥈23 · ⭐ 840) - Python interface to GPU-powered libraries. BSD-3
DALI (🥉17 · ⭐ 3.4K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
-
GitHub (👨💻 60 · 🔀 410 · 📋 1K - 16% open · ⏱️ 13.07.2021):
git clone https://github.com/NVIDIA/DALI
BlazingSQL (🥉17 · ⭐ 1.5K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
Vulkan Kompute (🥉17 · ⭐ 430) - General purpose GPU compute framework for cross vendor.. Apache-2
cuSignal (🥉14 · ⭐ 500) - GPU accelerated signal processing. Apache-2
-
GitHub (👨💻 31 · 🔀 66 · 📋 120 - 12% open · ⏱️ 14.07.2021):
git clone https://github.com/rapidsai/cusignal
Show 5 hidden projects...
- GPUtil (🥈23 · ⭐ 740 · 💀) - A Python module for getting the GPU status from NVIDA GPUs using..
MIT
- py3nvml (🥈23 · ⭐ 180) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your..
BSD-3
- nvidia-ml-py3 (🥉20 · ⭐ 68 · 💀) - Python 3 Bindings for the NVIDIA Management Library.
BSD-3
- SpeedTorch (🥉16 · ⭐ 620 · 💀) - Library for faster pinned CPU - GPU transfer in Pytorch.
MIT
- ipyexperiments (🥉16 · ⭐ 140) - jupyter/ipython experiment containers for GPU and..
Apache-2
Libraries that extend TensorFlow with additional capabilities.
tensor2tensor (🥇33 · ⭐ 11K · 📈) - Library of deep learning models and datasets designed.. Apache-2
TensorFlow Datasets (🥇33 · ⭐ 2.9K) - TFDS is a collection of datasets ready to use with.. Apache-2
tensorflow-hub (🥇33 · ⭐ 2.9K · 📈) - A library for transfer learning by reusing parts of.. Apache-2
Keras-Preprocessing (🥈29 · ⭐ 960) - Utilities for working with image data, text data, and.. MIT
TensorFlow Transform (🥈29 · ⭐ 880) - Input pipeline framework. Apache-2
TF Model Optimization (🥉28 · ⭐ 1.1K) - A toolkit to optimize ML models for deployment for.. Apache-2
efficientnet (🥉24 · ⭐ 1.9K · 💤) - Implementation of EfficientNet model. Keras and.. Apache-2
TensorFlow I/O (🥉24 · ⭐ 470) - Dataset, streaming, and file system extensions.. Apache-2
Neural Structured Learning (🥉23 · ⭐ 840) - Training neural models with structured signals. Apache-2
TensorNets (🥉20 · ⭐ 990) - High level network definitions with pre-trained weights in.. MIT
TF Compression (🥉18 · ⭐ 510) - Data compression in TensorFlow. Apache-2
Show 2 hidden projects...
- TensorFlow Cloud (🥉26 · ⭐ 300) - The TensorFlow Cloud repository provides APIs that..
Apache-2
- tffm (🥉19 · ⭐ 760 · 💀) - TensorFlow implementation of an arbitrary order Factorization Machine.
MIT
Libraries that extend scikit-learn with additional capabilities.
imbalanced-learn (🥇32 · ⭐ 5.3K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
category_encoders (🥈26 · ⭐ 1.7K · 💤) - A library of sklearn compatible categorical variable.. BSD-3
fancyimpute (🥈26 · ⭐ 980) - Multivariate imputation and matrix completion algorithms.. Apache-2
sklearn-contrib-lightning (🥈24 · ⭐ 1.5K) - Large-scale linear classification, regression and.. BSD-3
-
GitHub (👨💻 17 · 🔀 190 · 📥 71 · 📦 85 · 📋 90 - 55% open · ⏱️ 15.06.2021):
git clone https://github.com/scikit-learn-contrib/lightning
-
PyPi (📥 1.9K / month · 📦 10 · ⏱️ 15.06.2021):
pip install sklearn-contrib-lightning
-
Conda (📥 150K · ⏱️ 20.12.2020):
conda install -c conda-forge sklearn-contrib-lightning
scikit-opt (🥈23 · ⭐ 2.4K) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
scikit-lego (🥉20 · ⭐ 550) - Extra blocks for scikit-learn pipelines. MIT
iterative-stratification (🥉19 · ⭐ 580 · 💤) - scikit-learn cross validators for iterative.. BSD-3
Show 7 hidden projects...
- sklearn-crfsuite (🥈24 · ⭐ 370 · 💀) - scikit-learn inspired API for CRFsuite.
MIT
- scikit-multilearn (🥈23 · ⭐ 670 · 💀) - A scikit-learn based module for multi-label et. al...
BSD-2
- skope-rules (🥉21 · ⭐ 400 · 💤) - machine learning with logical rules in Python.
❗️BSD-1-Clause
- scikit-tda (🥉17 · ⭐ 290) - Topological Data Analysis for Python.
MIT
- skggm (🥉16 · ⭐ 180 · 💤) - Scikit-learn compatible estimation of general graphical models.
MIT
- celer (🥉16 · ⭐ 120) - Fast solver for L1-type problems: Lasso, sparse Logisitic regression,..
BSD-3
- dabl (🥉16 · ⭐ 88) - Data Analysis Baseline Library.
BSD-3
Libraries that extend Pytorch with additional capabilities.
EfficientNet-PyTorch (🥇27 · ⭐ 6.2K) - A PyTorch implementation of EfficientNet and.. Apache-2
pytorch-summary (🥇26 · ⭐ 3.2K) - Model summary in PyTorch similar to `model.summary()` in.. MIT
pytorch-optimizer (🥇26 · ⭐ 2K) - torch-optimizer -- collection of optimizers for.. Apache-2
PML (🥈25 · ⭐ 3.4K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
torchdiffeq (🥈24 · ⭐ 3.6K) - Differentiable ODE solvers with full GPU support and.. MIT
SRU (🥈24 · ⭐ 2K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
EfficientNets (🥈23 · ⭐ 1.4K · 📈) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
accelerate (🥈22 · ⭐ 1.6K) - A simple way to train and use PyTorch models with multi-.. Apache-2
lightning-flash (🥈22 · ⭐ 820 · 🐣) - Collection of tasks for fast prototyping, baselining,.. Apache-2
torch-scatter (🥈22 · ⭐ 700) - PyTorch Extension Library of Optimized Scatter Operations. MIT
PyTorch Sparse (🥈22 · ⭐ 430) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
reformer-pytorch (🥉21 · ⭐ 1.5K) - Reformer, the efficient Transformer, in Pytorch. MIT
Pytorch Toolbelt (🥉20 · ⭐ 1K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
Performer Pytorch (🥉18 · ⭐ 650) - An implementation of Performer, a linear attention-based.. MIT
Lambda Networks (🥉17 · ⭐ 1.5K · 💤) - Implementation of LambdaNetworks, a new approach to.. MIT
Tensor Sensor (🥉17 · ⭐ 570) - The goal of this library is to generate more helpful.. MIT
tinygrad (🥉16 · ⭐ 4.9K) - You like pytorch? You like micrograd? You love tinygrad!. MIT
-
GitHub (👨💻 49 · 🔀 540 · 📦 1 · 📋 85 - 24% open · ⏱️ 29.06.2021):
git clone https://github.com/geohot/tinygrad
Torch-Struct (🥉15 · ⭐ 960) - Fast, general, and tested differentiable structured prediction.. MIT
-
GitHub (👨💻 13 · 🔀 72 · 📋 43 - 41% open · ⏱️ 09.05.2021):
git clone https://github.com/harvardnlp/pytorch-struct
torchsde (🥉15 · ⭐ 770) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
-
GitHub (👨💻 5 · 🔀 76 · 📋 38 - 15% open · ⏱️ 07.07.2021):
git clone https://github.com/google-research/torchsde
Show 6 hidden projects...
- pretrainedmodels (🥇29 · ⭐ 8.1K · 💀) - Pretrained ConvNets for pytorch: NASNet, ResNeXt,..
BSD-3
- AdaBound (🥉20 · ⭐ 2.9K · 💀) - An optimizer that trains as fast as Adam and as good as SGD.
Apache-2
- Poutyne (🥉20 · ⭐ 480) - A simplified framework and utilities for PyTorch.
❗️LGPL-3.0
- Antialiased CNNs (🥉16 · ⭐ 1.4K · 💤) - pip install antialiased-cnns to improve stability and..
❗️CC BY-NC-SA 4.0
- TorchDrift (🥉15 · ⭐ 160 · 🐣) - Drift Detection for your PyTorch Models.
Apache-2
- micrograd (🥉14 · ⭐ 1.8K · 💀) - A tiny scalar-valued autograd engine and a neural net library..
MIT
Libraries for connecting to, operating, and querying databases.
🔗 best-of-python - DB Clients ( ⭐ 1.7K · 🐣) - Collection of database clients for python.
scipy (🥇42 · ⭐ 8.4K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
Datasette (🥇28 · ⭐ 5.2K · 📉) - An open source multi-tool for exploring and publishing data. Apache-2
agate (🥈27 · ⭐ 1K) - A Python data analysis library that is optimized for humans instead of machines. MIT
TabPy (🥈25 · ⭐ 1.1K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
pyclustering (🥈25 · ⭐ 850) - pyclustring is a Python, C++ data mining library. BSD-3
pyjanitor (🥈25 · ⭐ 700) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
PennyLane (🥈24 · ⭐ 930) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
metric-learn (🥉23 · ⭐ 1.1K) - Metric learning algorithms in Python. MIT
alibi-detect (🥉22 · ⭐ 760) - Algorithms for outlier, adversarial and drift detection. Apache-2
scikit-rebate (🥉22 · ⭐ 320) - A scikit-learn-compatible Python implementation of ReBATE, a.. MIT
Feature Engine (🥉21 · ⭐ 580) - Feature engineering package with sklearn like functionality. BSD-3
StreamAlert (🥉19 · ⭐ 2.6K) - StreamAlert is a serverless, realtime data analysis framework.. Apache-2
-
GitHub (👨💻 31 · 🔀 300 · 📋 340 - 26% open · ⏱️ 10.02.2021):
git clone https://github.com/airbnb/streamalert
River (🥉19 · ⭐ 1.7K) - Online machine learning in Python. BSD-3
-
GitHub (👨💻 65 · 🔀 220 · 📦 28 · 📋 310 - 8% open · ⏱️ 29.06.2021):
git clone https://github.com/online-ml/river
opyrator (🥉18 · ⭐ 2.3K · 🐣) - Turns your machine learning code into microservices with web API,.. MIT
baikal (🥉18 · ⭐ 580) - A graph-based functional API for building complex scikit-learn pipelines. BSD-3
avalanche (🥉17 · ⭐ 540) - Avalanche: an End-to-End Library for Continual Learning. MIT
-
GitHub (👨💻 33 · 🔀 73 · 📋 370 - 16% open · ⏱️ 07.07.2021):
git clone https://github.com/ContinualAI/avalanche
apricot (🥉17 · ⭐ 320) - apricot implements submodular optimization for the purpose of selecting.. MIT
traingenerator (🥉11 · ⭐ 1K) - A web app to generate template code for machine learning. MIT
-
GitHub (👨💻 3 · 🔀 140 · 📋 12 - 75% open · ⏱️ 29.04.2021):
git clone https://github.com/jrieke/traingenerator
Show 14 hidden projects...
- Cython BLIS (🥈26 · ⭐ 170) - Fast matrix-multiplication as a self-contained Python library no..
BSD-3
- pysc2 (🥈25 · ⭐ 7.3K · 💀) - StarCraft II Learning Environment.
Apache-2
- datalad (🥈24 · ⭐ 250) - Keep code, data, containers under control with git and git-annex.
MIT
- minisom (🥉23 · ⭐ 870) - MiniSom is a minimalistic implementation of the Self Organizing..
❗️CC-BY-3.0
- cleanlab (🥉21 · ⭐ 2K) - The standard package for machine learning with noisy labels and..
❗️AGPL-3.0
- mlens (🥉21 · ⭐ 690 · 💀) - ML-Ensemble high performance ensemble learning.
MIT
- impyute (🥉20 · ⭐ 280 · 💀) - Data imputations library to preprocess datasets with missing data.
MIT
- SUOD (🥉20 · ⭐ 280) - (MLSys' 21) An Acceleration System for Large-scare Unsupervised..
BSD-2
- vecstack (🥉19 · ⭐ 630 · 💀) - Python package for stacking (machine learning technique).
MIT
- rrcf (🥉19 · ⭐ 320 · 💀) - Implementation of the Robust Random Cut Forest algorithm for anomaly..
MIT
- pandas-ml (🥉19 · ⭐ 270 · 💀) - pandas, scikit-learn, xgboost and seaborn integration.
BSD-3
- dstack (🥉16 · ⭐ 190) - An open-source tool to rapidly develop data applications with Python.
Apache-2
- pykale (🥉14 · ⭐ 250) - Knowledge-Aware machine LEarning (KALE) from multiple sources in Python.
MIT
- nylon (🥉13 · ⭐ 62 · 🐣) - An intelligent, flexible grammar of machine learning.
MIT
- Papers With Code: Discover ML papers, code, and evaluation tables.
- Sotabench: Discover & compare open-source ML models.
- Google Dataset Search: Dataset search engine by Google.
- Dataset List: List of the biggest ML datasets from across the web.
- Awesome Public Datasets: A topic-centric list of open datasets.
- Best-of lists: Discover other best-of lists with awesome open-source projects on all kinds of topics.
- best-of-python-dev: A ranked list of awesome python developer tools and libraries.
- best-of-web-python: A ranked list of awesome python libraries for web development.
Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:
- Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
- Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.
If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.
For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.