/pymc3

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

Primary LanguagePythonOtherNOASSERTION

PyMC3 logo

Build Status Coverage NumFOCUS_badge Binder Dockerhub

PyMC3 is a Python package for Bayesian statistical modeling and Probabilistic Machine Learning focusing on advanced Markov chain Monte Carlo (MCMC) and variational inference (VI) algorithms. Its flexibility and extensibility make it applicable to a large suite of problems.

Check out the getting started guide, or interact with live examples using Binder! For questions on PyMC3, head on over to our PyMC Discourse forum.

The future of PyMC3 & Theano

There have been many questions and uncertainty around the future of PyMC3 since Theano stopped getting developed by the original authors, and we started experiments with PyMC4.

We are happy to announce that PyMC3 on Theano (which we are developing further) with a new JAX backend is the future. PyMC4 will not be developed further.

See the full announcement for more details.

Features

  • Intuitive model specification syntax, for example, x ~ N(0,1) translates to x = Normal('x',0,1)
  • Powerful sampling algorithms, such as the No U-Turn Sampler, allow complex models with thousands of parameters with little specialized knowledge of fitting algorithms.
  • Variational inference: ADVI for fast approximate posterior estimation as well as mini-batch ADVI for large data sets.
  • Relies on Theano-PyMC which provides:
    • Computation optimization and dynamic C or JAX compilation
    • Numpy broadcasting and advanced indexing
    • Linear algebra operators
    • Simple extensibility
  • Transparent support for missing value imputation

Getting started

If you already know about Bayesian statistics:

Learn Bayesian statistics with a book together with PyMC3:

PyMC3 talks

There are also several talks on PyMC3 which are gathered in this YouTube playlist and as part of PyMCon 2020

Installation

The latest release of PyMC3 can be installed from PyPI using pip:

pip install pymc3

Note: Running pip install pymc will install PyMC 2.3, not PyMC3, from PyPI.

Or via conda-forge:

conda install -c conda-forge pymc3

Plotting is done using ArviZ - if you follow the installation instructions above, then it will be installed alongside PyMC3.

The current development branch of PyMC3 can be installed from GitHub, also using pip:

pip install git+https://github.com/pymc-devs/pymc3

To ensure the development branch of Theano is installed alongside PyMC3 (recommended), you can install PyMC3 using the requirements.txt file. This requires cloning the repository to your computer:

git clone https://github.com/pymc-devs/pymc3
cd pymc3
pip install -r requirements.txt

However, if a recent version of Theano has already been installed on your system, you can install PyMC3 directly from GitHub.

Another option is to clone the repository and install PyMC3 using python setup.py install or python setup.py develop.

Dependencies

PyMC3 is tested on Python 3.6, 3.7, and 3.8 and depends on Theano-PyMC, NumPy, SciPy, and pandas (see requirements.txt for version information).

Optional

In addtion to the above dependencies, the GLM submodule relies on Patsy.

Citing PyMC3

Salvatier J., Wiecki T.V., Fonnesbeck C. (2016) Probabilistic programming in Python using PyMC3. PeerJ Computer Science 2:e55 DOI: 10.7717/peerj-cs.55.

Contact

We are using discourse.pymc.io as our main communication channel. You can also follow us on Twitter @pymc_devs for updates and other announcements.

To ask a question regarding modeling or usage of PyMC3 we encourage posting to our Discourse forum under the “Questions” Category. You can also suggest feature in the “Development” Category.

To report an issue with PyMC3 please use the issue tracker.

Finally, if you need to get in touch for non-technical information about the project, send us an e-mail.

License

Apache License, Version 2.0

Software using PyMC3

  • Exoplanet: a toolkit for modeling of transit and/or radial velocity observations of exoplanets and other astronomical time series.
  • Bambi: BAyesian Model-Building Interface (BAMBI) in Python.
  • pymc3_models: Custom PyMC3 models built on top of the scikit-learn API.
  • PMProphet: PyMC3 port of Facebook's Prophet model for timeseries modeling
  • webmc3: A web interface for exploring PyMC3 traces
  • sampled: Decorator for PyMC3 models.
  • NiPyMC: Bayesian mixed-effects modeling of fMRI data in Python.
  • beat: Bayesian Earthquake Analysis Tool.
  • pymc-learn: Custom PyMC models built on top of pymc3_models/scikit-learn API
  • fenics-pymc3: Differentiable interface to FEniCS, a library for solving partial differential equations.
  • cell2location: Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics.

Please contact us if your software is not listed here.

Papers citing PyMC3

See Google Scholar for a continuously updated list.

Contributors

See the GitHub contributor page. Also read our Code of Conduct guidelines for a better contributing experience.

Support

PyMC3 is a non-profit project under NumFOCUS umbrella. If you want to support PyMC3 financially, you can donate here.

PyMC for enterprise

PyMC is now available as part of the Tidelift Subscription!

Tidelift is working with PyMC and the maintainers of thousands of other open source projects to deliver commercial support and maintenance for the open source dependencies you use to build your applications. Save time, reduce risk, and improve code health, while contributing financially to PyMC -- making it even more robust, reliable and, let's face it, amazing!

tidelift_learn tidelift_demo

Sponsors

NumFOCUS

Quantopian

ODSC