/pyramid

Pyramid brings R's beloved auto.arima to Python, making an even stronger case for why you don't need R for data science. It does so not by calling R under the hood, but by wrapping statsmodels' well-tested ARIMA and SARIMAX estimators in a single, easy-to-use scikit-learn-esque estimator.

Primary LanguagePythonMIT LicenseMIT

PyPI version Linux build status Windows build status Coverage Status Supported versions Supported versions

pyramid

Pyramid is a no-nonsense statistical Python library with a solitary objective: bring R's auto.arima functionality to Python. Pyramid operates by wrapping statsmodels.tsa.ARIMA and statsmodels.tsa.statespace.SARIMAX into one estimator class and creating a more user-friendly estimator interface for programmers familiar with scikit-learn.

Installation

Pyramid is on pypi under the package name pyramid-arima and can be downloaded via pip:

$ pip install pyramid-arima

To ensure the package was built correctly, import the following module in python:

from pyramid.arima import auto_arima

Quickstart

For an easy, reproducible quick-start example, see examples/.

Other considerations

  • How do I make pyramid run as quickly as R?

    • R's code is heavily C-based. Pyramid runs on statsmodels, which is Python based. There will be some differences in performance speed-wise, but much of it can be eliminated by using stepwise=True. See this discussion for more thoughts...
  • Refreshing ARIMA models

    • Periodically, your ARIMA will need to be refreshed given new observations. See this discussion and this discussion on either re-using auto_arima-estimated order terms or re-fitting altogether.