
Welcome to sktime

A unified interface for machine learning with time series

🚀 Version 0.30.1 out now! Check out the release notes here.

sktime is a library for time series analysis in Python. It provides a unified interface for multiple time series learning tasks. Currently, this includes time series classification, regression, clustering, annotation, and forecasting. It comes with time series algorithms and scikit-learn compatible tools to build, tune and validate time series models.

Overview
Open Source | BSD 3-clause
Tutorials | Binder · YouTube
Community | Discord · Slack
CI/CD | GitHub Actions · Codecov · Read the Docs
Code | PyPI · conda · Python versions · Black
Downloads | PyPI
Citation | Zenodo

📚 Documentation

Documentation
⭐ Tutorials | New to sktime? Here's everything you need to know!
📋 Binder Notebooks | Example notebooks to play with in your browser.
👩‍💻 User Guides | How to use sktime and its features.
✂️ Extension Templates | How to build your own estimator using sktime's API.
🎛️ API Reference | The detailed reference for sktime's API.
📺 Video Tutorial | Our video tutorial from 2021 PyData Global.
🛠️ Changelog | Changes and version history.
🌳 Roadmap | sktime's software and community development plan.
📝 Related Software | A list of related software.

💬 Where to ask questions

Questions and feedback are extremely welcome! We strongly believe in the value of sharing help publicly, as it allows a wider audience to benefit from it.

Type | Platforms
🐛 Bug Reports | GitHub Issue Tracker
✨ Feature Requests & Ideas | GitHub Issue Tracker
👩‍💻 Usage Questions | GitHub Discussions · Stack Overflow
💬 General Discussion | GitHub Discussions
🏭 Contribution & Development | dev-chat channel · Discord
🌍 Meet-ups and collaboration sessions | Discord - Fridays 13 UTC, dev/meet-ups channel

💫 Features

Our objective is to enhance the interoperability and usability of the time series analysis ecosystem in its entirety. sktime provides a unified interface for distinct but related time series learning tasks. It features dedicated time series algorithms and tools for composite model building such as pipelining, ensembling, tuning, and reduction, empowering users to apply an algorithm designed for one task to another.

sktime also provides interfaces to related libraries, for example scikit-learn, statsmodels, tsfresh, PyOD, and fbprophet, among others.
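
Reduction, as mentioned above, means solving one learning task with an algorithm built for another. As a rough illustration (not sktime's implementation), forecasting can be reduced to tabular regression by sliding a window over the series: each window of past values becomes a feature row, and the value that follows becomes the target. The function names and the toy "last difference" model below are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Sequence, Tuple


def sliding_window_table(
    y: Sequence[float], window_length: int
) -> Tuple[List[List[float]], List[float]]:
    """Turn a series into (feature rows, targets) for a tabular regressor."""
    X, targets = [], []
    for start in range(len(y) - window_length):
        X.append(list(y[start : start + window_length]))  # past values
        targets.append(y[start + window_length])  # value to predict
    return X, targets


def recursive_forecast(
    y: Sequence[float],
    predict_one: Callable[[List[float]], float],
    window_length: int,
    steps: int,
) -> List[float]:
    """Forecast several steps by feeding each prediction back as input."""
    history = list(y)
    forecasts = []
    for _ in range(steps):
        next_value = predict_one(history[-window_length:])
        forecasts.append(next_value)
        history.append(next_value)
    return forecasts


series = [1.0, 2.0, 3.0, 4.0, 5.0]
X, targets = sliding_window_table(series, window_length=2)
# X is [[1.0, 2.0], [2.0, 3.0], [3.0, 4.0]] and targets is [3.0, 4.0, 5.0]


# Stand-in for a fitted regressor: extrapolate the last difference in the window.
def last_difference_model(window: List[float]) -> float:
    return window[-1] + (window[-1] - window[-2])


forecasts = recursive_forecast(series, last_difference_model, window_length=2, steps=3)
# forecasts is [6.0, 7.0, 8.0]
```

In sktime itself, this idea is available through `make_reduction` in `sktime.forecasting.compose`, which wraps a scikit-learn regressor as a forecaster.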

Module | Status | Links
Forecasting | stable | Tutorial · API Reference · Extension Template
Time Series Classification | stable | Tutorial · API Reference · Extension Template
Time Series Regression | stable | API Reference
Transformations | stable | Tutorial · API Reference · Extension Template
Parameter fitting | maturing | API Reference · Extension Template
Time Series Clustering | maturing | API Reference · Extension Template
Time Series Distances/Kernels | maturing | Tutorial · API Reference · Extension Template
Time Series Alignment | experimental | API Reference · Extension Template
Annotation | experimental | Extension Template
Time Series Splitters | maturing | Extension Template
Distributions and simulation | experimental |

⏳ Install sktime

For troubleshooting and detailed installation instructions, see the documentation.

  • Operating system: macOS · Linux · Windows 8.1 or higher
  • Python version: Python 3.8, 3.9, 3.10, 3.11, and 3.12 (only 64-bit)
  • Package managers: pip · conda (via conda-forge)
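
A quick way to check an interpreter against the requirements listed above is sketched below. The helper name `is_supported` is hypothetical, and the version bounds are copied from the list above for this release; other releases may differ.

```python
import sys


def is_supported(version: tuple, is_64bit: bool) -> bool:
    """Check a (major, minor, ...) version and bitness against the list above."""
    return is_64bit and (3, 8) <= tuple(version[:2]) <= (3, 12)


# sys.maxsize exceeds 2**32 only on 64-bit builds of Python.
running_64bit = sys.maxsize > 2**32
print(is_supported(sys.version_info, running_64bit))
```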

pip

Using pip, sktime releases are available as source packages and binary wheels. Available wheels are listed here.

pip install sktime

or, with maximum dependencies,

pip install sktime[all_extras]

For curated sets of soft dependencies for specific learning tasks:

pip install sktime[forecasting]  # for selected forecasting dependencies
pip install sktime[forecasting,transformations]  # forecasters and transformers

or similar. Valid sets are:

  • forecasting
  • transformations
  • classification
  • regression
  • clustering
  • param_est
  • networks
  • annotation
  • alignment

Caveat: in general, these sets install only a curated selection of the soft dependencies for a learning task, not all of them.
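
Because only a curated selection is installed, it can help to check which optional packages are actually importable in the current environment. The sketch below uses only the standard library; the helper name `check_installed` and the package list are illustrative assumptions, not an official list of sktime soft dependencies.

```python
from importlib.util import find_spec
from typing import Dict, Iterable


def check_installed(packages: Iterable[str]) -> Dict[str, bool]:
    """Map each top-level import name to whether it can be found."""
    return {name: find_spec(name) is not None for name in packages}


# Import names of libraries mentioned in this README; adjust as needed.
status = check_installed(["sktime", "sklearn", "statsmodels", "tsfresh", "pyod"])
for name, installed in status.items():
    print(f"{name}: {'installed' if installed else 'missing'}")
```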

conda

You can also install sktime from conda via the conda-forge channel. The feedstock including the build recipe and configuration is maintained in this conda-forge repository.

conda install -c conda-forge sktime

or, with maximum dependencies,

conda install -c conda-forge sktime-all-extras

Because conda does not support dependency sets, soft dependencies cannot be selected individually when installing via conda.

⚡ Quickstart

Forecasting

from sktime.datasets import load_airline
from sktime.forecasting.base import ForecastingHorizon
from sktime.forecasting.theta import ThetaForecaster
from sktime.split import temporal_train_test_split
from sktime.performance_metrics.forecasting import mean_absolute_percentage_error

y = load_airline()  # monthly airline passenger series
y_train, y_test = temporal_train_test_split(y)  # split in time order, no shuffling
fh = ForecastingHorizon(y_test.index, is_relative=False)  # absolute horizon: the test period
forecaster = ThetaForecaster(sp=12)  # monthly seasonal periodicity
forecaster.fit(y_train)
y_pred = forecaster.predict(fh)
mean_absolute_percentage_error(y_test, y_pred)
# output: 0.08661467738190656
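
To make the evaluation step above concrete, here is a dependency-free sketch of the two ideas it relies on: a temporal split keeps the observations in order and holds out the most recent ones, and the mean absolute percentage error averages |y_true - y_pred| / |y_true|. The helper names and toy numbers are hypothetical, the 25% test fraction is an assumption for illustration, and none of this is sktime's own implementation.

```python
from typing import List, Sequence, Tuple


def temporal_split(
    y: Sequence[float], test_fraction: float = 0.25
) -> Tuple[List[float], List[float]]:
    """Split without shuffling: the last observations form the test set."""
    n_test = max(1, round(len(y) * test_fraction))
    return list(y[:-n_test]), list(y[-n_test:])


def mape(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean of |error| / |actual|; undefined if any actual value is zero."""
    ratios = [abs(t - p) / abs(t) for t, p in zip(y_true, y_pred)]
    return sum(ratios) / len(ratios)


y = [100.0, 110.0, 120.0, 130.0, 140.0, 150.0, 160.0, 200.0]
y_train, y_test = temporal_split(y)  # train: first 6 values, test: last 2
y_pred = [y_train[-1]] * len(y_test)  # naive forecast: repeat the last training value
score = mape(y_test, y_pred)  # (10/160 + 50/200) / 2 = 0.15625
```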

Time Series Classification

from sktime.classification.interval_based import TimeSeriesForestClassifier
from sktime.datasets import load_arrow_head
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

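X, y = load_arrow_head()  # ArrowHead dataset: series X, class labels y
X_train, X_test, y_train, y_test = train_test_split(X, y)  # random split; pass random_state to make it reproducible
classifier = TimeSeriesForestClassifier()
classifier.fit(X_train, y_train)
y_pred = classifier.predict(X_test)
accuracy_score(y_test, y_pred)
# example output (varies between runs, since the split is random): 0.8679245283018868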
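
Interval-based classifiers such as the time series forest summarise subintervals of each series with simple statistics, commonly the mean, standard deviation, and slope, and pass those features to tree-based learners. The sketch below computes these three features for one fixed interval. The function name is hypothetical, and the actual estimator samples many random intervals and handles all of this internally.

```python
from math import sqrt
from typing import Sequence, Tuple


def interval_features(
    series: Sequence[float], start: int, end: int
) -> Tuple[float, float, float]:
    """Return (mean, standard deviation, slope) of series[start:end].

    The interval must contain at least two points for the slope to exist.
    """
    window = series[start:end]
    n = len(window)
    mean = sum(window) / n
    std = sqrt(sum((v - mean) ** 2 for v in window) / n)  # population std
    # Least-squares slope of the values against their positions 0..n-1.
    t_mean = (n - 1) / 2
    denom = sum((t - t_mean) ** 2 for t in range(n))
    slope = sum((t - t_mean) * (v - mean) for t, v in enumerate(window)) / denom
    return mean, std, slope


mean, std, slope = interval_features([5.0, 1.0, 3.0, 5.0, 7.0, 2.0], start=1, end=5)
# The interval [1.0, 3.0, 5.0, 7.0] has mean 4.0 and slope 2.0
```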

👋 How to get involved

There are many ways to join the sktime community. We follow the all-contributors specification: all kinds of contributions are welcome - not just code.

Documentation
💝 Contribute | How to contribute to sktime.
🎒 Mentoring | New to open source? Apply to our mentoring program!
📅 Meetings | Join our discussions, tutorials, workshops, and sprints!
👩‍🔧 Developer Guides | How to further develop sktime's code base.
🚧 Enhancement Proposals | Design a new feature for sktime.
🏅 Contributors | A list of all contributors.
🙋 Roles | An overview of our core community roles.
💸 Donate | Fund sktime maintenance and development.
🏛️ Governance | How and by whom decisions are made in sktime's community.

🏆 Hall of fame

Thanks to everyone in our community for your wonderful contributions, PRs, issues, and ideas.


💡 Project vision

  • By the community, for the community -- developed by a friendly and collaborative community.
  • The right tool for the right task -- helping users diagnose their learning problem and identify suitable scientific model types.
  • Embedded in state-of-the-art ecosystems and provider of interoperable interfaces -- interoperable with scikit-learn, statsmodels, tsfresh, and other community favorites.
  • Rich model composition and reduction functionality -- build tuning and feature extraction pipelines, solve forecasting tasks with scikit-learn regressors.
  • Clean, descriptive specification syntax -- based on modern object-oriented design principles for data science.
  • Fair model assessment and benchmarking -- build your models, inspect your models, check your models, and avoid pitfalls.
  • Easily extensible -- easy extension templates to add your own algorithms compatible with sktime's API.