/ff3

Primary LanguagePython

ff3

ff3 is a library for creating statistical reports and visualizations based on sample datasets. It can help you to detect domain drifts on your data and create low-dimensional projections.

Installation

pip install ff3

Usage

DriftReport

In this section you'll find examples on how to use ff3 library to create drift reports.

ff3.DriftReport creates a statistical report based on a comparison between given datasets distributions. It compares them by a set of statistical tests and finds deviations.

import pandas as pd
from ff3 import DataDrift

df1 = pd.read_csv("yesterday-data.csv")
df2 = pd.read_csv("today-data.csv")

drift = DriftReport(d1, d2)
drift.eval()
drift.detailed_report()  # create an interactive report

img

In case you want to find out relationship between two distinct features, you can create a bivariate report.

drift.bivariate_report("passenger_count", "tip_amount")

img

You can de/serialize DriftReport instance if required.

serialized = drift.to_dict()
drift2 = DriftReport.from_dict(serialized)
drift2.detailed_report()