This repository contains code examples demonstrating how Bodo accelerates and scales
Pandas and Scikit-learn workloads automatically. All examples can be run on a local laptop
using Bodo Community Edition (pip install bodo).
In addition, there are examples for using Bodo with data infrastructure tools such as Docker, Terraform, Kubernetes and Streamlit. Feedback is appreciated.
First make sure you have Bodo installed.
scikit-learn is also required for the ML examples.
Many of the example scripts accept optional arguments. Running
python <path>/<script.py> --help
shows the usage.
By default, all example scripts can be run from the top directory (Bodo-examples) without any changes. If you run them from another directory, change the paths of the data files accordingly. For more information on an example, see the docstring at the top of its Python script.
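The conventions described above (a docstring at the top, optional arguments, and a default data path relative to the top directory) can be sketched as follows. This is a minimal illustration only, not one of the scripts in this repository: the --file option, the default path data/sample.csv, and the function names are hypothetical, and the actual scripts may be organized differently.

```python
"""
Hypothetical example script illustrating the layout described above.

Usage:
    python sketch.py [--file PATH]
"""
import argparse


def build_parser():
    # Reuse the module docstring as the text shown by --help.
    parser = argparse.ArgumentParser(
        description=__doc__,
        formatter_class=argparse.RawDescriptionHelpFormatter,
    )
    # Hypothetical optional argument; the default is a path relative to the
    # directory the script is launched from (assumed to be the top directory).
    parser.add_argument(
        "--file",
        default="data/sample.csv",
        help="path to the input data file",
    )
    return parser


def main(argv=None):
    # argv=None makes argparse read the real command line (sys.argv[1:]).
    args = build_parser().parse_args(argv)
    print(f"Reading data from {args.file}")
    return args


if __name__ == "__main__":
    main()
```

With a layout like this, running the script with no arguments uses the default path, and passing --file overrides it when the script is launched from a different directory.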
- TPC-H queries; more information on TPC-H can be found here
For example, to run the beer reviews example:
# run example on 8 cores
mpiexec -n 8 python examples/beer-reviews/beer-reviews.py
More documentation can be found at http://docs.bodo.ai.
The Bodo tutorial can be found here.