crate/cratedb-toolkit

Share and use datasets via Python code

amotl opened this issue · 2 comments

amotl commented

About

Easily consume datasets from tutorials and/or production applications like others are doing it, using Python code.

References

Standards

About

Those patches add a corresponding miniature subsystem, and bring it into application. With them, cratedb-toolkit will provide convenient access to cratedb-datasets.

Synopsis

from cratedb_toolkit.datasets import load_dataset

dataset = load_dataset("tutorial/weather-basic")
dataset.dbtable(dburi="crate://crate@localhost/", table="weather_data").load()

About

Provide access to datasets at Kaggle, to be easily consumed by tutorials and/or production applications.

Synopsis

from cratedb_toolkit.datasets import load_dataset

dataset = load_dataset("kaggle://guillemservera/global-daily-climate-data/daily_weather.parquet")

# Only download once, nothing else.
dataset.acquire()

# Create table schema in database.
dataset.dbtable(dburi="crate://crate@localhost/", table="kaggle_daily_weather").create()