data-as-code/dac

Support `load.py` and `schema.py` as templates with values injected during `dac pack`

Opened this issue · 0 comments

As a dac package producer
I would like to have the possibility to insert parameters in the load.py and schema.py files that will be filled when running dac pack
so that I can re-use the same templates

Example

import dask.dataframe as dd


def load() -> dd.DataFrame:
    return dd.read_parquet(path="az://STORAGE_CONTAINER_NAME/DATA_FOLDER/*.parquet", 
                           storage_options={"account_name": "STORAGE_ACCOUNT_NAME", "anon": False})

Here, STORAGE_ACCOUNT_NAME, STORAGE_CONTAINER_NAME, and DATA_FOLDER could be parametrized to allow re-usability