This repo contains code, documentation, and examples related to data enclave support for the RAISE project.
Research users can find details on how to use RAISE enclaves here.
When developing code for this repo, developers may want to install the project in editable mode:
$ pip install -e .
The code can be linted and tested locally as well:
$ pip install .[test]
$ flake8
$ pytest
Code coverage reports can be generated when running tests:
$ pytest --cov=enclave_mgmt --cov-report=term --cov-report=html
The compile models script is designed to facilitate the process of data collection and validation for researchers.
- compile_models.py
- This script serves as the entry point. It orchestrates the execution of various components.
- collect_data.py
- The
collect_data.py
script leverages theboto3
library to interact with S3 and uses thepandas
library to create dataframes for subsequent processing.
- create_models.py
- The
create_models.py
script takes the data collected bycollect_data.py
and transforms it to be used in the Pydantic models. This process also validates the data.
- models.py
- The
models.py
script acts as a centralized location for storing the Pydantic models.