Galaxy Environment Analysis Data Preparation
This repo is used to prepare gea data sets to be uploaded to GCP
The raw data is located under ./data/raw
The processed data is located under ./data/processed
Instructions to run
- Create
.env
file from.env.template
and fill in fields - Make sure the python dependencies are installed
pip install -r requirements.txt
- Run
python process_data.py
to generate the processed data - Run
python upload_data.py
to upload data to bucket