A curated collection of streamlined and effective scripts and tools designed specifically for data engineering tasks.

Primary language: Python. License: GNU General Public License v3.0 (GPL-3.0).

data-engineer-scripts

Simple scripts and tools for data engineers.

| Script | Description | Requirements |
| --- | --- | --- |
| aws-glueetl-costs-analysis.py | Crawls the cost data of AWS Glue ETL jobs, creates scatter plots, and helps find jobs with abnormal costs. | boto3==1.34.19, pandas==2.1.4, plotly==5.18.0 |
| databricks-sql-warehouse-unload.py | Unloads data from a Databricks SQL warehouse and saves it to a local file or AWS S3. Supported result formats are Parquet and CSV. | databricks-sdk==0.17.0, loguru==0.7.2, pyarrow==14.0.2, requests==2.31.0, redo==2.0.4 |
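
To illustrate what "finding jobs with abnormal costs" can mean in practice, here is a minimal sketch that flags outliers with a modified z-score (median and median absolute deviation). This is not necessarily how aws-glueetl-costs-analysis.py works: the function name `flag_abnormal_jobs`, the job names, the cost figures, and the 3.5 threshold are all assumptions made for the example, and it uses only the standard library rather than boto3, pandas, or plotly.

```python
from statistics import median


def flag_abnormal_jobs(job_costs, threshold=3.5):
    """Return the names of jobs whose cost is an outlier.

    Uses the modified z-score: 0.6745 * |cost - median| / MAD.
    `job_costs` maps a job name to its cost; `threshold` is an assumed cutoff.
    """
    costs = list(job_costs.values())
    med = median(costs)
    # Median absolute deviation: a spread measure that outliers barely affect.
    mad = median(abs(c - med) for c in costs)
    if mad == 0:
        # All costs are (nearly) identical, so nothing can be called abnormal.
        return []
    return [
        name
        for name, cost in job_costs.items()
        if 0.6745 * abs(cost - med) / mad > threshold
    ]


# Hypothetical per-job costs in USD; a real run would fetch these from AWS.
sample_costs = {
    "orders_etl": 1.00,
    "users_etl": 1.10,
    "events_etl": 0.90,
    "billing_etl": 1.05,
    "logs_etl": 0.95,
    "nightly_export": 10.00,
}

if __name__ == "__main__":
    for job in flag_abnormal_jobs(sample_costs):
        print(f"abnormal cost: {job} (${sample_costs[job]:.2f})")
```

The median-based score is used here because a single very expensive job would inflate a mean and standard deviation enough to hide itself. A scatter plot of cost against run time, as the script description mentions, serves the same purpose visually.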