GoogleCloudPlatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
Python · Apache-2.0
Issues
- 0
Dependency Dashboard
#51 opened by forking-renovate - 2
!
#740 opened by jackoff482940 - 0
Apple M1 numpy install issue
#512 opened by brandonruhl - 0
Dataset Pipeline looks misplaced.
#277 opened by happyhuman - 0
Add Dockerfile linter as a check
#187 opened by adlersantos - 1
- 0
- 1
Use backport providers for DAG generation
#12 opened by leahecole - 5
Containerize custom tasks
#5 opened by adlersantos - 0
Replace COVID-19 sample dataset in README
#126 opened by adlersantos - 1
Pin dependencies + add bot
#28 opened by leahecole - 0
Add header-checker bot for license header checking
#93 opened by leahecole - 0
Mechanism to store shared variables used by multiple DAGs in a single place
#100 opened by adlersantos - 0
Support partitioning and clustering for BQ tables
#114 opened by adlersantos - 0
Support JSON file references in YAML config
#69 opened by adlersantos - 0
Support self-referencing variables in YAML
#65 opened by adlersantos - 0
Add the ability to deploy a single pipeline
#14 opened by adlersantos - 0
Add renovate bot
#34 opened by adlersantos - 1
Terminology change: from datasets > dataset > pipeline; pipelines > pipeline_group > pipeline
#30 opened by adlersantos - 0
Templating tool for simple pipelines
#31 opened by adlersantos - 0
automatically test pipelines
#11 opened by tswast - 0
automatically generate DAGs and Terraform for PRs
#10 opened by tswast - 1