Cloud-native data onboarding architecture for Google Cloud Datasets

Primary language: Python. License: Apache License 2.0 (Apache-2.0).

Google Cloud Datasets: Data Pipelines and Documentation Set

public-datasets-pipelines

This repository contains the following:

  • A cloud-native data pipeline architecture for onboarding public datasets to Google Cloud Datasets.
  • A documentation set of tutorials, samples, and other articles that use the datasets hosted by the program.

For detailed documentation, please see the Wiki Pages.

Datasets

Here are some of the featured datasets onboarded using this repository and its architecture.