The code in this repository is intended for use with Google Cloud Platform.
Before uploading, you need to create the dataset in the data
directory. The script 00_make_data_csv_random.py
creates a CSV file with random data.
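As a rough illustration of this step, the sketch below writes a small CSV of random rows using only the standard library. The function name make_random_csv, the column names, and the output path are assumptions made for this example; the actual script may generate different fields.

```python
import csv
import random
from datetime import date, timedelta
from pathlib import Path


def make_random_csv(path, rows=100, seed=None):
    """Write `rows` rows of random data to `path` and return the path."""
    rng = random.Random(seed)  # a seed makes the output reproducible
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)  # create data/ if missing
    with path.open("w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        # Hypothetical columns, chosen only for illustration.
        writer.writerow(["id", "name", "amount", "created_at"])
        for i in range(1, rows + 1):
            name = f"user_{rng.randint(1000, 9999)}"
            amount = round(rng.uniform(1.0, 500.0), 2)
            created = date(2020, 1, 1) + timedelta(days=rng.randint(0, 365))
            writer.writerow([i, name, amount, created.isoformat()])
    return path
```

Calling make_random_csv("data/random.csv", rows=1000) would produce a header line plus 1000 data rows.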
After the data is generated, it must be converted from CSV to Avro. The script 01_csv_to_avro.py
does this. Important: if your data contains date, datetime, or timestamp fields, these fields must be converted explicitly. The script includes an example.
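The reason for the conversion is that Avro stores dates and timestamps as integers annotated with a logical type: date is an int counting days since 1970-01-01, and timestamp-micros is a long counting microseconds since the Unix epoch. The sketch below shows one possible way to do this with the standard library. The helper names, the schema fragment, and the choice to treat naive datetimes as UTC are assumptions for this example, not necessarily what 01_csv_to_avro.py does.

```python
from datetime import date, datetime, timezone

EPOCH_DATE = date(1970, 1, 1)
EPOCH_DATETIME = datetime(1970, 1, 1, tzinfo=timezone.utc)

# Hypothetical schema fragment showing how the fields would be declared.
EXAMPLE_FIELDS = [
    {"name": "created_at", "type": {"type": "int", "logicalType": "date"}},
    {"name": "updated_at", "type": {"type": "long", "logicalType": "timestamp-micros"}},
]


def date_to_avro_days(value):
    """Parse an ISO date string and return days since the Unix epoch."""
    return (date.fromisoformat(value) - EPOCH_DATE).days


def datetime_to_avro_micros(value):
    """Parse an ISO datetime string and return microseconds since the epoch.

    Naive values are assumed to be UTC (an assumption of this sketch).
    """
    parsed = datetime.fromisoformat(value)
    if parsed.tzinfo is None:
        parsed = parsed.replace(tzinfo=timezone.utc)
    delta = parsed - EPOCH_DATETIME
    # Integer arithmetic avoids floating-point rounding of total_seconds().
    return delta.days * 86_400_000_000 + delta.seconds * 1_000_000 + delta.microseconds
```

Some Avro libraries accept date and datetime objects directly when the schema declares a logical type, in which case parsing the CSV strings into those objects may be enough.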
The 02_upload_gcs.py
script uploads the data to Google Cloud Storage. It requires a .env file
containing the line export GOOGLE_APPLICATION_CREDENTIALS=<your_gcp_key.json>,
which points to your service account key.
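A minimal sketch of this step follows, assuming the google-cloud-storage package is installed. The function names load_env_file and upload_file, and all bucket and file names, are hypothetical. The .env parser here is a simple stand-in (the repository may load the file differently, for example by sourcing it in the shell) and it tolerates spaces around the equals sign.

```python
import os


def load_env_file(path=".env"):
    """Read KEY=VALUE lines (optionally prefixed with `export`) into os.environ."""
    with open(path, encoding="utf-8") as fh:
        for raw in fh:
            line = raw.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue  # skip blanks, comments, and malformed lines
            if line.startswith("export "):
                line = line[len("export "):]
            key, _, value = line.partition("=")
            os.environ[key.strip()] = value.strip().strip("'\"")


def upload_file(bucket_name, source_path, destination_name):
    """Upload a local file to a Cloud Storage bucket and return its gs:// URI."""
    # Imported here so the .env helper works without the package installed.
    from google.cloud import storage

    client = storage.Client()  # picks up GOOGLE_APPLICATION_CREDENTIALS
    blob = client.bucket(bucket_name).blob(destination_name)
    blob.upload_from_filename(source_path)
    return f"gs://{bucket_name}/{destination_name}"
```

Typical use would be load_env_file() followed by upload_file("my-bucket", "data/random.avro", "random.avro"), where the bucket must already exist.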
The 03_create_table_bq.py
script creates the table in BigQuery. It requires a .env file
containing the line export GOOGLE_APPLICATION_CREDENTIALS=<your_gcp_key.json>,
which points to your service account key.
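Table creation could look roughly like the sketch below, assuming the google-cloud-bigquery package is installed and the dataset already exists. The schema, the helper names, and the project, dataset, and table names are all hypothetical; the real table definition lives in 03_create_table_bq.py.

```python
# Hypothetical schema as (name, type, mode) tuples, for illustration only.
EXAMPLE_SCHEMA = [
    ("id", "INTEGER", "REQUIRED"),
    ("name", "STRING", "NULLABLE"),
    ("amount", "FLOAT", "NULLABLE"),
    ("created_at", "DATE", "NULLABLE"),
]


def full_table_id(project, dataset, table):
    """Build the `project.dataset.table` identifier BigQuery expects."""
    return f"{project}.{dataset}.{table}"


def create_table(table_id, schema=EXAMPLE_SCHEMA):
    """Create the table if it does not exist and return the Table object."""
    # Imported here so the helpers above work without the package installed.
    from google.cloud import bigquery

    client = bigquery.Client()  # picks up GOOGLE_APPLICATION_CREDENTIALS
    fields = [bigquery.SchemaField(name, kind, mode=mode) for name, kind, mode in schema]
    table = bigquery.Table(table_id, schema=fields)
    # exists_ok=True makes the call safe to repeat.
    return client.create_table(table, exists_ok=True)
```

For example, create_table(full_table_id("my-project", "my_dataset", "random_data")) would create the table under those assumed names.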
The 04_gcs_bq.py
script loads the data from Google Cloud Storage into BigQuery. It requires a .env file
containing the line export GOOGLE_APPLICATION_CREDENTIALS=<your_gcp_key.json>,
which points to your service account key.
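The load step can be sketched as a BigQuery load job that reads an Avro file from a gs:// URI. This assumes the google-cloud-bigquery package is installed; the helper names and any bucket, file, or table names are hypothetical. Enabling use_avro_logical_types is a choice made in this sketch so that Avro date and timestamp logical types arrive as DATE and TIMESTAMP columns rather than raw integers.

```python
def gcs_uri(bucket_name, object_name):
    """Build the gs:// URI for an object in a bucket."""
    return f"gs://{bucket_name}/{object_name}"


def load_avro_from_gcs(uri, table_id):
    """Load an Avro file from Cloud Storage into a table; return its row count."""
    # Imported here so gcs_uri works without the package installed.
    from google.cloud import bigquery

    client = bigquery.Client()  # picks up GOOGLE_APPLICATION_CREDENTIALS
    job_config = bigquery.LoadJobConfig()
    job_config.source_format = bigquery.SourceFormat.AVRO
    job_config.use_avro_logical_types = True  # map logical types to DATE/TIMESTAMP
    load_job = client.load_table_from_uri(uri, table_id, job_config=job_config)
    load_job.result()  # block until the job finishes; raises on failure
    return client.get_table(table_id).num_rows
```

A call such as load_avro_from_gcs(gcs_uri("my-bucket", "random.avro"), "my-project.my_dataset.random_data") would append the file's rows to the table under those assumed names.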
The code in this repository has no connection with any company or group. It is intended exclusively for study with Google Cloud Platform.