Objective: Count the number of times each word occurs in gs://dataflow-samples/shakespeare/kinglear.txt
and output it into a file
Create venv
python -m venv env
Activate it:
source env/bin/activate
Install requirements:
pip install -r requirements.txt
Run the pipeline:
python pipeline.py --output output.txt