/streaming_pipeline

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Streaming Pipeline

Prerequisites

Preffered use conda to create virtual env. Install Anaconda if you don't have it and run command below. It will create env.

conda env create -f environment.yaml

Run locally from main

python main.py io-pipeline
python main.py streaming-pipeline

Kubernetis Setup

Kafka

helm install kafka bitnami/kafka  --set externalAccess.enabled=true --set externalAccess.service.type=LoadBalancer --set externalAccess.service.port=9094 --set externalAccess.autoDiscovery.enabled=true --set serviceAccount.create=true --set rbac.create=true

MongoDB

helm install mongodb bitnami/mongodb  --set service.type=LoadBalancer