pip install pyspark
To run the consumer: spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.12:3.1.2 consumer.py
To run the producer: python3 producer.py
Run the producer first and then the consumer
MongoDB: https://www.mongodb.com/docs/manual/tutorial/install-mongodb-on-ubuntu/