AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
Python
Issues
- 1
Wrong variables in example
#22 opened by minhsphuc12 - 1
etl_config.json not loaded in EMR
#21 opened by junjchen - 1
Does not work when running on yarn in client mode
#30 opened by Rustem - 0
Failed TestCase
#28 opened by marouenes - 1
Add License
#27 opened by ajknzhol - 0
Pass Parameters to Spark
#25 opened by sou-joshi - 0
- 0
Issue while executing the code via pycharm
#23 opened by averma111 - 1
- 1
- 0
- 0
Pyspark-best-practices
#17 opened by devmatilag - 5
- 1
import sklearn fails
#13 opened by divayjindal95 - 1
PEX integration?
#11 opened by archenroot - 1
YAML instead of JSON
#12 opened by archenroot - 4
- 7