- Challenges with RDD
- RDD to DataFrame conversion
- Data ingestion from
  - RDBMS (MySQL)
  - NoSQL (MongoDB)
  - SFTP
  - Amazon S3 bucket
  - Amazon Redshift database
- Applying transformation using
  - DSL (Domain-Specific Language)
  - Spark SQL
  - Window/analytic functions (lead(), lag(), rank(), dense_rank(), etc.)
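
The window functions named in the last item are easiest to tell apart on a small ordered partition. The sketch below is plain Python, not the Spark API: it only mimics what rank(), dense_rank(), lag() and lead() would return for a single partition ordered by score descending. The helper name `window_columns` and the sample scores are hypothetical, chosen for illustration.

```python
from typing import List, Optional, Tuple

Row = Tuple[int, int, int, Optional[int], Optional[int]]


def window_columns(scores: List[int]) -> List[Row]:
    """Return (score, rank, dense_rank, lag, lead) for one partition,
    ordered by score descending. Illustrative only, not Spark code."""
    ordered = sorted(scores, reverse=True)
    rows: List[Row] = []
    rank = dense = 0
    for i, score in enumerate(ordered):
        # A new rank is assigned only when the value changes (ties share a rank).
        if i == 0 or score != ordered[i - 1]:
            rank = i + 1   # rank leaves gaps after ties
            dense += 1     # dense_rank never leaves gaps
        lag = ordered[i - 1] if i > 0 else None                   # previous row's value
        lead = ordered[i + 1] if i + 1 < len(ordered) else None   # next row's value
        rows.append((score, rank, dense, lag, lead))
    return rows


rows = window_columns([90, 80, 90, 70])
for row in rows:
    print(row)
```

With the tied top scores, rank jumps from 1 to 3 while dense_rank moves from 1 to 2, and lag/lead are empty at the partition edges.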