japila-books/apache-spark-internals

rdd.filter() does not preserve partitioning?

rtkaleta opened this issue · 0 comments

Hi, thanks for writing the notebooks.

In the introduction to RDD partitioning you mention that the filter operation does not preserve partitioning. However, in the source code of filter I see that MapPartitionsRDD is instantiated with preservesPartitioning = true.

Doesn't this conflict with your statement?
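
For what it's worth, here is a minimal sketch of how this could be checked outside the source code. It assumes Spark is on the classpath and is written script-style (e.g. for spark-shell); the local[2] master, the app name, and the sample data are my own placeholders, not anything from the book. It compares the partitioner of a partitioned pair RDD with the partitioner after filter and, for contrast, after map (which does not pass preservesPartitioning = true).

```scala
import org.apache.spark.{HashPartitioner, SparkConf, SparkContext}

// Assumption: a throwaway local context, used only for this check.
val conf = new SparkConf()
  .setMaster("local[2]")
  .setAppName("filter-partitioner-check")
val sc = SparkContext.getOrCreate(conf)

// A pair RDD with an explicit partitioner attached.
val pairs = sc
  .parallelize(Seq(1 -> "a", 2 -> "b", 3 -> "c"))
  .partitionBy(new HashPartitioner(2))

// filter only drops elements and never changes keys.
val filtered = pairs.filter { case (k, _) => k > 1 }

// map may change keys, so it is the contrasting case.
val mapped = pairs.map { case (k, v) => (k + 1, v) }

// RDD.partitioner is an Option[Partitioner].
println(s"pairs:    ${pairs.partitioner}")
println(s"filtered: ${filtered.partitioner}")
println(s"mapped:   ${mapped.partitioner}")

sc.stop()
```

If my reading of the source is right, I would expect filtered to report the same partitioner as pairs, and mapped to report none.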