rdd-vs-df

There are 1 repositories under rdd-vs-df topic.

  • apache-spark-evaluation

    apache-spark-evaluation

    Evaluates the execution time differences between RDD (Resilient Distributed Datasets) and DataFrame data structures in Apache Spark. Also takes into account the file format being used, such as CSV or Parquet.

    Language:Python1