This repository contains scripts that analyse the Users-Subreddits network of Reddit using Apache Spark.
-
- Creates itemset buckets for use in this script
-
- Converts the subreddit IDs to names in the output of this script
-
- Converts the subreddit IDs to names in the output of this script
- Java 8 JDK
- Scala (https://www.scala-lang.org/)
- sbt (http://www.scala-sbt.org/)
- Generate the binaries with
  sbt universal:packageZipTarball
- Uncompress the generated file located at
  target/universal/cn-project-0.1.0-SNAPSHOT.tgz
- Run the script inside
  <extractedDirectory>/bin
  that has the same name as the Scala script you want to run
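
The steps above can be sketched as a small shell helper, run from the repository root. This is only an illustration: the function names `extracted_dir` and `build_and_run` are made up for this sketch, and it assumes the tarball unpacks into a top-level directory named after the tarball without its `.tgz` suffix. The script name passed in is a placeholder for whichever Scala script you want to run.

```shell
#!/usr/bin/env bash
# Sketch of the build, unpack, and run steps (hypothetical helper names).

# Path of the tarball produced by the sbt packaging step.
TARBALL="target/universal/cn-project-0.1.0-SNAPSHOT.tgz"

# Print the directory the tarball is expected to unpack into.
# Assumption: it equals the tarball file name without the .tgz suffix.
extracted_dir() {
  basename "$1" .tgz
}

# Build the binaries, unpack them in the current directory, and run one
# launcher. $1 is the name of the launcher in <extractedDirectory>/bin.
build_and_run() {
  local script_name="$1"
  sbt universal:packageZipTarball &&
    tar -xzf "$TARBALL" &&
    "./$(extracted_dir "$TARBALL")/bin/$script_name"
}
```

Once the file is sourced, calling `build_and_run <scriptName>` performs all three steps in order and stops at the first one that fails.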