/TopKeywords

Find top keywords by count using Apache Spark

Primary LanguageScala

Spark Top Keywords

Get top 10 keywords by category.

Note: Apache Spark is included as a dependency, separate Spark installation is not required.

Data:

  • Input file is located in ./src/main/resources/
  • Output file will be created in ./output/ folder

Running instructions:

> sbt compile
  • Run:
> sbt run

These commands may take a long time on first run.

Unit Tests:

Run test using following commands:

> sbt test