/spark-mines

Wikipedia text mining and article search on Spark

Primary LanguageShellMIT LicenseMIT

spark-mines

A text mining project on Wikipedia using Apache Spark - word frequencies, topic modelling, PageRank, and a mini search-engine for Wikipedia articles.