/hadoop-mapreduce-publication-data-analysis

This project uses the Apache Hadoop framework and its MapReduce primitives to analyze publication data. The data, distributed as the dblp dataset, is analyzed for metrics such as the number of publications per author, the number of publications per year, and the number of co-authors per paper.
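The counting jobs described above can be sketched in plain Scala, without any Hadoop dependency, to show the map, shuffle, and reduce steps. This is a minimal illustration rather than the repository's actual code: the `Publication` record, the `PublicationStats` object, and all function names are hypothetical, and the real dblp dataset is XML with many more fields.

```scala
// Hypothetical, simplified record; the real dblp dataset is XML with more fields.
final case class Publication(title: String, year: Int, authors: List[String])

object PublicationStats {
  // Map step: emit one (author, 1) pair for every author of a publication.
  def mapAuthors(p: Publication): List[(String, Int)] =
    p.authors.map(author => (author, 1))

  // Map step: emit one (year, 1) pair for every publication.
  def mapYear(p: Publication): List[(Int, Int)] =
    List((p.year, 1))

  // Shuffle and reduce steps: group the pairs by key, then sum the values per key.
  def reduceCounts[K](pairs: Seq[(K, Int)]): Map[K, Int] =
    pairs.groupBy(_._1).map { case (key, grouped) => key -> grouped.map(_._2).sum }

  def publicationsPerAuthor(pubs: Seq[Publication]): Map[String, Int] =
    reduceCounts(pubs.flatMap(mapAuthors))

  def publicationsPerYear(pubs: Seq[Publication]): Map[Int, Int] =
    reduceCounts(pubs.flatMap(mapYear))

  // Assumption: "co-authors per paper" is taken here as the author count minus one,
  // with zero for a paper that lists no more than one author. Keyed by title for
  // simplicity, so titles are assumed unique in this sketch.
  def coAuthorsPerPaper(pubs: Seq[Publication]): Map[String, Int] =
    pubs.map(p => p.title -> math.max(p.authors.size - 1, 0)).toMap
}
```

In an actual Hadoop job the map functions would live in a `Mapper` and the summation in a `Reducer`, with the framework performing the grouping between them; the collection-based version only mirrors that data flow on a single machine.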

Primary Language: Scala
