/Wikipedia_Search_Engine

A TF-IDF (Term Frequency & Inverse Document Frequency) based search algorithm for searching a small subset of Wikipedia Data using Apache Spark Cluster of 3 Nodes on top of HDFS, hosted on AWS, having web UI with Django.

Primary LanguagePython

Watchers