Wikipedia-Search-Engine

Project Links:

Video Demonstration
Live Project (Temporary Hosting, May not be active now!)

Pre-Requisites:

Hadoop v2.7.2 Cluster of atleast 3 Nodes (Installation Source 1) (Installation Source 2)
Setup Spark Cluster on top of HDFS (Configuration Part-1) (Configuration Part-2)
Install Django and Setup Virtual Environment of Python3 (Installation) (Windows)
Preferably use PyCharm or Atom IDE (PyCharm Configuration)
Python 2 (Installation)

STEPS:

(Activate your virtual environment and clone this repository to present working directory)

STEP-1: Start Django in-built server:

python manage.py runserver 127.0.0.1:8000

STEP-2: Now open browser and goto 127.0.0.1:8000/bigdatajob to interact with the search engine. (Use credentials (testuser,test1234) to login!!)

Collaborators

Madhav Agarwal
Raj Kumar Maurya
Kunwar Ashutosh Singh

This project was the winner of BigWar, a 48 hours National Level Project Competition organized by ABESIT, Ghaziabad during BigDataThon'17 from 7-9th April 2017.

mdv3101/Wikipedia_Search_Engine

Wikipedia-Search-Engine

Project Links:

Pre-Requisites:

STEPS:

Collaborators