Dawrly

A crawler-based search engine that demonstrates the main features of a search engine (web crawling, indexing, and ranking) and the interaction between them.



πŸ“ Table of Contents

📙 About

  • A crawler-based search engine that demonstrates the main features of a search engine (web crawling, indexing, and ranking) and the interaction between them.
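As a rough illustration of how indexing and ranking interact, here is a minimal sketch of an in-memory inverted index with term-frequency scoring. It is not Dawrly's implementation: the class name `MiniSearchSketch`, its methods, and the example URLs are hypothetical, the crawler is replaced by hard-coded text, and plain term frequency is only one simple way to score documents.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; none of these names come from the Dawrly code base.
final class MiniSearchSketch {
    // term -> (document URL -> number of times the term occurs in that document)
    private final Map<String, Map<String, Integer>> index = new HashMap<>();

    // Indexing step: tokenize text that a crawler would have fetched and count terms.
    void addDocument(String url, String text) {
        for (String term : tokenize(text)) {
            index.computeIfAbsent(term, t -> new HashMap<>()).merge(url, 1, Integer::sum);
        }
    }

    // Ranking step: score each document by the summed frequency of the query terms.
    List<String> search(String query) {
        Map<String, Integer> scores = new HashMap<>();
        for (String term : tokenize(query)) {
            Map<String, Integer> postings = index.get(term);
            if (postings == null) {
                continue; // term never seen during indexing
            }
            postings.forEach((url, count) -> scores.merge(url, count, Integer::sum));
        }
        List<String> ranked = new ArrayList<>(scores.keySet());
        // Highest score first; ties are broken by URL so the order is deterministic.
        ranked.sort((a, b) -> {
            int byScore = Integer.compare(scores.get(b), scores.get(a));
            return byScore != 0 ? byScore : a.compareTo(b);
        });
        return ranked;
    }

    // Lower-case the text and split it on anything that is not a letter or digit.
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String part : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!part.isEmpty()) {
                terms.add(part);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        MiniSearchSketch engine = new MiniSearchSketch();
        engine.addDocument("https://example.com/a", "Java search engine written in Java");
        engine.addDocument("https://example.com/b", "A web crawler and a search index");
        // prints [https://example.com/a, https://example.com/b]
        System.out.println(engine.search("java search"));
    }
}
```

The first page ranks higher because it contains both query terms and "java" twice. A real engine would typically persist the index (for example in a database) and use a richer scoring formula.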

📃 Documentation

  • Open the index.html file in the documentation folder.

🏁 Getting Started

Follow these instructions to set up the project and get a local copy up and running.

Prerequisites

  1. Apache Tomcat 9
  2. MongoDB
  3. Any Java IDE (IntelliJ IDEA in our case)

Installation

  1. Clone the repository
$ git clone https://github.com/ZeyadTarekk/Dawrly.git
  2. Right-click the cloned folder and open it with the IDE

Running


  1. Run Crawler_Indexer_Main to start the Crawler, then the Indexer.
  2. Unzip "apache-tomcat-9.0.62.zip".
  3. Copy the contents of Interface to "apache-tomcat-9.0.62\webapps\SearchEngine".
  4. Copy "stopwords.txt" and "bodyFiles" to "apache-tomcat-9.0.62\bin".
  5. Run the main function in the RunGUI class.

💻 Built Using

  • Java

📷 Demo Screenshots

Main Screen

Suggestion Mechanism

Results

Pagination of results

Phrase Searching
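Phrase searching is commonly built on a positional index, which records where each term occurs so that consecutive terms can be checked. The sketch below shows that general technique under stated assumptions: the class name `PhraseSearchSketch`, its methods, and the document ids are hypothetical, and nothing here is taken from how Dawrly actually implements the feature.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; none of these names come from the Dawrly code base.
final class PhraseSearchSketch {
    // term -> (document id -> token positions at which the term occurs)
    private final Map<String, Map<String, List<Integer>>> positions = new HashMap<>();

    // Record the position of every token in the document.
    void addDocument(String id, String text) {
        List<String> tokens = tokenize(text);
        for (int i = 0; i < tokens.size(); i++) {
            positions.computeIfAbsent(tokens.get(i), t -> new HashMap<>())
                     .computeIfAbsent(id, d -> new ArrayList<>())
                     .add(i);
        }
    }

    // Return the ids of documents that contain the query terms consecutively, in order.
    Set<String> searchPhrase(String phrase) {
        List<String> terms = tokenize(phrase);
        Set<String> matches = new TreeSet<>();
        if (terms.isEmpty()) {
            return matches;
        }
        Map<String, List<Integer>> first = positions.get(terms.get(0));
        if (first == null) {
            return matches;
        }
        for (Map.Entry<String, List<Integer>> entry : first.entrySet()) {
            for (int start : entry.getValue()) {
                if (restFollows(entry.getKey(), terms, start)) {
                    matches.add(entry.getKey());
                    break; // one occurrence is enough for this document
                }
            }
        }
        return matches;
    }

    // Check that the k-th query term occurs at position start + k in the document.
    private boolean restFollows(String doc, List<String> terms, int start) {
        for (int k = 1; k < terms.size(); k++) {
            Map<String, List<Integer>> byDoc = positions.get(terms.get(k));
            List<Integer> where = (byDoc == null) ? null : byDoc.get(doc);
            if (where == null || !where.contains(start + k)) {
                return false;
            }
        }
        return true;
    }

    // Lower-case the text and split it on anything that is not a letter or digit.
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String part : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!part.isEmpty()) {
                terms.add(part);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        PhraseSearchSketch idx = new PhraseSearchSketch();
        idx.addDocument("d1", "the quick brown fox");
        idx.addDocument("d2", "brown quick fox");
        System.out.println(idx.searchPhrase("quick brown")); // prints [d1]
    }
}
```

Both documents contain "quick" and "brown", but only d1 has them adjacent and in that order, which is the difference between a phrase query and an ordinary keyword query.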

Contributors

Zeyad Tarek
Abdelrahman Mohamed
Beshoy Morad
Ziad Sherif

License

This software is licensed under the MIT License. See License for more information. © ZeyadTarekk.