parallel-keyword-search

Academic project for CSE4001 - Parallel and Distributed Computing

Primary language: PHP. License: MIT.

CSE4001 Parallel and Distributed Computing

Team Members

  • Aryan Blouria - 19BCE0330
  • Aryan Vats - 19BCE0336
Abstract

  • Today's word search engines are efficient enough for current processing speeds, but as technology evolves, search will have to become faster. The method in common use is sequential: the articles are scanned one after another on a single thread. On metrics such as search time, sequential methods perform much worse than multithreaded search.
  • Keyword search over news articles is a necessity because readers care about how long it takes to find the right topic. With a search tool for news articles, a reader can find the articles that carry the important information without hassle. In this project we plan to implement a search tool for Indian news articles (using a news articles dataset taken from the Internet), based on multithreading with OpenMP.
  • Dataset: Indian News Articles Dataset (from Kaggle). This dataset contains Indian news articles from multiple sources such as Firstpost.com and ndtv.com. The main column needed for this project is the ‘content’ column.
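
The multithreaded search described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes the ‘content’ column has already been loaded into a vector of strings, and the names `to_lower` and `parallel_keyword_search` are hypothetical. Matching is a plain case-insensitive substring test (ASCII only), which may differ from what the project does.

```cpp
#include <algorithm>
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: lower-case a copy of the text so that matching is
// case-insensitive (ASCII only).
static std::string to_lower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(), [](unsigned char c) {
        return static_cast<char>(std::tolower(c));
    });
    return s;
}

// Hypothetical function: return the indices of all articles whose content
// contains the keyword. `contents` is assumed to hold the 'content' column,
// one article per element.
std::vector<std::size_t> parallel_keyword_search(
        const std::vector<std::string>& contents,
        const std::string& keyword) {
    const std::string needle = to_lower(keyword);
    const long long n = static_cast<long long>(contents.size());

    // One flag per article. Each loop iteration writes only its own flag,
    // so the threads need no locking.
    std::vector<char> hit(contents.size(), 0);

    // OpenMP splits the articles across threads. Dynamic scheduling is used
    // because article lengths vary, so iterations take unequal time.
    #pragma omp parallel for schedule(dynamic)
    for (long long i = 0; i < n; ++i) {
        const std::string haystack = to_lower(contents[i]);
        hit[i] = (haystack.find(needle) != std::string::npos) ? 1 : 0;
    }

    // Collect the matching indices sequentially, in ascending order.
    std::vector<std::size_t> matches;
    for (std::size_t i = 0; i < hit.size(); ++i) {
        if (hit[i]) matches.push_back(i);
    }
    return matches;
}
```

With GCC this would be compiled with the `-fopenmp` flag; without it the pragma is ignored and the loop runs sequentially, which gives a convenient baseline for comparing search times. An empty keyword matches every article in this sketch.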