Information-Retrieval-Engine-for-Sinhala-Song-Lyrics

Search engine for Sinhala song lyrics

Description

This repo contains the project's source code files.

Get Started

  • Building the Lyrics Corpus
    Use scrapingTool.py, located in Sinhala song lyrics scraping tool/, to scrape the website that hosts the Sinhala lyrics
  • Convert the scraped CSV data file to JSON using any online tool
  • Download Elasticsearch
  • Extract the zip file, open the bin folder and run Elasticsearch
  • Verify that Elasticsearch is running by visiting http://localhost:9200/
  • Use SearchTool.py in the ElasticSearch folder to test your queries
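
As an alternative to an online converter, the CSV to JSON step can be done locally. The following is a minimal sketch using only the Python standard library. It assumes the scraped CSV has a header row and is UTF-8 encoded; the function name `csv_to_json` and the file names in the usage note are hypothetical and not part of this repo.

```python
import csv
import json


def csv_to_json(csv_path, json_path):
    """Convert a CSV file with a header row into a JSON array of objects.

    Returns the number of rows written.
    """
    # newline="" lets the csv module handle line breaks inside quoted fields.
    with open(csv_path, encoding="utf-8", newline="") as src:
        rows = list(csv.DictReader(src))

    # ensure_ascii=False keeps Sinhala text readable instead of \u escapes.
    with open(json_path, "w", encoding="utf-8") as dst:
        json.dump(rows, dst, ensure_ascii=False, indent=2)

    return len(rows)
```

For example, `csv_to_json("lyrics.csv", "lyrics.json")` would write one JSON object per scraped song, keyed by the CSV column names.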

Supported Features

  • Search by song title, artist name, music composer, melody and lyrics author.
  • Search by song content
  • Supports both English and Sinhala search queries
  • Faceted search for filtering by artist, music, melody and lyrics author
  • Search for lyrics using a YouTube video ID
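
To illustrate how faceted search could be expressed, here is a rough sketch that builds an Elasticsearch request body as a plain Python dict: a `multi_match` query over the text fields, optional `term` filters, and one `terms` aggregation per facet. The field names (`title`, `lyrics`, `artist`, `music`, `melody`, `lyrics_author`), the `.keyword` sub-fields and the function name are assumptions made for this example; the actual index mapping used by SearchTool.py may differ.

```python
def build_faceted_query(text, filters=None,
                        facet_fields=("artist", "music", "melody", "lyrics_author")):
    """Build a search body with full-text matching, filters and facet counts.

    All field names here are hypothetical placeholders.
    """
    filters = filters or {}
    return {
        "query": {
            "bool": {
                # Full-text match on the assumed title and lyrics fields.
                "must": [
                    {"multi_match": {"query": text, "fields": ["title", "lyrics"]}}
                ],
                # Exact-value filters chosen by the user from the facets.
                "filter": [
                    {"term": {f"{field}.keyword": value}}
                    for field, value in filters.items()
                ],
            }
        },
        # One terms aggregation per facet returns the value counts to display.
        "aggs": {
            f"{field}_facet": {"terms": {"field": f"{field}.keyword", "size": 10}}
            for field in facet_fields
        },
    }
```

The returned dict can be sent to the `_search` endpoint of the index; the bucket counts under `aggregations` in the response would then drive the filter options.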

Indexing and Querying

  • Text classification
  • Boosting
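
One way the two techniques above might fit together is sketched below: a simple rule-based classifier looks for hint words in the query and the matching fields are boosted in a `multi_match` query using Elasticsearch's `field^boost` syntax. This is only an illustration under stated assumptions. The hint words, the field names and the boost value are all hypothetical, and the project's own classification rules may work differently.

```python
# Hypothetical hint words per field; not taken from the project source.
FIELD_HINTS = {
    "artist": ("sung", "singer", "ගායකයා"),
    "lyrics_author": ("written", "lyricist", "ලියූ"),
    "music": ("composed", "composer", "සංගීතය"),
}


def classify_query(query):
    """Return the fields whose hint words appear in the query."""
    lowered = query.lower()
    return [
        field
        for field, hints in FIELD_HINTS.items()
        if any(hint in lowered for hint in hints)
    ]


def build_boosted_query(query,
                        base_fields=("title", "artist", "lyrics_author", "music", "lyrics"),
                        boost=3):
    """Build a multi_match body that boosts the fields hinted at by the query."""
    boosted = set(classify_query(query))
    # "field^3" asks Elasticsearch to weight matches in that field three times higher.
    fields = [f"{name}^{boost}" if name in boosted else name for name in base_fields]
    return {"query": {"multi_match": {"query": query, "fields": fields}}}
```

With these assumed rules, a query such as "songs sung by Amaradeva" would boost the `artist` field while still searching the others.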

Details

For detailed descriptions, please refer to the README.md file in each project folder.

Version

  • Version : 1.0
  • Last Scraped : 24/June/2020