HackJaipur

Team

  1. Harshit aggarwal
  2. Jayvardhan Rathi
  3. Gaurav Singh Chandrabhan
  4. Sandipan Das

Multi Office Summarizer

An abstractive summarizer for office needs Our group's idea was to use summarization methods for an office environment. It is used to summarise meetings by converting speech to text. The text file is then sent to the summariser to generate a summary. We have also built an optical charcter recognition system so that we can extract text data from pdf and convert it to a txt file and this txt file will also be fed to the summarizer.

Challenges we ran into

Running the Tesseract and other related libraies we ran into dependency issues and finally we overcame it by using Linux and was not working with Windows. Moreover we faced some difficulties with ML model integration with HTML rendered webpage.

Technologies we used

  1. Tensorflow
  2. Tesseract
  3. DeepSpeech
  4. HTML
  5. CSS
  6. Python
  7. Flask

Youtube

https://www.youtube.com/watch?v=qSe7xUWj7zQ&feature=youtu.be

Website

https://diplern.github.io/Website/Index-1.html