/python_rtf_file_analyzer

Read RTF file, convert it to a text string, scrap some info from it, and then for each file create txt and excel file from scrapped data

Primary LanguageRich Text FormatMIT LicenseMIT

Articles and Conferences Transcript Content Analyzer

Installation

Make sure python >= 3.7 has been installed

Install following packages using pip in terminal:(Copy and paste the below code in your terminal)

  • pip install striprtf pandas xlrd XlsxWriter

Check these before start

  • The excel file name should be data.xlsx and must next to the analyzer.py

  • All rtf files should be in ECs folder, next to the analyzer.py. Just put all rtf files inside the ECs folder without nesting.

  • In Windows RUN CMD as Administrator.[in project directory hold shift+right click to see open PowerShell here in context menu]

Usage

Open your OS Terminal ( like CMD or Mac Terminal )

Type below command for more info and help:

python analyzer.py --help

Type below command for starting parse process:

python analyzer.py --start