/parser

crawl the links and check the defined pattern for text correction

Primary LanguagePython

Parser

Crawl the links and check the defined pattern for text correction

Prerequisites

  • python >= 3.6 (tested with 3.6.1)

Procedure to run the script:

  1. Install the scrapy library

     pip install scrapy  
    
  2. Download the html files to the data directory.

     python main.py 
    
  3. Match pattern to parse the html files into a csv file (URL, Incorrect, Correct, Suggestion)

     python match_pattern.py