- Python >= 3.6 (tested with 3.6.1)
-
Install the Scrapy library:
pip install scrapy
-
Download the HTML files to the data directory:
python main.py
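
For orientation, here is a minimal sketch of how a downloaded page could be written into the data directory. It is not the code in main.py: the helper name `save_html`, the default directory name `data`, and the hashed file-name scheme are all assumptions made for illustration. In a Scrapy callback, such a helper could be called with `response.url` and `response.body`.

```python
import hashlib
from pathlib import Path


def save_html(url: str, body: bytes, data_dir: str = "data") -> Path:
    """Write one downloaded page into data_dir and return the file path.

    Hypothetical helper: the directory name and naming scheme are assumptions.
    """
    out_dir = Path(data_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    # Hash the URL so the file name is filesystem-safe and stable across runs.
    name = hashlib.sha1(url.encode("utf-8")).hexdigest() + ".html"
    path = out_dir / name
    path.write_bytes(body)
    return path
```

Hashing the URL avoids characters that are illegal in file names, at the cost of readable names; the real script may name files differently.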
-
Run the pattern matcher to parse the HTML files into a CSV file with the columns URL, Incorrect, Correct, Suggestion:
python match_pattern.py
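
The parsing step can be sketched roughly as follows. This is an illustration under stated assumptions, not the contents of match_pattern.py: the markup matched by `PATTERN` (`<del>`, `<ins>`, and a `suggestion` span) is hypothetical, as are the function names `extract_rows` and `write_csv`. Only the four column names come from the description above.

```python
import csv
import re

# Hypothetical pattern: assumes each correction appears in the page as
# <del>wrong</del> <ins>right</ins> <span class="suggestion">note</span>
PATTERN = re.compile(
    r"<del>(?P<incorrect>.*?)</del>\s*"
    r"<ins>(?P<correct>.*?)</ins>\s*"
    r'<span class="suggestion">(?P<suggestion>.*?)</span>',
    re.DOTALL,
)


def extract_rows(url, html):
    """Return one (URL, Incorrect, Correct, Suggestion) tuple per match."""
    return [
        (url, m["incorrect"].strip(), m["correct"].strip(), m["suggestion"].strip())
        for m in PATTERN.finditer(html)
    ]


def write_csv(rows, csv_path):
    """Write the header row followed by the extracted rows."""
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["URL", "Incorrect", "Correct", "Suggestion"])
        writer.writerows(rows)
```

A caller would loop over the files in the data directory, pass each file's text to `extract_rows`, and hand the combined rows to `write_csv`. The regular expression would need to be adapted to the actual structure of the downloaded pages.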