Don't worry about it! But if you'd like to know, it scrapes email addresses from the websites listed in CSV files. The program pulls each page's HTML, finds email addresses with a regexp match, and removes duplicates.
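For the curious, here is a rough sketch of the match-and-dedupe step described above. It is not the code from getscrapedall.py: the `EMAIL_RE` pattern, the `extract_emails` helper, and the sample HTML are all hypothetical, and fetching the pages is left out so the example runs offline.

```python
import re

# Simplified email pattern (an assumption; the project's actual regexp may differ).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html):
    """Return the unique email addresses found in html, in first-seen order."""
    seen = set()
    unique = []
    for match in EMAIL_RE.findall(html):
        email = match.lower()  # normalize case so duplicates collapse
        if email not in seen:
            seen.add(email)
            unique.append(email)
    return unique


# Hypothetical page content standing in for downloaded HTML.
sample_html = (
    '<p>Contact <a href="mailto:Info@example.com">info@example.com</a> '
    "or sales@example.org.</p>"
)
print(extract_emails(sample_html))
```

Lowercasing before the duplicate check is a design choice in this sketch; it treats `Info@example.com` and `info@example.com` as the same address.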
Use the package manager pip to install dependencies (see below).
git clone https://github.com/kendalled/GetScraped.git
Put your CSV files containing URLs in the src/Data folder. Then:
pip3 install pandas
pip3 install lxml
pip3 install unicodecsv
Then run the following:
python3 getscrapedall.py
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update issues as appropriate.