These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.
Install the following Python packages:
- beautifulsoup4
- goose3
- pandas
pip install beautifulsoup4
pip install goose3
pip install pandas
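After running the commands above, it can help to confirm that all three packages are importable. The following is a minimal sketch, not part of the project itself: the names `REQUIRED` and `missing_packages` are hypothetical, and it relies only on the standard library's `importlib.util.find_spec`. Note that `beautifulsoup4` is imported under the module name `bs4`.

```python
import importlib.util

# Hypothetical mapping: pip name -> import name.
# beautifulsoup4 is imported as "bs4"; the other two share both names.
REQUIRED = {
    "beautifulsoup4": "bs4",
    "goose3": "goose3",
    "pandas": "pandas",
}


def missing_packages(required=REQUIRED):
    """Return the pip names whose import module cannot be found."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages, install with: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```

The check only looks up each module without importing it, so it stays fast even when the packages are large.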
- BeautifulSoup - For parsing HTML
- Goose3 - For extracting article text from HTML
- Pandas - For saving DataFrames to .csv
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
We use SemVer for versioning. For the versions available, see the tags on this repository.
- Linsu Han - linsuhan@graphen.ai, lh2910@columbia.edu - GitHub
See also the list of contributors who participated in this project.
This project is licensed under the Apache License, Version 2.0. See the LICENSE.txt file for details.