Python script to scrape articles. Works even on articles that requrie a user to sign in before reading.
pip install docx
pip install regex
pip install requests
pip install bs4
- Run the file
- Enter the URL as prompted.
- Enter the destination file path. (Can be to a Word file that has not been created already)
- Remove quotaion marks. Eg: Change "C:\MyFolder\article.docx" to C:\MyFolder\article.docx
- Navigate to the file path enetered before to find your file!
- Star the repo :)
- Doesnt work on Medium Articles
- Output doc is unformatted and without images