mittal-parth/Say-No-To-Sign-In-Articles

Python

Say-No-To-Sign-In-Articles

Python script to scrape articles. Works even on articles that require a user to sign in before reading.

Install required packages:

pip install python-docx
pip install regex
pip install requests
pip install bs4
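
To confirm the installs worked, a quick check like the one below may help. This is a minimal sketch and not part of the repo: `missing_modules` is a hypothetical helper, and the import names passed to it (docx, regex, requests, bs4) are assumed from the package list above.

```python
import importlib.util


def missing_modules(names):
    """Return the names from `names` that Python cannot locate for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names for the packages listed above.
    missing = missing_modules(["docx", "regex", "requests", "bs4"])
    print("Missing:", ", ".join(missing) if missing else "none")
```

If anything is reported as missing, re-run the matching pip command before running the script.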

Running

Run the file
Enter the URL as prompted.
Enter the destination file path. (It can point to a Word file that does not exist yet.)
Remove any quotation marks. E.g., change "C:\MyFolder\article.docx" to C:\MyFolder\article.docx
Navigate to the file path entered earlier to find your file!
Star the repo :)
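
The quotation-mark step above could also be handled in code. The sketch below is only an illustration, not what the script currently does: `clean_destination` is a hypothetical helper that trims whitespace and one pair of surrounding quotes from a pasted path.

```python
def clean_destination(raw):
    """Strip whitespace and one pair of matching surrounding quotes from a path."""
    text = raw.strip()
    # Only remove quotes when the same quote character opens and closes the text.
    if len(text) >= 2 and text[0] == text[-1] and text[0] in "\"'":
        text = text[1:-1]
    return text


if __name__ == "__main__":
    # Windows "Copy as path" wraps the path in double quotes.
    print(clean_destination(r'"C:\MyFolder\article.docx"'))
```

Paths without quotes pass through unchanged, so the helper is safe to apply to every input.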

Known Bugs

Doesn't work on Medium articles
Output document is unformatted and contains no images
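
One plausible reason for the second bug, assuming the script keeps only paragraph text (the repo description does not confirm this), is that everything except the text inside <p> tags is discarded. The sketch below illustrates that effect using the standard library's html.parser as a stand-in for bs4, so it runs without extra packages. `ParagraphText` and `extract_paragraphs` are hypothetical names.

```python
from html.parser import HTMLParser


class ParagraphText(HTMLParser):
    """Collect the plain text of every <p> element and ignore all other markup."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []  # finished paragraphs, in document order
        self._in_p = False    # True while the parser is inside a <p>
        self._chunks = []     # text pieces of the current paragraph

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._in_p = True

    def handle_endtag(self, tag):
        if tag == "p" and self._in_p:
            text = "".join(self._chunks).strip()
            if text:
                self.paragraphs.append(text)
            self._chunks = []
            self._in_p = False

    def handle_data(self, data):
        # Inline tags such as <b> are dropped, but their text is kept.
        if self._in_p:
            self._chunks.append(data)


def extract_paragraphs(html):
    """Return the text of each <p> in `html` as a list of strings."""
    parser = ParagraphText()
    parser.feed(html)
    parser.close()
    return parser.paragraphs
```

With this approach, headings, bold text, and <img> tags leave no trace in the result, which matches the unformatted, image-free output described above.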