Say-No-To-Sign-In-Articles

Python script to scrape articles. Works even on articles that requrie a user to sign in before reading.


Install required packages:

pip install docx
pip install regex
pip install requests
pip install bs4

Running

  • Run the file
  • Enter the URL as prompted.
  • Enter the destination file path. (Can be to a Word file that has not been created already)
  • Remove quotaion marks. Eg: Change "C:\MyFolder\article.docx" to C:\MyFolder\article.docx
  • Navigate to the file path enetered before to find your file!
  • Star the repo :)

Known Bugs

  • Doesnt work on Medium Articles
  • Output doc is unformatted and without images