sanfoundry-scraper

A small scraping script written in Python that collects all questions for a subject on sanfoundry.com and merges them into an HTML document with additional data.

Extracts all MCQs for a subject from the subject link you provide.

Get the code using the Download ZIP button or by following this link, then run main.py with the following command:

python main.py

or

python main.py --url {url-of-quiz}

This scraper also supports multithreading:

python main.py --url {url-of-quiz} --thread --workers 15

Here, workers is the number of threads. The default is 5.

You can change it by passing --workers {thread-choice}.

Using more workers can cause instability.
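To illustrate what the worker count controls, here is a minimal sketch of fetching several pages with a thread pool. It is not taken from main.py: `fetch_all` and `fake_fetch` are hypothetical names, and the real script may organize its threads differently.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_all(urls, fetch, workers=5):
    """Apply `fetch` to every URL using a pool of `workers` threads.

    Hypothetical helper, not part of the project. Results come back
    in the same order as `urls`.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, urls))


if __name__ == "__main__":
    # Stand-in for a real page download, so the sketch runs offline.
    def fake_fetch(url):
        return f"<html>{url}</html>"

    pages = fetch_all(["page-1", "page-2", "page-3"], fake_fetch, workers=15)
    print(len(pages))
```

With a real download function, a larger `workers` value means more requests are in flight at once, which is the likely source of the instability mentioned above.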

Input the URL of the subject (for example, "https://www.sanfoundry.com/1000-object-oriented-programming-oops-questions-answers/") and run the script to get a file, saved in a folder named Saved_MCQs, containing all the MCQs of the subject.
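For a rough idea of the output step, the sketch below merges a list of questions into one HTML file inside a Saved_MCQs folder. The function name `save_mcqs`, the file name, and the HTML layout are all assumptions made for illustration; the actual script may produce something different.

```python
import html
import tempfile
from pathlib import Path


def save_mcqs(title, questions, out_dir="Saved_MCQs"):
    """Write the questions into one HTML file and return its path.

    Hypothetical helper: the file name and markup are illustrative only.
    """
    folder = Path(out_dir)
    folder.mkdir(parents=True, exist_ok=True)

    safe_title = html.escape(title)
    # Escape each question so stray < or & characters do not break the page.
    items = "\n".join(f"<li>{html.escape(q)}</li>" for q in questions)
    page = (
        "<!DOCTYPE html>\n"
        f"<html><head><meta charset='utf-8'><title>{safe_title}</title></head>\n"
        f"<body><h1>{safe_title}</h1>\n<ol>\n{items}\n</ol></body></html>\n"
    )

    path = folder / f"{title}.html"
    path.write_text(page, encoding="utf-8")
    return path


if __name__ == "__main__":
    # Write into a temporary folder so the demo leaves nothing behind.
    with tempfile.TemporaryDirectory() as tmp:
        saved = save_mcqs("oops-questions", ["What is a class?"], out_dir=tmp)
        print(saved.name)
```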

If you are using the program for the first time, install the requirements with the following command:

pip install -r requirements.txt

Then run

python main.py --help

to list the available command-line options.
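The flags used in this README (--url, --thread, --workers) could be declared with argparse roughly as follows. This is a sketch under that assumption, not the parser from main.py; `build_parser` and the help strings are made up for illustration.

```python
import argparse


def build_parser():
    """Build a parser for the flags shown in this README (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Collect MCQs for a subject on sanfoundry.com."
    )
    parser.add_argument("--url", help="URL of the subject page")
    parser.add_argument(
        "--thread", action="store_true", help="enable multithreading"
    )
    parser.add_argument(
        "--workers", type=int, default=5, help="number of threads (default: 5)"
    )
    return parser


if __name__ == "__main__":
    # Parse an explicit argument list so the demo does not depend on sys.argv.
    args = build_parser().parse_args(
        ["--url", "{url-of-quiz}", "--thread", "--workers", "15"]
    )
    print(args.url, args.thread, args.workers)
```

A parser built this way also gets a --help option automatically from argparse.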

LOG: This is getting traction hehe, so I'm gonna make it nicer to use and improve the output:

  1. PDF output instead of HTML, or maybe as an option, idk
  2. good docs 💀
  3. plain-text output format with just the questions and answers in a .txt file. Easier to "study" xD.