/quora-nl-scraper

This repo scrapes Dutch questions and answers from Quora NL.

Primary LanguagePythonMIT LicenseMIT

Quora NL Scraper

Code to scrape question and answers from Quora NL.
Adaptation of the English version.

topic_list.txt: text file with topics on each newline, of which each word of the topic needs to be lowercased and joined by '%20' as substitute for space. For example: 'mark%20rutte'
quora-question-scraper.py: scrapes questions of which the topic is mentioned in the question URL and puts them in topic_questions_urls_date.csv.
quora-answer-scraper.py: scrapes answers from topic_questions_urls_date.csv obtained by question scraper.

NOTE: Scraped answers include the section of related answers ('gerelateerde antwoorden'). Login is required if related answers should be excluded.