-
Extracted from stackoverflow.com the 10 newest Android-related questions and the 10 most voted Android-related questions that are created in the past week.
-
Writtern in Python 3.7
-
Extracted the 10 newest Android-related questions without duplicated ones.
-
Extracted the 10 most voted Android-related questions posted in the last 7 days without duplicated ones.
-
Displayed the titles of the extracted questions on a website.
-
Updated the web content when clicking the update button and showed the update time.(NEW)
-
Displayed the full text of question in a drop-down box when clicking on the question title.
-
Displayed key attributes of every question under its title, such as answer number, vote number and creat time.(NEW)
-
Saved the 10 newest and 10 most voted questions in csv files respectively, and automatically named files with update time.(NEW)
-
pip install requests
-
pip install beautifulsoup4
-
pip install dominate
-
pip install flask
-
Win 10
-
Latest Firefox/Chrome supporting HTML5
-
Download the repository.
-
Change the directory to where you downloaded the crawler.
-
Run 'python server.py' in cmd.
-
Open 'http://127.0.0.1:5000/' in web browser.
-
Waite for a few seconds until 'Finished!' is printed in terminal.
-
Update the data by clicking on the button 'Update'.
-
The csv files have been saved automatically in the root dictionary of the crawler.