fanboy-tag-scraper

A scraper to collect posts from https://fbtag.net I have tested to run this on cygwin on a windows machine and ubuntu.

Requirements

Clone this repository (git clone .... )
Enter directory
Install requirements (pip install -r requirements.txt)
Copy settings file and edit to your settings (cp settings.example.py settings.py)

scrapy crawl fbtag <-a tag_filter="3,8"> <-a discussion_list_deep=5> <-a discussion_deep=2> <-a sort_order=oldest>

Parameters explained:

Only collect posts from given tags ids. Commaseparated tagid

How many pages to parse through in the discussion list navigation.

How many pages to parse through in every discussion.

Which order to parse the post list? Possible values: