/tinhtevn-crawler

This is the project that crawl all threads of a specific category in Tinhte.vn based on Scrapy library.

Primary LanguagePython

Tinhte.vn Crawler

Introduction

This is the project that crawl all threads of a specific category in Tinhte.vn based on Scrapy library .
Written by: hunghinh2000

Input/Output:

Input: the link of category you need to crawl on Tinhte.vn. You can change the link in ./config/main.cfg
Output: image folder contains all images of each thread and json folder contains information of image.

Installing

Requirements

python >= 3.5

Install

Run this command to install requirements:
pip3 install -r requirements.txt

Config

Change system config in ./config/main.cfg to suit your enviroments.

Run

python3 main.py

Contributing

Links