/Py_Cralwer

Web crawler for public information on China court websites.

Primary LanguagePython

Py_Cralwer

###File description
MajorCrawler.py : All the get[City]() functions here.
Run_Instace.py : Execution.
parseTest.py : Regular expression and mongoDB insertion.
TestEnv.py : Beijing court crawling only.

###Log
[11/15] For advance crawlering and generalization.
[11/10] Working on regular expression parsing nodes in txt, and insertion to mongoDB