/urlseed

find news home page by analysis page contents.

Primary LanguageJava

1\ use scrapy to download the page info 2\ use this project to extract news main page address 3\ download the news main page 4\ get the most common news url in the news page.