这是一个基于scrapy框架的简单小爬虫,适合刚学爬虫和scrapy的同学学习。这是作者在大二上期末的python课程设计。用于爬取 movie.douban.com 中评分最高的250部电影。
爬取内容有:
- 电影名称
- 导演及出演演员
- 评分
- 简述
- 电影封面
This is a simple crawler predicated on the "scrapy" module. I consider that it is a awesome example for someone who just studing "scrapy" or learning crawler. And, this is my python class design in the first semester of sophomore year. It is used to crawl Top250 movies on movie.douban.com.
Crawled items:
- Movie Name
- Director & actors
- Rating
- Intro
- Cover