This script allows you to collect information from the Python Enhancement Proposals (PEP) documentation site.
- Fetches PEP details from the PEP documentation website.
- Provides information about PEP numbers, titles, statuses, and more.
-
Clone the repository:
git clone https://github.com/Kapshtyk/scrapy_parser_pep.git cd scrapy_parser_pep
-
Install the required dependencies:
pip install -r requirements.txt
- Run the script:
scrapy crawl pep
The parser will create two csv files in the results folder:
- pep_{time}.csv contains information about all PEPs, their status and name
- status_summary_{time}.csv contains summary information about the count of each status
Status Count
Active 31
Final 276
Accepted 50
Deferred 37
Superseded 20
Rejected 122
Withdrawn 56
April Fool! 1
Draft 29
Provisional 1
Total 623
- Scrapy 2.5.1
- LinkedIn - Arseny Kapshtyk
- Github - @kapshtyk