Web-Scripe-Project-for-Chinese-Joke-Blog

Part 1 Load the requests and BeautifulSoup packages

Part 2 Define get_urls, which collects the link to each detail page, and get_info, which extracts the target information from each detail page
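
A minimal sketch of what these two functions might look like. The CSS selectors, the returned field names, the `fetch` helper, and the sample HTML are all assumptions made for illustration; the real markup of the blog is not described here and should be checked in the browser before reusing them.

```python
import requests
from bs4 import BeautifulSoup


def fetch(url):
    """Download a page and return its HTML text (hypothetical helper)."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


def get_urls(listing_html):
    """Return the detail-page links found on one listing page."""
    soup = BeautifulSoup(listing_html, "html.parser")
    # Assumption: each post title is an <a> inside an <h2>.
    return [a["href"] for a in soup.select("h2 a[href]")]


def get_info(detail_html):
    """Return the title and body text of one detail page."""
    soup = BeautifulSoup(detail_html, "html.parser")
    # Assumption: the title sits in <h1> and the joke text in <article>.
    title = soup.find("h1")
    body = soup.find("article")
    return {
        "title": title.get_text(strip=True) if title else None,
        "content": body.get_text(" ", strip=True) if body else None,
    }


# Made-up HTML standing in for real pages, so the sketch runs offline.
sample_listing = (
    '<h2><a href="https://example.com/1/">A</a></h2>'
    '<h2><a href="https://example.com/2/">B</a></h2>'
)
sample_detail = "<h1>A</h1><article><p>Joke text.</p></article>"

urls = get_urls(sample_listing)
info = get_info(sample_detail)
```

Against the live site, the same functions would be called as `get_urls(fetch(page_url))` and `get_info(fetch(detail_url))`. Keeping the download separate from the parsing makes the parsing easy to test on saved HTML.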

Part 3 Extract Information and Generate Dataframe

Part 4 Export Dataframe to CSV
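
Parts 3 and 4 could be sketched roughly as follows. The rows, the column names, and the `jokes.csv` file name are placeholders chosen for illustration, not values taken from the project.

```python
import pandas as pd

# Placeholder rows; in the project these would be the dicts returned
# for each detail page.
rows = [
    {"title": "Joke A", "content": "First joke text."},
    {"title": "Joke B", "content": "Second joke text."},
]

# Build the DataFrame with an explicit column order.
df = pd.DataFrame(rows, columns=["title", "content"])

# With no path given, to_csv returns the CSV as a string.
csv_text = df.to_csv(index=False)

# To write a file instead (hypothetical file name):
# df.to_csv("jokes.csv", index=False, encoding="utf-8-sig")
```

The `utf-8-sig` encoding adds a byte-order mark, which usually helps spreadsheet programs display Chinese text correctly.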


About the dataset:

I scraped the dataset for this project myself, using Python (BeautifulSoup) and basic HTML knowledge.

The source is the Chinese jokes blog at https://duanzixing.com/page/1/