This project applies Web scraping concepts to LACENTRALE, a French website for buying and selling secondhand cars.
It automatically extracts large amounts of information from the site.
https://slides.com/safaeladib/webscraping_lacentrale
Access the home page to get all vehicle makes.
Gather the URLs of all listings for a specific vehicle make.
Extract all relevant data from each listing's page.
Repeat the previous steps for a given number of result pages.
Save the collected data to a CSV file.
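The URL-gathering and pagination steps can be sketched roughly as follows. This is only an illustration under stated assumptions: the `a.listing-link` selector, the sample HTML, and the helper names `extract_listing_urls` and `collect_listing_urls` are hypothetical and do not reflect the real markup of the site. Pages are served from an in-memory dictionary so the sketch runs without network access; in a real run `fetch_page` would download each result page.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

BASE_URL = "https://www.lacentrale.fr"

# Hypothetical result pages; the real site's markup will differ.
FAKE_PAGES = {
    1: '<a class="listing-link" href="/annonce-1.html">Car 1</a>'
       '<a class="listing-link" href="/annonce-2.html">Car 2</a>'
       '<a class="help" href="/help.html">Help</a>',
    2: '<a class="listing-link" href="/annonce-3.html">Car 3</a>',
}


def extract_listing_urls(html, base_url=BASE_URL):
    """Return absolute URLs of all listing links found in one result page."""
    soup = BeautifulSoup(html, "html.parser")
    # Assumed selector: listing links carry the class "listing-link".
    links = soup.select("a.listing-link[href]")
    return [urljoin(base_url, link["href"]) for link in links]


def collect_listing_urls(fetch_page, max_pages):
    """Walk result pages 1..max_pages, stopping early if a page is missing."""
    urls = []
    for page_number in range(1, max_pages + 1):
        html = fetch_page(page_number)
        if not html:
            break  # no more result pages
        urls.extend(extract_listing_urls(html))
    return urls


# FAKE_PAGES.get stands in for a function that downloads a result page.
listing_urls = collect_listing_urls(FAKE_PAGES.get, max_pages=5)
print(listing_urls)
```

Passing the page fetcher as a parameter keeps the parsing logic testable offline and makes it easy to add delays or error handling around the real HTTP requests.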
- Python library BeautifulSoup
- CSS selectors to parse HTML pages
- Python pandas library to store the collected data in a DataFrame
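As a minimal sketch of how these three tools might fit together, the example below parses one listing page with CSS selectors, stores the result in a DataFrame, and serializes it as CSV. The sample HTML, the selectors (`h1.title`, `span.price`, `span.mileage`), and the `parse_listing` helper are assumptions made for illustration, not the actual structure of a LACENTRALE page.

```python
import io

import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical listing page; real pages use different markup.
SAMPLE_DETAIL_PAGE = """
<div class="ad">
  <h1 class="title">RENAULT CLIO</h1>
  <span class="price">8500</span>
  <span class="mileage">75000</span>
</div>
"""


def parse_listing(html):
    """Extract a few fields from one listing page using CSS selectors."""
    soup = BeautifulSoup(html, "html.parser")

    def text_of(selector):
        # select_one returns None when nothing matches the selector.
        node = soup.select_one(selector)
        return node.get_text(strip=True) if node else None

    return {
        "title": text_of("h1.title"),
        "price": text_of("span.price"),
        "mileage": text_of("span.mileage"),
    }


# One dictionary per listing becomes one row of the DataFrame.
records = [parse_listing(SAMPLE_DETAIL_PAGE)]
df = pd.DataFrame(records)

# Written to an in-memory buffer here; pass a file path to to_csv to save on disk.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(csv_text)
```

Returning `None` for a missing field, rather than raising, lets the crawl continue when a listing omits some information; the gaps then show up as empty cells in the CSV.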