/webScraper_cpp

The web scraper program is designed to extract main content from web pages by following links recursively up to a certain depth. It saves the extracted content to a text file and keeps track of processed links and their MD5 hashes to avoid duplicate content.

Primary LanguageC++MIT LicenseMIT

Watchers