C++ header-only library for extracting information from HTML documents
- Extract page title
- Extract all links
- Cleanup extracted links (fix relative)
- Filter extracted links (may not add, easy for user of lib to do)
- Extract all page text content
- Error handling :).