kindsmiles/pyvigate

Context and Styling aware data extraction from websites

Opened this issue · 0 comments

An LLM powered webscraped that has awareness about the DOM elements and styling in context could potentially scrape things in a heirarchical manner that other tools cant really easily replicate, especially within a headless environment

Example use case : Hackernews comments extraction AND Extracting / Filtering out Textual Data from Main Hero Content for Varying Sources