microsoft/openscraping-lib-csharp
Turn unstructured HTML pages into structured data. The OpenScraping library can extract information from HTML pages using a JSON config file with xPath rules. It can scrape even multi-level complex objects such as tables and forum posts. This is the C# version.
C#NOASSERTION
Issues
- 1
- 1
- 0
- 4
- 3
Regex Transformation not returning first match
#24 opened by Sebbl22 - 19
- 1
Modify MultiExtractor and document it
#4 opened by zmarty - 1
Document how to write custom transformations
#5 opened by zmarty - 2
Regexp Transformation
#8 opened by agabuza - 1
- 1
Broken links to "Transformations" examples
#16 opened by jwatkin - 4
Incompatibility with .Net Standard 2.0
#12 opened by colinthompson - 1
How to return the href of a hyperlink?
#13 opened by blaiseliu - 4
- 4
Get meta data?
#9 opened by TJF0700