Strumenta/SmartReader
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
C#Apache-2.0
Issues
- 8
- 5
- 2
- 3
ConvertToPlaintext performance enhancements
#61 opened by malv007 - 2
data URIs in IMG SRC not preserved, treated as relative URL and resolved to invalid URL
#58 opened by acidus99 - 1
System.ArgumentOutOfRangeException: Year, Month, and Day parameters describe an un-representable DateTime.
#55 opened by iansmirlis - 1
- 0
This is great, thank you!
#57 opened by LAB02-Admin - 7
Demo website vs library
#52 opened by iansmirlis - 3
- 0
System.ArgumentOutOfRangeException: Specified argument was out of the range of valid values.
#53 opened by iansmirlis - 2
- 1
Memory leak on undisposed doc within Reader
#50 opened by Joshhua5 - 4
- 1
Thresholds must be language sensitive
#47 opened by ivanicin - 7
- 8
Support for german language characters
#44 opened by marhyno - 2
- 2
Pass the original html in Article
#49 opened by ivanicin - 3
Angle Sharp parsing xml attributes
#42 opened by prestonkell - 1
SmartReader.UriExtensions.ToAbsoluteURI(Uri pageUri, String uriToCheck) throws exception when uriToCheck = ""
#41 opened by mininmaxim - 2
Is there a way to get the full html text of an article? (from the <htm> or <body> tag)?
#25 opened by trescatorce - 2
- 4
- 4
System.ObjectDisposedException: Cannot access a disposed object. Object name: 'SocketsHttpHandler'.
#22 opened by MaratPavlov - 1
- 4
Crash on Xamarin forms
#18 opened by sherifawad - 0
Improving documentation
#13 opened by gabriele-tomassetti - 0
Integrate CI service
#14 opened by gabriele-tomassetti - 0
Update and publish demo project
#15 opened by gabriele-tomassetti - 13
Improve extensibility of parser
#6 opened by kodfodrasz - 2
New Bug: Featured Image seems to only provide asset name in the latest version instead of full url
#11 opened by alombard - 2
- 2
Lack of user-agent when sending request.
#9 opened by AndySchmitt - 4
- 1
System.Globalization.CultureNotFoundException
#7 opened by iixi - 1
GetArticle from html string
#5 opened by iixi - 6
does it work on .NET Framework 4.6?
#2 opened by yasindn - 1
Define the license in the README
#1 opened by ftomassetti