weblyzard/inscriptis
A python based HTML to text conversion library, command line client and Web service.
PythonApache-2.0
Issues
- 1
- 0
Match links to bookmarks
#87 opened by nslpls - 1
Handling of Form 'input' elements.
#86 opened by grokwich - 1
How to extract the page title from the HTML?
#85 opened by StubbornDeer - 8
Instructions for Custom HTML tag handling do not work (or: Custom HTML tag handling does not work)
#81 opened by simonpercivall - 9
- 1
exclude header & footer
#79 opened by hadifar - 1
Comment block inside `span` gives a whitespace
#83 opened by dgtlmoon - 2
Text mapping to original HTML elements
#77 opened by ThomasArtin - 2
Inscriptis - How to handle `<title>` when it's outside a parent tag? ( RSS abuse :-) )
#78 opened by dgtlmoon - 1
Indentation using python
#76 opened by byashwan - 2
Presentation of internal spans seems a bit odd
#75 opened by mikix - 2
detection of (almost) hidden text in html
#72 opened by arpitest - 3
Display links config
#62 opened by crtnx - 3
[discussion] compared to other tools (links)
#66 opened by dgtlmoon - 5
Strange memory leak(?) consuming behaviour
#65 opened by dgtlmoon - 3
Exception handling
#63 opened by crtnx - 4
Converting tables to “running text”
#44 opened by omri-suissa-clearmash - 2
Mixing content of columns in a forum
#29 opened by rogerwaldvogel - 1
- 1
web service instructions don't work as written
#40 opened by rlskoeser - 2
Mixed text in extraction of table with span
#33 opened by sudoale - 2
- 4
Add support for converting HTML into indented text
#18 opened by emmggi - 1
Please replace urlopen with requests
#17 opened by yakovkeselman - 2
Failure while parsing Site
#16 opened by sandrohoerler - 1
- 3
Handle links?
#9 opened by rprots - 0
Code cleanup
#3 opened by AlbertWeichselbraun - 0
- 2
Empty row bug
#6 opened by Lucas-Gerrand - 0
- 0
Investigate missing link issue
#2 opened by AlbertWeichselbraun - 0
Improve handling of spaces
#1 opened by AlbertWeichselbraun