buriy/python-readability
fast python port of arc90's readability tool, updated to match latest readability.js!
PythonApache-2.0
Issues
- 12
Using AdBlock rules to remove elements
#43 opened by bburky - 5
Aggressively removes images
#89 opened by rupeshk - 3
Why not calculate node score from deep to shallow?
#78 opened by eromoe - 1
- 1
- 1
Error when using a list for 'positive_keywords'
#91 opened by Bartvds - 1
- 1
ERROR: 'xslt-config
#92 opened by por7er - 0
No extract to body
#90 opened by ichenfujun - 3
Add charset info to the clean html
#88 opened by rsuhada - 3
HTML isn't parsed when get_clean_html is called
#86 opened by ostrea - 1
Error in install and import
#83 opened by farhad-arjmand - 1
Travis for CI
#81 opened by decentral1se - 5
- 3
-v/--verbose not work
#77 opened by eromoe - 0
- 5
Issue with Medium pages
#71 opened by Andre0991 - 2
- 5
- 2
Debug.info to debug.warning
#69 opened by c24b - 0
- 2
- 7
No Title for most articles
#46 opened by gevezex - 1
Keep the <meta charset> tag or even add it
#51 opened by gdamjan - 7
- 4
Python 3.4 Support
#66 opened by shaka908 - 6
- 2
Failure if best_elem is root
#58 opened by jnothman - 9
Differences with Goose
#57 opened by 0x0ece - 2
Having problems grabbing the main article
#52 opened by appscluster - 6
Try UnicodeDammit instead of using chardet library (or maybe combine the approaches).
#42 opened by buriy - 6
- 6
- 3
Pypi not up-to-date
#40 opened by scraperdragon - 3
htmls.py sets root logger loglevel
#37 opened by youngrok - 0
Rename this package to python-readability-lxml
#7 opened by buriy - 2
Errors in processing slashdot pages
#12 opened by andrebask - 3
- 2
error: get comments, not text
#23 opened by xnj - 1
- 2
Save charset
#34 opened by Cosmologist - 2
- 5
- 3
Erratic <p> insertion in Macrumors article
#30 opened by akavlie - 11
0.2.4 uninstallable .egg uploaded to pypi
#16 opened by mitechie - 2
Annoying warning when using readability
#26 opened by sprat - 0
Warning in htmls.py
#22 opened by xnj - 1
Add version 0.2.5.1 to pypi
#21 opened by droodle - 2
processed html has self closed body tag
#13 opened by mitechie - 1
Eliminate display:none tags
#14 opened by arski