Issues
- 0
Define `__all__` in `__init__.py`
#110 opened by adbar - 0
- 1
Bug: clean_url fails on apostrophe in urls
#106 opened by mikewolfd - 0
Deprecate Python 3.6 & 3.7
#73 opened by adbar - 0
Convert Readme file to markdown format
#98 opened by adbar - 1
- 0
Check if `langcodes` can be replaced by `babel`
#70 opened by adbar - 0
- 1
Change license to Apache 2.0
#80 opened by adbar - 0
Persistance for `UrlStore` (file I/O)
#71 opened by adbar - 0
- 1
Add support for username in netloc?
#76 opened by adbar - 0
Add `is_homepage()` heuristic
#75 opened by adbar - 0
Navigation: add heuristic based on site depth
#72 opened by adbar - 0
- 0
Provide function `is_valid_url()`
#62 opened by adbar - 0
Define option to focus on given extension types
#61 opened by adbar - 0
Offer IRI to URI conversion
#57 opened by adbar - 13
- 0
Make use of signal optional
#17 opened by adbar - 3
Courlan does not load `/page/` links
#16 opened by sbusso - 0
Domain/subdomain confusion in link extraction
#14 opened by adbar - 0
Investigate sampling issue
#7 opened by adbar - 0
Test and fix URL sampling to support Python 3.11
#6 opened by adbar - 0
Remove tox test settings
#5 opened by adbar - 0
Drop support for Python 3.5
#4 opened by adbar - 1
Replace tldextract with tld?
#1 opened by adbar - 1
Send head request with urllib3
#2 opened by adbar