datawizard1337/ARGUS
ARGUS is an easy-to-use web scraping tool built on the Scrapy Python framework. It can crawl a broad range of websites and perform tasks such as scraping text or collecting the hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Python, GPL-3.0
Issues
Argus is returning no text for some websites
#55 opened by DJW-TU - 1
Only scrape urls
#54 opened by DioLimpens - 6
Getting started with ARGUS
#53 opened by DioLimpens - 7
scrapyd compatibility issue
#51 opened by ebergam - 1
Problem with relative paths
#40 opened by cordoba27 - 0
Textspider jobs do not finish
#38 opened by datawizard1337 - 2
Take Website Screenshots
#34 opened by davidlenz - 6
tldextract of registered_domain vs domain
#17 opened by datawizard1337 - 2
Post-processing multiple chunks
#32 opened by cordoba27 - 4
Duplicate webpages
#31 opened by davidlenz - 0
Argus.exe not working
#30 opened by davidlenz - 2
Optional encodings for export files
#18 opened by datawizard1337 - 0
RSS Feeds
#22 opened by datawizard1337 - 0
GUI
#27 opened by datawizard1337 - 0
Text extraction from div
#13 opened by datawizard1337 - 1
Text extraction enhancement
#20 opened by datawizard1337 - 0
Duplicate webpage texts
#19 opened by datawizard1337