webcrawlers
There are 8 repositories under webcrawlers topic.
ExcitingTheory/amplify-spiders-v1
A tool for search engine based competitive analysis. AWS Amplify project with many web crawlers for different search engines, and a simple web frontend for displaying historical results in Next.js
AnonCatalyst/WebDiver
WebDiver is a versatile Python script for crawling websites, extracting internal and external links, titles, and descriptions. It's useful for tasks such as web analysis, OSINT (Open Source Intelligence) gathering, and competitive analysis.
ozakboy/Taiwan-news-crawlers
.net-based Crawlers for news of Taiwan (.net 台灣新聞爬蟲,數據物件化,方便使用)
alimogh/spideyweb
Simple WebCrawler
mr-rjh3/webcrawlers
Various webcrawler programs made to gain a deeper understandings of webcrawling concepts.
Th3-C0der/Web-Crawler
A simple WebCrawler for exploring and downloading content from web pages within a given domain/url.
Modyev/WebsiteCrawler
Web Crawler written in C# that parses all urls from a specific page then recursively visits them while parsing all links available on that webpage
slemarchand/no-robots
🚫🤖 Override /robots.txt to disallow all web crawlers, regardless settings stored in the database. Compatible with Liferay 7.0, 7.1, 7.2, 7.3 and 7.4.