webcrawlers

There are 8 repositories under the webcrawlers topic.

AnonCatalyst/WebDiver
WebDiver is a versatile Python script for crawling websites, extracting internal and external links, titles, and descriptions. It's useful for tasks such as web analysis, OSINT (Open Source Intelligence) gathering, and competitive analysis.
Language: Python
ExcitingTheory/amplify-spiders-v1
A tool for search-engine-based competitive analysis: an AWS Amplify project with web crawlers for several search engines and a simple Next.js web frontend for displaying historical results.
Language: TypeScript
ozakboy/Taiwan-news-crawlers
.NET-based crawlers for Taiwanese news, with the data exposed as objects for ease of use.
Language: C#
alimogh/spideyweb
Simple WebCrawler
Language: Python
mr-rjh3/webcrawlers
Various web crawler programs written to gain a deeper understanding of web-crawling concepts.
Language: Python
Th3-C0der/Web-Crawler
A simple web crawler for exploring and downloading content from web pages within a given domain or URL.
Language: HTML
Modyev/WebsiteCrawler
Web crawler written in C# that parses all URLs from a given page, then recursively visits each one and parses the links found there.
Language: C#
slemarchand/no-robots
🚫🤖 Override /robots.txt to disallow all web crawlers, regardless of the settings stored in the database. Compatible with Liferay 7.0, 7.1, 7.2, 7.3 and 7.4.
Language: Java