Pinned Repositories
brackets-open-in-cmder
Extension for Brackets to open the project folder in cmder.
brackets-terminal
Terminal Emulator for Brackets test editor
code_snippets
crawler
a web crawler
crawler4py
A web crawler in Python
crawlera-tools
Crawlera tools
crawlers
Some quick 'n dirty web crawlers.
crawley
crawtext_last
Yet another tiny crawler in python, using Bing Search API, Boilerpipe and Adblock.
newspaper
News extraction, article extraction and content curation in python. Built with multithreading, 10+ languages, NLP, ML, and more!
kvdaniel's Repositories
kvdaniel/newspaper
News extraction, article extraction and content curation in python. Built with multithreading, 10+ languages, NLP, ML, and more!
kvdaniel/brackets-open-in-cmder
Extension for Brackets to open the project folder in cmder.
kvdaniel/brackets-terminal
Terminal Emulator for Brackets test editor
kvdaniel/code_snippets
kvdaniel/crawler4py
A web crawler in Python
kvdaniel/crawlera-tools
Crawlera tools
kvdaniel/crawlers
Some quick 'n dirty web crawlers.
kvdaniel/crawley
kvdaniel/GoogleSearchCrawler
a tool for crawl Google search results
kvdaniel/HtmlJsCrawler
Simple html and javascript files crawler
kvdaniel/IWCT_Weibo_Crawler
This repository designing a sina weibo crawler is dedicated to the research program of IWCT,SJTU
kvdaniel/node-crawler
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
kvdaniel/noskiddie
noskiddie is a basic log watcher to ban kids/scanner/crawler based on a blacklist
kvdaniel/pty.js
Bindings to forkpty(3) for node.js.
kvdaniel/pyspider
A Powerful Spider(Web Crawler) System in Python.
kvdaniel/pySpidy
A simple, yet powerful, python web crawler for Google with browser capabilities
kvdaniel/python-crawler-ccw
web resources crawler for pdf or doc by python 3
kvdaniel/python-sitemap
Mini website crawler to make sitemap from a website.
kvdaniel/retina-crawler
A news crawler for the Retina Project
kvdaniel/scrapy-spiders
Collection of python scripts I have created to crawl various websites, mostly for lead generation projects to match keywords and collect email addresses and post URLs
kvdaniel/simDHT
A DHT crawler, very simple, writen by Python.
kvdaniel/spiderfoot
SpiderFoot, the open source footprinting and intelligence-gathering tool.
kvdaniel/term.js
A terminal written in javascript.
kvdaniel/testspiders
Useful test spiders for Scrapy
kvdaniel/tweeks
dummy
kvdaniel/warehouse
Next Generation Python Package Repository
kvdaniel/web-crawlers
several py web crawlers
kvdaniel/webmagic
A scalable web crawler framework.
kvdaniel/yelpcrawl
Crawl and scrape Yelp's restaurant data for every zip code in the United States (or a specified zipcode). Yelp Crawler.
kvdaniel/Zeek
Python distributed web scrapper and dynamic crawler