Pinned Repositories
mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
webstruct
NER toolkit for HTML data
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
clang_indexer
Contacts
Рамблер-Контакты (Rambler Contacts), a social messenger.
kv-queue
Pure C queue implementation on top of existing key-value stores
libcmdline
ocrsdk.com
ABBYY Cloud OCR SDK
qtwebkit
QtWebKit development repository
supervised-splash
whalebot-helmsman's Repositories
whalebot-helmsman/qtwebkit
QtWebKit development repository
whalebot-helmsman/supervised-splash
whalebot-helmsman/kv-queue
Pure C queue implementation on top of existing key-value stores
whalebot-helmsman/libcmdline
whalebot-helmsman/cookbooks
Cookbooks used internally at Couchbase for managing VMs and test machines
whalebot-helmsman/cpython
The Python programming language
whalebot-helmsman/dateparser
Python parser for human-readable dates
whalebot-helmsman/distributed
A distributed task scheduler for Dask
whalebot-helmsman/extruct
Extract embedded metadata from HTML markup
whalebot-helmsman/frozen-garden
whalebot-helmsman/html-text
Extract text from HTML
whalebot-helmsman/html5-parser
Fast C-based HTML 5 parsing for Python
whalebot-helmsman/html5print
HTML5, CSS, JavaScript Pretty Print
whalebot-helmsman/MaybeDont
A component that tries to avoid downloading duplicate content
whalebot-helmsman/mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
whalebot-helmsman/parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
whalebot-helmsman/protego
A pure-Python robots.txt parser with support for modern conventions.
whalebot-helmsman/pykt-64
whalebot-helmsman/python-wapiti
Python bindings for libwapiti
whalebot-helmsman/rankGauss
[NOT IMPLEMENTED] rankGauss implementation as described in Normalization section of https://www.kaggle.com/c/porto-seguro-safe-driver-prediction/discussion/44629
whalebot-helmsman/rssarchiver
create archives of RSS feeds
whalebot-helmsman/scikit-cache
whalebot-helmsman/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
whalebot-helmsman/scrapy-bench
A CLI for benchmarking Scrapy.
whalebot-helmsman/scrapy-splash
Scrapy+Splash for JavaScript integration
whalebot-helmsman/slakna
whalebot-helmsman/splash
Lightweight, scriptable browser as a service with an HTTP API
whalebot-helmsman/tomita-parser
whalebot-helmsman/whalebot
whalebot-helmsman/Wikipedia
A Pythonic wrapper for the Wikipedia API