Pinned Repositories
ajax-solr
A JavaScript framework for creating user interfaces to Solr.
Autofill-Form-Chrome-Extension
Bootstrap-Form-Builder
Web app for building Bootstrap forms by drag and drop.
capybara
Acceptance test framework for web applications
chrome-app-samples
Chrome Apps
clustershell
Scalable cluster administration Python framework — Manage node sets, node groups and execute commands on cluster nodes in parallel.
dbpedia-spotlight
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.
elastic.js
A JavaScript implementation of the elasticsearch Query DSL
elasticsearch-analysis-skos
SKOS analysis for Elasticsearch
flashproxy
Flashproxy project (https://crypto.stanford.edu/flashproxy/) forked from https://git.torproject.org/flashproxy.git
parsing's Repositories
parsing/ajax-solr
A JavaScript framework for creating user interfaces to Solr.
parsing/flashproxy
Flashproxy project (https://crypto.stanford.edu/flashproxy/) forked from https://git.torproject.org/flashproxy.git
parsing/node-cookie
RFC6265 Cookies and CookieJar for Node.js
parsing/node-websocket-relay
An http to websocket relay server written in node.js.
parsing/python-zombie
A Python driver for Zombie.js (http://zombie.labnotes.org/), a headless browser powered by node.js.
parsing/websocket-relay
parsing/dashboard
Dashboard for communities: what's up, who's here and what are they working on. Designed for use at the Open Knowledge Foundation.
parsing/distribute_crawler
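A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster provides the underlying storage, Redis handles distribution, and Graphite displays crawler status.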
parsing/django-dynamic-scraper
Creating Scrapy scrapers via the Django admin interface
parsing/html-structure-view
A simple and useful tool to assist in planning, presentation and visualization of structures of your site.
parsing/HtmlConverter
PHP5 library that provides easy HTML-to-Text conversion
parsing/moodle-local_html2rtf
A PHP/XSLT library for Moodle 2 that enables conversion of an (X)HTML string to an RTF document string. Used in the portfolioact plugin.
parsing/natural
general natural language facilities for node
parsing/node-calais
Node.js module/CLI tool for semantic analysis of text using the OpenCalais web service.
parsing/node-scrapyard
Scrapyard makes scraping websites easy.
parsing/node.io
A data scraping and processing framework
parsing/ofxparse
Ofx file format parser for Python
parsing/pdf-extract
Node PDF Extract
parsing/pdfminer
Python PDF Parser
parsing/phantomrobot
Robot Framework Remote Test Library for PhantomJS
parsing/python-scrapinghub
A client interface for Scrapinghub's API
parsing/queuelib
Collection of persistent (disk-based) queues
parsing/scrapy-redis
Redis-based components for Scrapy that allow distributed crawling
parsing/Scrapy.js
Node.js implementation of a Python-based open source screen scraping tool.
parsing/scrapyjs
Scrapy-Javascript integration
parsing/selectorgadget
Old home of selectorgadget
parsing/socket.io
Realtime application framework for Node.JS, with HTML5 WebSockets and cross-browser fallbacks support.
parsing/soupselect
CSS selector support for BeautifulSoup.
parsing/truncatehtml
A Jekyll plugin that truncates HTML while preserving markup structure.
parsing/watir-webdriver-inspector
Minimal combinatorial code generation on element inspection. [Work in progress]