Pinned Repositories
discogs-xml2db
Import the discogs monthly XML dumps into a database (fork of http://code.google.com/p/discogs-sql-importer/)
dupuis
UI tools for record deduplication and linkage
parslepy
Python implementation of the Parsley language for extracting structured data from web pages
peewee
a small orm, with support for postgresql, mysql and sqlite
scrapy
Scrapy, a fast high-level screen scraping and web crawling framework for Python.
scrapy-chromedebugproto
Example of how to integrate Scrapy with Chrome Debugging Protocol [very alpha stage]
sketchtml
Experiments around web page fingerprints
sql2graph
helper module to export data from a relational database to a graph database (through CSV files)
redapple's Repositories
redapple/parslepy
Python implementation of the Parsley language for extracting structured data from web pages
redapple/sql2graph
helper module to export data from a relational database to a graph database (through CSV files)
redapple/scrapy-chromedebugproto
Example of how to integrate Scrapy with Chrome Debugging Protocol [very alpha stage]
redapple/dupuis
UI tools for record deduplication and linkage
redapple/discogs-xml2db
Import the discogs monthly XML dumps into a database (fork of http://code.google.com/p/discogs-sql-importer/)
redapple/sketchtml
Experiments around web page fingerprints
redapple/peewee
a small orm, with support for postgresql, mysql and sqlite
redapple/scrapy
Scrapy, a fast high-level screen scraping and web crawling framework for Python.
redapple/js2xml-talk
redapple/mbslave
Simple MusicBrainz replication
redapple/pyvideo-data
Python related videos and metadata powering =>
redapple/batch-import
generic csv file neo4j batch importer
redapple/imapy
Imapy: Imap for Humans
redapple/lxml
The lxml XML toolkit for Python
redapple/musicbrainz-server
The official musicbrainz-server codebase
redapple/parsley
Parsley is a simple language for extracting structured data from web pages. Parsley consists of an powerful selector language wrapped with a JSON structure that can represent page-wide formatting.
redapple/parsluby
Parsley extraction language written in Ruby using Nokogiri
redapple/pyacoustid
Python bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service
redapple/pycetr
Python implementation of CETR: Content Extraction via Tag Ratios
redapple/pyconfr2015
PyCon FR XPath talk material
redapple/python-musicbrainz-ngs
Python bindings for Musicbrainz' NGS webservice
redapple/python-simhash
An efficient simhash implementation for python
redapple/pyvideo
A Python media index
redapple/pyvideo-contrib
Contributions to pyvideo (using pyvideo/steve + cleaning things)
redapple/scrapy-issues
redapple/scrapy-tutorial
redapple/twisted
Event-driven networking engine written in Python.