Pinned Repositories
extruct
Extract embedded metadata from HTML markup
learn.scrapinghub.com
Scrapinghub Learning Center. Report issues in Jira: Report issues in Jira: https://scrapinghub.atlassian.net/projects/WEB
quotesbot
This is a sample Scrapy project for educational purposes
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
books_crawler
A Scrapy crawler for http://books.toscrape.com
flake8-scrapy
A Flake8 plugin to catch common issues on Scrapy spiders
HackerNewsDailyDigest
A toy project with Scrapy + Django + Celery to run on Heroku
impress-code-highlighter
A simple tool to highlight source code blocks in LibreOffice Impress.
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
scrapy_price_monitor
A simple price monitor built with Scrapy and Scrapy Cloud (https://app.scrapinghub.com)
stummjr's Repositories
stummjr/HackerNewsDailyDigest
A toy project with Scrapy + Django + Celery to run on Heroku
stummjr/sublimetext-markdown-to-wordpress
Sublime Text 2 plugin to publish to wordpress blogs the contents from a Markdown file (as HTML).
stummjr/inventwithpython
Book Invent With Python
stummjr/products-crawler
Sample Scrapy project to test with the Scrapy Cloud MonkeyLearn addon.
stummjr/pyfsa
Python FSA constructor, determinizer, and minimizer.
stummjr/scrapy-deltafetch
Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls
stummjr/bggen
A simple tool to generate GNOME/Unity wallpaper slideshow.
stummjr/cookiecutter-pypackage
Cookiecutter template for a Python package.
stummjr/django-oscar
Domain-driven e-commerce for Django
stummjr/django-survey
a simple django framework for creating and conducting surveys
stummjr/doc.scrapinghub.com
Scrapinghub Documentation
stummjr/extruct
Extract embedded metadata from HTML markup
stummjr/gae-simpleblog
A very simple blog tool written in Python + Google App Engine
stummjr/hackernews_crawler
A spider for hacker news.
stummjr/pydepta
A python implementation of DEPTA
stummjr/python-hubstorage
HubStorage client library
stummjr/python-scrapinghub
A client interface for Scrapinghub's API
stummjr/pythonbrasilblog
Blog oficial da Python Brasil 12
stummjr/scrapy-splash
Scrapy+Splash for JavaScript integration
stummjr/shub
Scrapinghub Command Line Client
stummjr/skinfer
Skinfer is a tool for inferring and merging JSON schemas
stummjr/spidyquotes
Example site for a web scraping tutorial
stummjr/stummjr.github.io-old
My homepage.
stummjr/sublimetext-markdown-preview
markdown preview plugin for sublime text 2
stummjr/tdc_crawler
Exemplo de crawler para a palestra no TDC 2016.