/magnets

magnets.js is a framework written on top of node.js which provides a module system and some sample modules to crawl pictures from websites

Primary LanguageJavaScript

FUCKING MAGNETS HOW DO THEY WORK?

This project is designed to be a modular image grabber/crawler written in node.js.

It's main purpose is derived from this XKCD quote:

With the collapse of the dollar the government has endorsed an alternate currency. Your monetary worth is now determined by the number of funny pictures savedto your hard drive.

Quote from XKCD

The name was borrowed from the 'fucking magnets' meme.

FEATURES

  • download all the funny pictures on the internet directly to your hard disk

  • extensible plugin facility with multi-module support

  • modules for high-quality funny pics

    • icanhascheezburger-network like failblog and lolcats
    • soup.io
    • kqe
    • bildschirmarbeiter
    • ... more
  • FUNNY PICTURES ALL THE WAY TODO: =====

    • Live Ticker (scheduler) - partly implemented
    • Other Plugins ( recently added cheezburger_network)
    • Add debugging and inspection howto
    • Support commandline options like loglevel
    • Use node-htmlparser instead of own regex (used in cheezburger as first plugin
    • meta data for every picture

Used node.js libraries:

in short: npm install trollop htmlparser soupselect http://github.com/pfleidi/node-wwwdude/tarball/master

Workaround to delete downloaded html files:

This is not necessary as wwwdude supports redirects! For historic reasons -> $ for i in *; do file $i; done | grep HTML | awk '{print $1}' | sed -e 's/://' | xargs rm -v