
Scrape a Tumblr blog for personal posts

Primary Language: JavaScript

Tumblr Personal Post Scraper

Scrape user-uploaded content from a Tumblr blog. Tumblr provides no native way to do this, and alternative approaches are notoriously slow. Only blogs hosted on a tumblr.com subdomain are currently supported; blogs on custom domains cannot be scraped.
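To illustrate the hosting restriction, here is a minimal sketch of how an input could be checked before scraping. The helper name isTumblrHostedBlog is hypothetical and is not taken from this repository; the scraper's own validation may work differently.

```javascript
// Hypothetical helper, not part of this project's code.
// Returns true when the input points at a blog on a tumblr.com subdomain,
// false for custom domains or unparseable input.
function isTumblrHostedBlog(input) {
  let hostname;
  try {
    // Accept bare hostnames such as "support.tumblr.com" as well as full URLs.
    const hasScheme = /^[a-z][a-z0-9+.-]*:\/\//i.test(input);
    const withScheme = hasScheme ? input : `https://${input}`;
    hostname = new URL(withScheme).hostname.toLowerCase();
  } catch (err) {
    return false; // not parseable as a URL
  }
  const suffix = '.tumblr.com';
  // Require a subdomain label in front of tumblr.com and exclude "www".
  return hostname.endsWith(suffix) &&
    hostname.length > suffix.length &&
    hostname !== 'www.tumblr.com';
}
```

With this sketch, isTumblrHostedBlog('https://support.tumblr.com') passes the check, while a custom domain such as 'https://example.com' is rejected.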

Utilizes: Electron Framework, React, Browserify & Babelify.

Example scrape for images on https://support.tumblr.com: https://gyazo.com/9eb0825ddca040f8467838ca519029e9

Contribution

git clone https://github.com/lluisrojass/Tumblr-Blog-Scraper.git
cd Tumblr-Blog-Scraper
npm install

The npm install command downloads several Babel presets and plugins used by the renderer process. Further reading: Class Properties Transform, ES2015 Preset, React Preset.

When developing, use npm run start to launch the app and npm run watch to run a watchify script that rebuilds the bundle when files change. To enable electron-reload and Chrome DevTools, set devmode to true in config.json.
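As a rough sketch of what the devmode flag might gate in the Electron main process: the function name applyDevMode and its parameters are assumptions made for illustration, and the project's actual wiring may differ. The reload hook is passed in as a callback so the sketch stays runnable outside Electron.

```javascript
// Hypothetical sketch, not this project's actual main-process code.
// config:       parsed contents of config.json, e.g. { "devmode": true }
// win:          an Electron BrowserWindow (or any object with the same shape)
// enableReload: callback that starts live reload, for example
//               () => require('electron-reload')(__dirname)
function applyDevMode(config, win, enableReload) {
  if (!config || config.devmode !== true) {
    return false; // production behaviour: no reload, no DevTools
  }
  enableReload();                 // watch files and reload windows on change
  win.webContents.openDevTools(); // open Chrome DevTools for this window
  return true;
}
```

In a main process this could be called once after the window is created, for example applyDevMode(require('./config.json'), mainWindow, () => require('electron-reload')(__dirname)), where mainWindow is an assumed variable name.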

To-do list:

  • Option to export post links to a text file.
  • Proxy Support.
  • Gzip Response.