An updated version of impurge by hortinstein (https://github.com/hortinstein/impurge) This is a simple module meant to extract image URLs from imgur pages in a standard format despite differences in URL format.
Example:
var impurge = require('impurge-updated');
impurge.purge("http://imgur.com/IvpcP", function (e,r) {
console.log(r)
});
will result in:
http://i.imgur.com/IvpcP.jpg
or for multiple images
var impurge = require('impurge-updated');
impurge.purge("http://imgur.com/a/QgHRA", function (e,r) {
console.log(r)
});
will result in:
[ 'http://i.imgur.com/OzAiFJ5.jpg',
'http://i.imgur.com/AS76Rhx.jpg',
'http://i.imgur.com/gx436Y2.jpg' ]
It currently supports:
- album urls: http://imgur.com/a/{HASH}
- gallery urls: http://imgur.com/gallery/{HASH}
- hash urls http://imgur.com/{HASH}
- hash_url (multiple) http://imgur.com/{HASH},{HASH},{HASH},
- image_url http://i.imgur.com/{hash}.jpg?1
The test file included will check the currency of the regular expressions against live reddit data to ensure imgur is not changing link formats and will look for any it does not recognize.
A few additional methods that are used for testing the accuracy of the regex's:
impurge.is_imgur(url); //will output true if the url is imgur
The following takes a string input and will return an array of imgur links contained within a text input:
impurge.get_text_imgur_links(text); // will return an array of links
The following is a function to determine the link type (previously used internally)
impurge.determine_link_type(url, function(err, type, id, i_url){
//err is given if link is not recognized
//type is image_url, album_url, gallery_url, hash_url
//id is the id for the link (if applicable)
//i_url is the image url if this is a direct image
});