uploadfs
uploadfs copies files to a web-accessible location and provides a consistent way to get the URLs that correspond to those files. uploadfs can also resize, crop and autorotate uploaded images. uploadfs includes both S3-based and local filesystem-based backends and you may supply others. The API offers the same conveniences with both backends, avoiding the most frustrating features of each:
- Parent directories are created automatically as needed (like S3)
- Content types are inferred from file extensions (like the filesystem)
- Files are by default marked as readable via the web (like a filesystem + web server)
- Images can be automatically scaled to multiple sizes
- Images can be cropped
- Images are automatically rotated if necessary for proper display on the web (i.e. iPhone photos with rotation hints are right side up)
- Image width, image height and correct file extension are made available to the developer
- Non-image files are also supported
- Web access to files can be disabled and reenabled
- Animated GIFs are preserved, with full support for scaling and cropping (if you have
imagemagick
orimagecrunch
)
You can also remove a file if needed.
It is possible to copy a file back from uploadfs, but there is no API to retrieve information about files in uploadfs. This is intentional. Constantly manipulating directory information is much slower in the cloud than on a local filesystem and you should not become reliant on it. Your code should maintain its own database of file information if needed, for instance in a MongoDB collection. Copying the actual contents of the file back may occasionally be needed however and this is supported.
Requirements
You need:
-
A "normal" filesystem in which files stay put forever, OR Amazon S3, OR a willingness to write a backend for something else (look at
s3.js
andlocal.js
for examples; just supply an object with the same methods, you don't have to supply a factory function). -
Patience, to wait for Jimp to convert your images; or Imagemagick, if you want much better speed and full animated GIF support; OR, on Macs, the very fast imagecrunch utility. You can also write a backend for something else (look at
imagemagick.js
,imagecrunch.js
, andjimp.js
for examples; just supply an object with the same methods, you don't have to supply a factory function). -
gifsicle is an optional tool that processes large animated GIFs much faster. Currently, Imagemagick is a prerequisite for using it. Turn it on with the
gifsicle: true
option when callinginit
. Of course you must installgifsicle
to use it. (Hint: your operating system probably has a package for it. Don't compile things.) -
A local filesystem in which files stay put at least during the current request, to hold temporary files for Imagemagick's conversions. This is no problem with Heroku and most other cloud servers. It's just long-term storage that needs to be in S3 for some of them.
Note that Heroku includes Imagemagick. You can also install it with apt-get install imagemagick
on Ubuntu servers. The official Imagemagick binaries for the Mac are a bit busted as of this writing, but macports or homebrew can install it. Or, you can use imagecrunch, a fast, tiny utility that uses native MacOS APIs.
API Overview
Here's the entire API:
-
The
init
method passes options to the backend and invokes a callback when the backend is ready. -
The
copyIn
method takes a local filename and copies it to a path in uploadfs. (Note that Express conveniently sets us up for this by dropping file uploads in a temporary local file for the duration of the request.) -
The
copyImageIn
method works likecopyIn
. In addition, it also copies in scaled versions of the image, corresponding to the sizes you specify when callinginit()
. Information about the image is returned in the second argument to the callback. -
If you wish to crop the image, pass an options object as the third parameter to
copyImageIn
. Set thecrop
property to an object withtop
,left
,width
andheight
properties, all specified in pixels. These coordinates are relative to the original image. When you specify thecrop
property, both the "full size" image copied into uploadfs and any scaled images are cropped. The uncropped original is NOT copied into uploadfs. If you want the uncropped original, be sure to copy it in separately. Thewidth
andheight
properties of theinfo
object passed to your callback will be the cropped dimensions. -
The default JPEG quality setting for scaled-down versions of your image is
80
. This avoids unacceptably large file sizes for web deployment. You can adjust this via thescaledJpegQuality
option, either when initializing uploadfs or when callingcopyImageIn
. -
The
copyOut
method takes a path in uploadfs and a local filename and copies the file back from uploadfs to the local filesystem. This should be used only rarely. Heavy reliance on this method sets you up for poor performance in S3. However it may be necessary at times, for instance when you want to crop an image differently later. Heavy reliance on copyOut is a recipe for bad S3 performance. Use it only for occasional operations like cropping. -
The
remove
method removes a file from uploadfs. -
The
getUrl
method returns the URL to which you should append uploadfs paths to fetch them with a web browser. -
The
disable
method shuts off web access to a file. Depending on the storage backend it may also block thecopyOut
method, so you should be sure to callenable
before attempting any further access to the file. -
The
enable
method restores web access to a file. -
The
getImageSize
method returns the currently configured image sizes. -
The
identifyLocalImage
method provides direct access to theuploadfs
functionality for determining the extension, width, height and orientation of images. NormallycopyIn
does everything you need in one step, but this method is occasionally useful for migration purposes.
Working Example
For a complete, very simple and short working example in which a user uploads a profile photo, see sample.js
.
Here's the interesting bit. Note that we do not supply an extension for the final image file, because we want to let Imagemagick figure that out for us.
app.post('/', function(req, res) {
uploadfs.copyImageIn(req.files.photo.path, '/profiles/me', function(e, info) {
if (e) {
res.send('An error occurred: ' + e);
} else {
res.send('<h1>All is well. Here is the image in three sizes plus the original.</h1>' +
'<div><img src="' + uploadfs.getUrl() + info.basePath + '.small.' + info.extension + '" /></div>' +
'<div><img src="' + uploadfs.getUrl() + info.basePath + '.medium.' + info.extension + '" /></div>' +
'<div><img src="' + uploadfs.getUrl() + info.basePath + '.large.' + info.extension + '" /></div>' +
'<div><img src="' + uploadfs.getUrl() + info.basePath + '.' + info.extension + '" /></div>');
}
});
});
Note the use of uploadfs.getUrl()
to determine the URL of the uploaded image. Use this method consistently and your code will find the file in the right place regardless of the backend chosen.
Retrieving Information About Images
When you successfully copy an image into uploadfs with copyImageIn, the second argument to your callback has the following useful properties:
width
(already rotated for the web if necessary, as with iPhone photos)
height
(already rotated for the web if necessary, as with iPhone photos)
originalWidth
(not rotated)
originalHeight
(not rotated)
extension
(gif
,jpg
or png
)
You should record these properties in your own database if you need access to them later.
When cropping, the uncropped size of the original image is not returned by uploadfs. It is assumed that if you are cropping you already know what the original dimensions were.
The same information is available via identifyLocalImage
if you want to examine a local file before handing it off to copyImageIn
.
Removing Files
Here's how to remove a file:
uploadfs.remove('/profiles/me.jpg', function(e) { ... });
Disabling Access To Files
This call shuts off web access to a file:
uploadfs.disable('/profiles/me.jpg', function(e) { ... });
And this call restores it:
uploadfs.enable('/profiles/me.jpg', function(e) { ... });
Depending on the backend, disable
may also block the copyOut method, so be sure to call enable
before attempting any further access to the file. (Unfortunately S3 does not offer an ACL that acts exactly like chmod 000
, thus this slight inconsistency.)
Configuration Options
Here are the options we pass to init()
in sample.js
. Note that we define the image sizes we want the copyImageIn
function to produce. No image will be wider or taller than the limits specified. The aspect ratio is always maintained, so one axis will often be smaller than the limits specified. Here's a hint: specify the width you really want, and the maximum height you can put up with. That way only obnoxiously tall images will get a smaller width, as a safeguard.
{
backend: 'local',
image: 'imagemagick',
// Optional. If not specified, ImageMagick or imagecrunch will be used.
// Options are 'imagemagick', 'imagecrunch', 'jimp', or a custom image processing backend
uploadsPath: __dirname + '/public/uploads',
uploadsUrl: 'http://localhost:3000' + uploadsLocalUrl,
// Required if you use copyImageIn
// Temporary files are made here and later automatically removed
tempPath: __dirname + '/temp',
imageSizes: [
{
name: 'small',
width: 320,
height: 320
},
{
name: 'medium',
width: 640,
height: 640
},
{
name: 'large',
width: 1140,
height: 1140
}
],
// Render up to 4 image sizes at once. Note this means 4 at once per call
// to copyImageIn. There is currently no built-in throttling of multiple calls to
// copyImageIn
parallel: 4
}
Here is an equivalent configuration for S3:
{
backend: 's3',
// Get your credentials at aws.amazon.com
secret: 'xxx',
key: 'xxx',
// You need to create your bucket first before using it here
// Go to aws.amazon.com
bucket: 'getyourownbucketplease',
// For read-after-write consistency in the US East region.
// You could also use any other region name except us-standard
region: 'external-1',
// Required if you use copyImageIn
tempPath: __dirname + '/temp',
imageSizes: [
{
name: 'small',
width: 320,
height: 320
},
{
name: 'medium',
width: 640,
height: 640
},
{
name: 'large',
width: 1140,
height: 1140
}
],
// Render up to 4 image sizes at once. Note this means 4 at once per call
// to copyImageIn. There is currently no built-in throttling of multiple calls to
// copyImageIn
parallel: 4
}
"Why don't you put the temporary files for imagemagick in S3?"
Two good reasons:
-
Imagemagick doesn't know how to write directly to S3.
-
Constantly copying things to and from S3 is very slow compared to working with local temporary files. S3 is only fast when it comes to delivering your finished files to end users. Resist the temptation to use it for many little reads and writes.
Less Frequently Used Options
-
In backends like imagemagick that support it, even the "original" is rotated for you if it is not oriented "top left," as with some iPhone photos. This is necessary for the original to be of any use on the web. But it does modify the original. So if you really don't want this, you can set the
orientOriginals
option tofalse
. -
It is possible to pass your own custom storage module instead of
local
ors3
. Followlocal.js
ors3.js
as a model, and specify your backend like this:storage: require('mystorage.js')
-
You may specify an alternate image processing backend via the
image
option. Two backends,imagemagick
andimagecrunch
, are built in. imagecrunch is a Mac-specific optional tulity that is much faster thanimagemagick
. You may also supply an object instead of a string to use your own image processor. Just follow the existingimagecrunch.js
andimagemagick.js
files as a model.
Extra features for S3: caching and CDNs
By default, when users fetch files from S3 via the web, the browser is instructed to cache them for 24 hours. This is reasonable, but you can change that cache lifetime by specifying the cachingTime
option, in seconds:
// 60*60*24*7 = 1 Week
// Images are delivered with cache-control-header
cachingTime: 604800
Also, if you are using a CDN such as cloudfront that automatically mirrors the contents of your S3 bucket, you can specify that CDN so that the getUrl
method of uploadfs
returns the CDN's URL rather than a direct URL to Amazon S3:
cdn: {
enabled: true,
url: 'http://myAwesomeCDN'
}
Note that specifying a CDN in this way does not in any way activate that CDN for you. It just tells uploadfs
to return a different result from getUrl
. The rest is up to you. More CDN-related options may be added in the future.
Important Concerns With S3
Be aware that uploads to Amazon S3's us-standard region are not guaranteed to be readable the moment you finish uploading them. This is a big difference from how a regular filesystem behaves. One browser might see them right away while another does not. This is called "eventual consistency."
If you want your files served from the east coast of the US, set region
to external-1
instead. This causes uploadfs to use the s3-external-1
endpoint, for which Amazon guarantees "read-after-write consistency."
Currently us-standard
is the only region where this is an issue.
However, also be aware that no matter what region you choose, updates of an existing file or deletions of a file still won't always be instantly seen everywhere, even if you don't use the us-standard
region. To avoid this problem, include a version number or randomly generated ID in each filename.
In sample.js
we configure Express to actually serve the uploaded files when using the local backend. When using the s3 backend, you don't need to do this, because your files are served from S3. S3 URLs look like this:
http://yourbucketname.s3.amazonaws.com/your/path/to/something.jpg
But your code doesn't need to worry about that. If you use uploadfs.getUrl()
consistently, code written with one backend will migrate easily to the other.
It's up to you to create an Amazon S3 bucket and obtain your secret and key. See sample.js for details.
S3 support is based on the official AWS SDK.
About P'unk Avenue and Apostrophe
uploadfs
was created at P'unk Avenue for use in many projects built with Apostrophe, an open-source content management system built on node.js. Appy isn't mandatory for Apostrophe and vice versa, but they play very well together. If you like uploadfs
you should definitely check out apostrophenow.org. Also be sure to visit us on github.
Support
Feel free to open issues on github.
Changelog
CHANGES IN 1.5.0
- The s3 storage backend now uses the official AWS SDK for JavaScript. The knox module is no longer maintained and is missing basic request signature support that is mandatory for newer AWS regions. It is no longer a serious option.
Every effort has been made to deliver 100% backwards compatibility with the documented options of knox, and the full test suite is passing with the new AWS SDK.
CHANGES IN 1.4.0
- The new pure-JavaScript
jimp
image backend works "out of the box" even when ImageMagick is not installed. For faster operation and animated GIF support, you should still install ImageMagick. Thanks to Dave Ramirez for contributing this feature.
CHANGES IN 1.3.6
- Octal constants are forbidden in ES6 strict, use
parseInt(x, 8)
. No other changes.
CHANGES IN 1.3.5
- All tests passing.
- Rewrote automatic directory cleanup mechanism of local storage to cope correctly with more complex directory structures.
CHANGES IN 1.3.4
- Bumped dependencies to newer, better maintained versions. All tests passing.
- Removed accidental dependency on
global-tunnel-ng
and commented out a one-time test intest.js
.
CHANGES IN 1.3.3
- Dependency on
request
is no longer locked down to a minor version, which was unnecessary and caused peer dependency failures in some projects (an npm design flaw IMHO, but never mind)
CHANGES IN 1.3.2
- Updated dependency on
rimraf
module to eliminate deprecation warning forgraceful-fs
CHANGES IN 1.3.1
- Whoops, refer to original width and height properly for gifsicle
CHANGES IN 1.3.0
- The
imagemagick
image conversion backend now optionally usesgifsicle
to convert animated GIFs. Turn on this behavior with thegifsicle: true
option. There are tradeoffs:gifsicle
is much faster and uses much less RAM, but seems to produce slightly lower quality results. On a very large animation though, you're almost certain to run out of RAM withimagemagick
. Of course you must installgifsicle
to take advantage of this.
CHANGES IN 1.2.2
- The very short-lived version 1.2.1 did not retain the originals of GIFs (when desired). This has been fixed.
CHANGES IN 1.2.1
- Animated GIF conversion strategy has been customized once again. We found cases in which the combined pipeline was 4x slower (!) and also needed to add in
-coalesce
to prevent bad frames in some cases.
CHANGES IN 1.2.0
-
Added the
cachingTime
andcdn
options. Thanks to Vispercept. -
Fixed a bug where the local storage backend could invoke its callbacks twice, with both failure and success, when an error occurs reading from a local file in newer verisons of node (this bug did not appear in 0.10.x). The fix is backwards compatible.
CHANGES IN 1.1.10
Error message when imagemagick is not installed is a little more informative about what you must do.
CHANGES IN 1.1.9
Use latest knox. No functionality changes.
CHANGES IN 1.1.7-1.1.8
Supports multiple instances when using the default storage and image backends. Previously those backends only supported one instance. This was corrected without changing the public API for custom backends, which have always supported multiple instances.
CHANGES IN 1.1.5-1.1.6
GIF animations have been merged back into the main pipeline thanks to -clone 0--1
which preserves all frames of the animation. It's a little faster, and it's also less code to maintain.
CHANGES IN 1.1.4
GIF animations are preserved in the imagemagick backend, with full support for resizing and cropping. A separate, slower pipeline is used due to limitations of the +clone
mechanism in imagemagick. The API has not changed.
CHANGES IN 1.1.3
The imagecrunch backend now sets adjustedOriginal
correctly when it does a simple copy of the original of a PNG or JPEG.
CHANGES IN 1.1.0
The new disable
and enable
methods turn web access to the specified path off and on again, respectively. The new getImageSizes
method simply gives you access to the image sizes that are currently configured.
There are no changes elsewhere in the code.
CHANGES IN 1.0.0
None! Since the additions in version 0.3.14 we've had no real problems. We now support both alternate storage backends and alternate image rendering backends. Test coverage is thorough and everything's passing. What more could you want? It's time to declare it stable.
CHANGES IN 0.3.15
Decided that imagecrunch should output JSON, so that's now what the backend expects.
CHANGES IN 0.3.14
In addition to storage backends, you may also supply alternate image processing backends. The backend
option has been renamed to storage
, however backend
is accepted for backwards compatibility. The image
option has been introduced for specifying an image processing backend. In addition to the existing imagemagick
backend, there is now an imagecrunch
backend based on the Mac-specific imagecrunch utility.
If you do not specify an image
backend, uploadfs will look for imagecrunch and imagemagick in your PATH, stopping as soon as it finds either the imagecrunch
command or the identify
command.
CHANGES IN 0.3.13
copyImageIn
has been rewritten to run more than 4x faster! We now generate our own imagemagick convert
pipeline which takes advantage of two big optimizations:
- Load, orient and crop the original image only once, then output it at several sizes in the same pipeline. This yields a 2x speedup.
- First scale the image to the largest size desired, then scale to smaller sizes based on that as part of the same pipeline, without creating any lossy intermediate files. This yields another 2x speedup and a helvetica of designers were unable to see any difference in quality. ("Helvetica" is the collective noun for a group of designers.)
The new parallel
option allows you to specify the maximum number of image sizes to render simultaneously. This defaults to 1, to avoid using a lot of memory and CPU, but if you are under the gun to render a lot of images in a hurry, you can set this as high as the number of image sizes you have. Currently there is no throttling mechanism for multiple unrelated calls to uploadfs.copyImageIn
, this option relates to the rendering of the various sizes for a single call.
CHANGES IN 0.3.11
The new parallel
option allows you to specify the maximum number of image sizes to render simultaneously. This defaults to 1, to avoid using a lot of memory and CPU, but if you are under the gun to render a lot of images in a hurry, you can set this as high as the number of image sizes you have. Currently there is no throttling mechanism for multiple unrelated calls to uploadfs.copyImageIn
, this option relates to the rendering of the various sizes for a single call.
CHANGES IN 0.3.7-0.3.10
Just packaging and documentation. Now a P'unk Avenue project.
CHANGES IN 0.3.6
The uploadfs
functionality for identifying a local image file via ImageMagick has been refactored and made available as the identifyLocalImage
method. This method is primarily used internally but is occasionally helpful in migration situations (e.g. "I forgot to save the metadata for any of my images before").
CHANGES IN 0.3.5
Starting in version 0.3.5, you can set the quality level for scaled JPEGs via the scaledJpegQuality option, which defaults to 80. You can pass this option either when initializing uploadfs
or on individual calls to copyImageIn
. This option applies only to scaled versions of the image. If uploadfs modifies the "original" image to scale or orient it, Imagemagick's default behavior stays in effect, which is to attempt to maintain the same quality level as the original file. That makes sense for images that will be the basis for further cropping and scaling but results in impractically large files for web deployment of scaled images. Thus the new option and the new default behavior.
CHANGES IN 0.3.4
Starting in version 0.3.4, the getTempPath() method is available. This returns the same tempPath
that was supplied to uploadfs at initialization time. Note that at this point the folder is guaranteed to exist. This is useful when you need a good place to copyOut
something to, for instance in preparation to copyImageIn
once more to carry out a cropping operation.
CHANGES IN 0.3.3
Starting in version 0.3.3, cropping is available. Pass an options object as the third parameter to copyImageIn
. Set the crop
property to an object with top
, left
, width
and height
properties, all specified in pixels. These coordinates are relative to the original image. When you specify the crop
property, both the "full size" image copied into uploadfs and any scaled images are cropped. The uncropped original is NOT copied into uploadfs. If you want the uncropped original, be sure to copy it in separately. The width
and height
properties of the info
object passed to your callback will be the cropped dimensions.
Also starting in version 0.3.3, uploadfs
uses the gm
module rather than the node-imagemagick
module for image manipulation, but configures gm
to use imagemagick. This change was made because node-imagemagick
has been abandoned and gm
is being actively maintained. This change has not affected the uploadfs
API in any way. Isn't separation of concerns wonderful?
CHANGES IN 0.3.2
Starting in version 0.3.2, you can copy files back out of uploadfs with copyOut
. You should not rely heavily on this method, but it is occasionally unavoidable, for instance if you need to crop an image differently. When possible, cache files locally if you may need them locally soon.
CHANGES IN 0.3.0
Starting in version 0.3.0, you must explicitly create an instance of uploadfs. This allows you to have more than one, separately configured instance, and it also avoids serious issues with modules not seeing the same instance automatically as they might expect. For more information see Singletons in #node.js modules cannot be trusted, or why you can't just do var foo = require('baz').init().
Existing code that isn't concerned with sharing uploadfs between multiple modules will only need a two line change to be fully compatible:
// CHANGE THIS
var uploadfs = require('uploadfs');
// TO THIS (note the extra parens)
var uploadfs = require('uploadfs')();
If you use uploadfs in multiple source code files, you'll need to pass your uploadfs
object explicitly, much as you pass your Express app
object when you want to add routes to it via another file.