/baghand

A simple command-line tool for converting between zip format and tarballs of gzipped files, accomplished without any compressing or decompressing.

Primary LanguageCMIT LicenseMIT

baghand

A simple command-line tool for converting between zip format and tarballs of gzipped files, accomplished without any compressing or decompressing.

=====

naming

Baghand is stupidly named, this is intentional. The name comes from the fact that "zip" is (in my opinion) a terrible name. Zip and compression have become almost synonymous, but zip was named after the "zipper", which is a device for keeping a opening closed, it does not perform any compression, and if it is used for compression, is likely burst, making it a poor choice for such a task, and a silly name. If you think about how one might pack luggage for a long trip, the contents of the bag are compressed to fit, but the zipper does perform the compression, instead you would use your hand to compress the items into the bag before using the zipper to close the bag. Baghand is named after the tool that actually does the compression, instead of the closure mechanism. That being said, however, baghand does not actually do compression or decompression of files. Instead it changes the packaging around the compressed data from zip to tar. Thus rendering the name and analogy incorrect, however, the name baghand was chosen primarily because it serves to poke fun at the naming of zip (and derivatives). However, it does not serve this purpose well either. A solid argument could be made in favor of zip (but not gzip), since the actual compression algorithm is actually called "deflate", and the zip file is arguably just a container for deflated data. If one were to accept this argument, zip actually becomes an appropriate name for the format. That being said, I like reject all arguments. Baghand is the proper name for this tool, and zip is a stupid name.

justification

Baghand might seem somewhat pointless, since there exist already programs in common use to extract zip files on all operating systems, which is often what is desired. However, users that prefer command line interfaces often find zip files are less convenient to work with than other formats (such as gzipped tarballs). Baghand can be used to work with zip files without actually extracting the zip file, and instead simply converting it into a more favorable format. That being said, the files created by baghand from zip files will likely inherit many of the unfavorable elements of the zip file. Much of the benefit of tarballs over zip files comes from the internal directory structure of the files inside it instead of the format itself, and baghand will not solve this problem as it uses the directory structure from the zip instead. Additionally, baghand does not (currently) create gzipped tarballs (which are the familiar format to many command line users), instead it creates tarballs of gzipped files. Tarballs of gzipped files may be extracted with the same tools as gzipped tarballs (just in a different order), but some of the benefits of gzipped tarballs is lost. For instance, a tarball of multiple files could not be extracted by creating a pipeline of gzip and tar. Additionally, tarballs of gzipped files do not get the benefit of compression accross file boundaries. As such, baghand's purpose is mainly academic. It basically exists because it can exist. However, there is always a chance that this tool will have a real practical use to some users.

future

In the future, baghand might be able to convert from gz.tar to zip as well. However, the most interesting potential future feature would be converting a zip file into a complete gzipped tarball. I believe that there is a good chance this is possible (depending entirely on existing tar implementations). This would work by simply marking all generated headers as "stored" (a non-compressed type of block in a deflate stream), appending the file data to each header, and then joining all files together as separate gzip files concatenated together. Any tar implementation that calls the external program gzip to decompress should theoretically be able to decompressgzipped tarballs produced in this way, but there could be an issue if implementations exist that reimplement the gzip program's functionality and do not support concatenated gz files.