/mhtml2html

Convert MHTM file to single html embedded all resources with base64 encoding

Primary LanguageGoGNU General Public License v3.0GPL-3.0

mhtml2html

Convert MHTM file to a single HTML embedded all resources with base64 encoding, so that your Spotlight or other desktop search engine can index them by default.

Update

  • 2019-01-07: add script "convertConfluencePages.sh" to batch convert and clean MHT files.

    Note: it only work on where there is gnu sed command.

  • 2019-01-03: support minifiying HTML/CSS/scripts.

  • 2019-01-03: support removing elements or attributes.

  • 2019-01-01: embed all resources in a single HTML file, including scripts, style sheets and fonts.

  • 2018-12-31: clone fromUnMHT at gitlab .

Install

go get -u github.com/dingqiangliu/mhtml2html

Usage

# mhtml2html -h
mhtl2html [options] MHTMLFILE
  -b	optional: browsing result(default: false, ouput to stdout)
  -m	optional: need minify output(default: false)
  -ra value
    	repeatablely optional: pairs of jquery like elements selector and attribute to be removed
  -re value
    	repeatablely optional: jquery like elements selector to be removed

# convert MHTL file to a single HTML file.
mhtml2html previously-saved.mht > singlefile.html

# open MHTL with default browser, no matter it support MHTL format or not.
mhtml2html -b previously-saved.mht