A short how-to on converting PDF files to HTML. The original PDF files can be found at http://www.iraqinquiry.org.uk
- Download all required PDF files
- Place the PDF files in a single folder (unless you want to work in batches)
- Open a terminal and navigate to the folder (I use ~/Documents/publications/{$PUBNAME})
- Convert the PDF files to ODT with unoconv:
mkdir -p ./odt && unoconv -f odt --outputpath ./odt/ *.pdf
- Convert the ODT files to HTML with LibreOffice:
mkdir -p ./html && libreoffice --headless --convert-to html --outdir ./html ./odt/*.odt
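After both conversions have run, it can be useful to check that every PDF actually ended up with an HTML file. The following is a minimal sketch, not part of the original workflow: the function name `list_missing_html` is hypothetical, and it assumes that both tools keep the base name of the input file (so `report.pdf` becomes `./html/report.html`).

```shell
# Hypothetical helper: print every PDF in the current folder that has no
# matching file in ./html. Assumes the converters keep the base file name.
list_missing_html() {
    for pdf in ./*.pdf; do
        # If there are no PDFs, the glob stays literal; skip it.
        [ -e "$pdf" ] || continue
        name=$(basename "$pdf" .pdf)
        # Report the PDF when its HTML counterpart does not exist.
        [ -f "./html/$name.html" ] || printf '%s\n' "$name.pdf"
    done
}
```

Run `list_missing_html` from the folder that holds the PDF files. Empty output means every PDF has a matching HTML file; any names printed are candidates for re-running the two conversion commands.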
To report problems or suggest improvements, use Issues and pull requests.