XSweet docx to html extraction and more

Including extraction of document contents from MS Office Open XML into HTML

Roadmap

Project	Description	In progress?	Done	Issue
XSweet	Inline and class formatting extraction		✔	XSweet/XSweet#13
XSweet	Capture hyperlinks from MS Word		✔	XSweet/XSweet#3
XSweet	Capture end- and footnotes as linked HTML		✔	XSweet/XSweet#2, XSweet/XSweet#22
Editoria Typescript	Preserve note linkages for Wax		✔	XSweet/editoria_typescript#8
XSweet	Recreate Word tables as HTML		✔	XSweet/XSweet#66
Editoria Typescript	Handle tables in Wax
XSweet	Basic list HTML representation		✔	XSweet/XSweet#106
XSweet	Capture list type (unordered, numbered, etc.)
HTMLevator	Heading inferrer		✔	XSweet/HTMLevator/#13
HTMLevator	Heading inferencer Word style improvements			XSweet/HTMLevator/#14
HTMLevator	Plain-text output		✔	XSweet/HTMLevator/#12
HTMLevator	Section inferrer		✔
HTMLevator	Copyediting cleanups and mappings		✔	XSweet/editoria_typescript/issues#21
HTMLevator	Support customized transformations		✔
XSweet	Extract images to HTML; store image files	no but priority		XSweet/XSweet#110
Editoria Typescript	Convert image references for porting to Wax	no but priority
XSweet	Capture Math (possibly multipe formats)	no but priority
XSweet	Support auto-generated fields			XSweet/XSweet#98
XSweet	Support for language features