XSweet docx to html extraction and more
Including extraction of document contents from MS Office Open XML into HTML
Roadmap
Project | Description | In progress? | Done | Issue |
---|---|---|---|---|
XSweet | Inline and class formatting extraction | ✔ | XSweet/XSweet#13 | |
XSweet | Capture hyperlinks from MS Word | ✔ | XSweet/XSweet#3 | |
XSweet | Capture end- and footnotes as linked HTML | ✔ | XSweet/XSweet#2, XSweet/XSweet#22 | |
Editoria Typescript | Preserve note linkages for Wax | ✔ | XSweet/editoria_typescript#8 | |
XSweet | Recreate Word tables as HTML | ✔ | XSweet/XSweet#66 | |
Editoria Typescript | Handle tables in Wax | |||
XSweet | Basic list HTML representation | ✔ | XSweet/XSweet#106 | |
XSweet | Capture list type (unordered, numbered, etc.) | |||
HTMLevator | Heading inferrer | ✔ | XSweet/HTMLevator/#13 | |
HTMLevator | Heading inferencer Word style improvements | XSweet/HTMLevator/#14 | ||
HTMLevator | Plain-text output | ✔ | XSweet/HTMLevator/#12 | |
HTMLevator | Section inferrer | ✔ | ||
HTMLevator | Copyediting cleanups and mappings | ✔ | XSweet/editoria_typescript/issues#21 | |
HTMLevator | Support customized transformations | ✔ | ||
XSweet | Extract images to HTML; store image files | no but priority | XSweet/XSweet#110 | |
Editoria Typescript | Convert image references for porting to Wax | no but priority | ||
XSweet | Capture Math (possibly multipe formats) | no but priority | ||
XSweet | Support auto-generated fields | XSweet/XSweet#98 | ||
XSweet | Support for language features |
For full XSweet documentation, visit http://xsweet.coko.foundation/xsweet-core.
Check out the other XSweet tools at https://gitlab.coko.foundation/XSweet.