/python-ooxml

Python library for parsing .docx (Office Open XML) files

Primary language: Python. License: GNU Affero General Public License v3.0 (AGPL-3.0).

About Python-OOXML

Python-OOXML is a Python library for parsing Office Open XML (Microsoft Word .docx) files. HTML is currently the only supported output format, and the library puts strong emphasis on making that output easy to customize.
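To give a feel for what parsing a .docx file into HTML involves, here is a minimal sketch that uses only the Python standard library. It is not Python-OOXML's API: the names `docx_to_html` and `make_sample_docx` are hypothetical, and the sketch handles plain paragraph text only, ignoring styles, lists, tables, and images. It relies on the fact that a .docx file is a ZIP archive whose main content lives in `word/document.xml`.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from html import escape

# WordprocessingML namespace used by word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_html(source):
    """Convert paragraph text of a .docx (path or file-like object) to HTML.

    Hypothetical helper for illustration; not part of Python-OOXML.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    html_parts = []
    # Every <w:p> element is a paragraph; its text is spread over <w:t> runs.
    for para in root.iter(f"{{{W_NS}}}p"):
        text = "".join(t.text or "" for t in para.iter(f"{{{W_NS}}}t"))
        html_parts.append(f"<p>{escape(text)}</p>")
    return "\n".join(html_parts)


def make_sample_docx(paragraphs):
    """Build a stripped-down in-memory archive with only word/document.xml.

    A real .docx contains more parts; this is just enough for the sketch.
    """
    body = "".join(
        f"<w:p><w:r><w:t>{escape(p)}</w:t></w:r></w:p>" for p in paragraphs
    )
    xml = f'<w:document xmlns:w="{W_NS}"><w:body>{body}</w:body></w:document>'
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(docx_to_html(make_sample_docx(["Hello", "a < b"])))
```

The library itself does considerably more than this; see the developer documentation for the actual parsing and serialization interface.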

The library comes with an importer that can split a document into separate chapters. It works with documents that use Word styles as well as documents that do not.
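The idea of splitting a document into chapters can be sketched as follows. This is an illustration under stated assumptions, not the importer's actual algorithm or API: the function `split_into_chapters`, the dictionary keys `style`, `size`, and `text`, and the fallback rule (treat any paragraph with a larger font size than the most common one as a chapter title) are all hypothetical.

```python
from collections import Counter


def split_into_chapters(paragraphs, heading_style="Heading1"):
    """Group paragraphs into (title, [paragraph texts]) chapters.

    Hypothetical sketch. Each paragraph is a dict with a "text" key and
    optional "style" (Word style name) and "size" (font size) keys.
    """
    uses_styles = any(p.get("style") == heading_style for p in paragraphs)

    if uses_styles:
        # Styled document: a chapter starts at each heading-styled paragraph.
        def is_heading(p):
            return p.get("style") == heading_style
    else:
        # Assumed fallback for unstyled documents: the most common font
        # size is body text, anything larger is taken as a chapter title.
        sizes = Counter(p["size"] for p in paragraphs if p.get("size"))
        body_size = sizes.most_common(1)[0][0] if sizes else None

        def is_heading(p):
            return body_size is not None and (p.get("size") or 0) > body_size

    chapters = []
    for p in paragraphs:
        if is_heading(p):
            chapters.append((p["text"], []))
            continue
        if not chapters:
            # Text before the first heading goes into an untitled chapter.
            chapters.append(("", []))
        chapters[-1][1].append(p["text"])
    return chapters


if __name__ == "__main__":
    styled = [
        {"style": "Heading1", "text": "One"},
        {"style": "Normal", "text": "First body paragraph."},
        {"style": "Heading1", "text": "Two"},
        {"style": "Normal", "text": "Second body paragraph."},
    ]
    for title, body in split_into_chapters(styled):
        print(title, body)
```

A font-size heuristic like this is only one possible fallback; the approach the importer really uses for unstyled documents is described in the developer documentation.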

Python-OOXML is used in Booktype to import and convert Word documents.

Documentation

Developer documentation for Python-OOXML can be found at Read the Docs.

License

Python-OOXML is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0).

Authors

Python-OOXML was written by Aleksandar Erkalovic <aerkalov@gmail.com>. Please see the AUTHORS file for a full list of contributors.