dbashford/textract

PPTX support?

AlexanderTserkovniy opened this issue · 2 comments

Does it work? Because for me it does not.

$ textract 'test.pptx'
textract not ready, retrying in .5 seconds
textract: 'drawingtotext' does not appear to be installed, so it will be unable
to extract DXFs.
textract: 'catdoc' does not appear to be installed, so it will be unable to extr
act DOCs.
[Error: extract powerpoint, pptx, exec error: Error: stdout maxBuffer exceeded.]

textract currently does not support providing configuration via the command line. The command line isn't meant to be fully featured.

https://github.com/dbashford/textract#configuration

You need to up the maxBuffer. You'll need to write a little code to get this done.

Beefing up the command line...