ankushshah89/python-docx2txt
A pure Python utility to extract text and images from docx files.
Python, MIT
Issues
- 1
docx2text - unwrapping zip - fails and crashes
#48 opened by ventz
- 1
Is there the possibility to pass the entire file instead of the file name in the process() function?
#33 opened by Zast996
- 0
Python does not recognize the italic font in Docx
#46 opened by me-suzy
- 3
Page numbers for each page
#44 opened by Higgs32584
- 0
- 1
- 0
Add license classifier to package metadata
#43 opened by C-nit
- 0
how to maintain the format of the File
#42 opened by shashankmuralidhar
- 0
The result contains extra newline
#39 opened by DanteAndroid
- 1
Exception when using docx2txt
#38 opened by DanteAndroid
- 1
difficulty with opening file- updated
#37 opened by mmiesner
- 2
Reading .doc file format
#29 opened by bpkapkar
- 0
- 0
strikethrough strings are not removed as of now
#35 opened by sdeepmars
- 10
It does not convert numbered items
#12 opened by robo3945
- 1
- 1
Can I print all contents of a doc file including images as well as text with images in its original position?
#31 opened by Aniket573
- 5
docx created with word online
#16 opened by burbma
- 1
BadZipFile: File is not a zip file (while iterating through directory of docx files)
#30 opened by youssefavx
- 4
Save list numeration
#24 opened by goshulina
- 0
- 2
Image Paths in generated Text
#21 opened by rushikesh988
- 7
Extract footnote?
#27 opened by vivlio-kumihan
- 1
- 6
py3 support
#11 opened by deanmalmgren
- 0
感谢!Thanks!
#23 opened by lcl1995225
- 0
- 3
extract Images?
#13 opened by lxj0276
- 1
- 4
Don't work how I expected
#17 opened by juniordiasjfd
- 1
- 3
How to extract hyperlinks?
#9 opened by badbye
- 3
excuse me,it can't run from python
#7 opened by Mengqi777
- 6
text = docx2txt.process("file.docx", "/tmp/img_dir") Function not working
#6 opened by GarrettHartley
- 5
Error on print
#5 opened by MannyGrewal
- 3
Is this a working application?
#3 opened by GarrettHartley
- 3
Directory is not empty
#1 opened by superlou