Issues
CVE-2021-33623
#240 opened by david-nikolai-mueller - 0
CVE-2022-39353
#239 opened by david-nikolai-mueller - 0
fromUrl doesn't work when passing urls to PDF files?
#237 opened by davidawad - 1
CVE-2021-3803
#228 opened by prafullkulkarni - 0
how to get the total page number of the file
#236 opened by venkatesh-pro - 0
unrtf not throwing error
#235 opened by divyeshrajpura4114 - 0
Access (doc | docx) (20 MB) have no reaction
#232 opened by dengzhenhai - 0
OCR for PDFs
#231 opened by boazl-cyera - 0
Get picture from doc?
#229 opened by bigbird231 - 5
CVE-2021-23362
#217 opened by prafullkulkarni - 2
CVE-2021-33623
#220 opened by prafullkulkarni - 5
CVE-2021-21366
#215 opened by OlivierB-OB - 0
CVE-2021-23413
#226 opened by prafullkulkarni - 0
CVE-2021-33587
#225 opened by prafullkulkarni - 0
Abandoned project - viable forks or alternatives
#221 opened by nosferatu500 - 1
CVE-2021-23362
#216 opened by OlivierB-OB - 1
'pdftotext' does not appear to be installed
#213 opened by codingalien-d - 3
Method to check if mime type is supported
#212 opened by ari62 - 0
The docx extractor missed all the Emojis
#211 opened by andyli - 4
bug: update the j library dependency
#186 opened by qinst64 - 0
Support reading docx files in flat opc format
#207 opened by jessrosenfield - 5
image inside pdf
#183 opened by deepdil-sp - 1
Security: update package use of marked library
#202 opened by camsjams - 0
Support srt (application/x-subrip) files
#201 opened by altwohill - 0
Memory maxed out for a 70page document
#187 opened by tiholic - 0
Error: Incorrect parameters passed to textract.
#198 opened by sunnysharma03 - 2
Please update marked
#194 opened by ram-you - 0
Header and footer missing in .odt
#192 opened by fsandx - 4
not able to change language
#191 opened by swamyaddala - 0
Textract Returns Null Value
#185 opened by Jodyadriene - 1
get metainfo - count of pages
#170 opened by raulromanp - 0
AWS S3 bucket file gives "does not exist" error
#178 opened by rmr-code - 0
Extract Hyperlink from images
#182 opened by deepdil-sp - 0
No PPT support
#181 opened by carlosvini - 1
pliz update the npm
#180 opened by apporoad - 0
How to use Regex with the text extracted ?
#177 opened by alexauvray - 1
Problems with garbled characters in docx files
#176 opened by uptown - 0
Equations in docx, pdf extraction
#174 opened by zscrca - 0
Temporary files are not cleaned up
#171 opened by edelache - 0
Support for msg files
#169 opened by roydiasbytes - 1
Error trying to read larger files
#168 opened by lic001rabby