Issues
- 0
"Error: source file could not be loaded"
#160 opened by Nakilon - 47
PDFtk dependency issues with CentOS-7/RHEL-7 | Build Fails | Dependencies libgc Unavailable
#123 opened by riker1 - 3
Break PDFs into chunks bigger than 1 page?
#128 opened by AbeHandler - 2
Docsplit.extract_images(path) => bin/rails: No such file or directory - file
#153 opened by crusadergo - 0
ruby 3.2 compatibility
#158 opened by jwoodrow - 3
- 0
Docsplit::ExtractionFailed: gm convert: Unable to open file (/tmp/docsplit/58371.pdf) [No such file or directory]
#156 opened by thanhtoan1196 - 0
- 1
diskspace leak when extracting text from pdf
#151 opened by KHMtravel - 1
- 0
- 0
Different behavior on mac and linux
#145 opened by jbmyid - 0
Email address contains more than three special chars(punctuation) is removed by Docsplit.clean_text method
#144 opened by mraj-rpx - 0
Docsplit.extract_text auto orientation detection 'detect_orientation: true' param does not work.
#143 opened by michaeltranlong - 8
"undefined method `strip' for nil:NilClass" occurs when attempting "Docsplit.extract_pdf"
#130 opened by mrmanishs - 2
Docsplit::TextExtractor#extract_text should return the path of the output text file?
#139 opened by nruth - 0
- 0
Downsampling has gotten worse in the last year
#140 opened by reefdog - 2
- 2
Extract Link (URL, Goto, etc)
#127 opened by dglunz - 1
Executable filename issue with latest version (5.0.4) of LibreOffice on RHEL
#137 opened by neilneyman - 1
rails invalid byte sequence in UTF-8
#135 opened by fjcaro - 0
Horizontal / table formatted text
#136 opened by nofxx - 1
encoding issue
#133 opened by dfang - 0
- 1
Deploy to heroku
#77 opened by josal - 12
- 0
Converting to .doc and .xls files in Plone does not work with latest Libreoffice
#125 opened by gregory-zero - 4
- 13
- 2
Corrupted pdf file from Chinese docx
#122 opened by intellisense - 5
- 1
- 1
- 1
undefined method `strip' for nil:NilClass
#107 opened by singhkishan - 6
German umlauts are replaced by ? after OCR
#116 opened by tbk303 - 3
*** glibc detected *** gm: realloc(): invalid next size: 0x00007f4b7e88e0c0 ***
#113 opened by lordfinal - 1
- 2
Extracting images from PDF hogs 100% CPU
#96 opened by tvsignal - 4
"Invalid byte sequence error" on master.
#106 opened by KurtPreston - 0
libreoffice path in FreeBSD
#109 opened by danniculescu - 2
- 0
- 2
Rubygems release no longer working with recent openoffice versions on Debian/Ubuntu
#99 opened by augustf - 2
Issues with Powerpoint OLE Objects
#94 opened by omsoft - 5
- 0
Extract images on win 7 platform error
#89 opened by eastxing - 0
Can't covert nil into string in ensure_pdfs on server, but works fine locally
#87 opened by chintanparikh - 3
- 2