Pinned issues
Issues
- 3
Use latest six
#465 opened by I-Good-Vegetable - 0
Deprecation Issue
#510 opened by M0inUddin - 1
- 0
Suggestion: Add support for .pdf files
#505 opened by Hala-Hamdoun - 0
pstotext Preventing Packaging
#504 opened by Arszilla - 5
Transfer the project to jazzband?
#498 opened by tfeldmann - 1
error message whilest pip installing
#483 opened by EliteLeadsAI - 1
Non-Standard Dependency Specifier with pip 24.0
#487 opened by mauricefreese - 2
- 0
Requesting compatibility for red hat linux
#486 opened by Tylersuard - 5
Is textract still maintained?
#470 opened by KamarajuKusumanchi - 2
textract 1.6.5 has a non-standard dependency specifier extract-msg<=0.29.*
#476 opened by chapmanjacobd - 0
Support for .one (OneNote) files
#475 opened by jw25116 - 2
- 2
Replace Antiword with a Python alternative
#468 opened by SMillerDev - 0
progress bar for long documents
#467 opened by chanansh - 2
OS (WINDOWS) SUPPORT
#459 opened by knana1662 - 1
textract3-1.6.4.post1 and textract-1.6.5 compilation error: error in beautifulsoup4 setup command: use_2to3 is invalid.
#464 opened by ashish-2022 - 6
Python2 deprecation notice
#390 opened by traverseda - 0
mp3 text extraction Exception - 5MB~ file
#460 opened by RiccardoRomagnoli - 1
- 0
Use of `antiword`
#454 opened by p-linnane - 0
- 0
Issues with textract.process while run within and executable created by pyinstaller
#449 opened by vq75 - 0
- 0
textract.exceptions.ShellError: The command antiword is not installed on your system. Please make sure the appropriate dependencies are installed before using textract
#444 opened by faridelya - 0
Support of Open Office Extesions
#441 opened by dezoito - 0
unsafe for multiprocessing?
#440 opened by chapmanjacobd - 0
Paddle ocr give multi language ?
#438 opened by vinothkanagaraj - 1
MacOS installation is outdated
#437 opened by roablep - 1
Unable to Install on Airflow
#435 opened by raj5287 - 1
Pdfminer on Windows searches for pdf2text.py.exe
#424 opened by PeterTillema - 1
Truncated File error
#373 opened by libgober - 0
Beautifulsoup version
#427 opened by supermanIT - 0
- 0
- 4
Dependencies update
#418 opened by VBobCat - 0
- 5
Transient AGPL Dependency `EbookLib`
#409 opened by thehale - 0
Environment Variable is set but still it can't read from the tessdata directory
#404 opened by raza8899 - 0
building fails on windows. Cannot build lxml wheel.
#401 opened by HourGlss - 1
ppt support
#398 opened by idoabelman - 1
Error extracting text from pdf
#396 opened by sirwentemi - 9
Get a new maintainer
#386 opened by traverseda - 2
PDF decoder encoding issue. Found temporary fix.
#383 opened by Vbansal21 - 3
Travis CI is broken
#387 opened by traverseda - 1
parse space different show between linux and mac
#388 opened by shzy2012 - 0
- 0
- 0
Text cut every 80 characters in .doc files
#367 opened by vesran