chezou/tabula-py

Exception During read_pdf: SEVERE: Cannot read JPEG2000 image: Java Advanced Imaging (JAI) Image I/O Tools are not installed

drjubbs opened this issue · 1 comments

I'm getting the following exception during tabula.read_pdf():
SEVERE: Cannot read JPEG2000 image: Java Advanced Imaging (JAI) Image I/O Tools are not installed

Running on Windows 10
java version "18.0.2" 2022-07-19

Link to the PDF causing trouble: https://api.environdec.com/api/v1/EPDLibrary/Files/6410b3fe-07f0-4766-b6b0-08da4d16d787/Data

tabula.environment_info()
Python version:
3.10.4 | packaged by conda-forge | (main, Mar 30 2022, 08:38:02) [MSC v.1916 64 bit (AMD64)]
Java version:
java version "18.0.2" 2022-07-19
Java(TM) SE Runtime Environment (build 18.0.2+9-61)
Java HotSpot(TM) 64-Bit Server VM (build 18.0.2+9-61, mixed mode, sharing)
tabula-py version: 2.4.0
platform: Windows-10-10.0.19044-SP0
uname:
uname_result(system='Windows', node='windows-knime', release='10', version='10.0.19044', machine='AMD64')
linux_distribution: ('', '', '')
mac_ver: ('', ('', '', ''), '')

tabula-py is working fine on a sample PDF lacking images (just a simple table).

@drjubbs this issue was automatically closed because it did not follow the issue template