chezou/tabula-py

[BUG] Error importing jpype dependencies. Fallback to subprocess. No module named 'org.apache'

tgallimore9 opened this issue · 1 comments

Summary

I'm trying to read a file from my local drive. I've checked my Java Environment and it seems to be fine. This Org.Apache issue seems to occur on both my mac and windows PC

Did you read the FAQ?

  • I have read the FAQ

Did you search GitHub Discussions?

  • I have searched the discussions

(Optional) PDF URL

No response

About your environment

Python version:
    3.8.18 (default, Sep 11 2023, 13:39:12) [MSC v.1916 64 bit (AMD64)]
Java version:
    java version "1.8.0_411"
Java(TM) SE Runtime Environment (build 1.8.0_411-b09)
Java HotSpot(TM) 64-Bit Server VM (build 25.411-b09, mixed mode)
tabula-py version: 2.9.0
platform: Windows-10-10.0.17763-SP0
uname:
    uname_result(system='Windows', node='vm-uks-dsvm13', release='10', version='10.0.17
763', machine='AMD64', processor='Intel64 Family 6 Model 85 Stepping 7, GenuineIntel')
linux_distribution: ('', '', '')
mac_ver: ('', ('', '', ''), '')

What did you do when you faced the problem?

Made sure my Java environment was set correctly
checked all different articles for Tabula-py

Code

import tabula
import pandas
pdf_path = "X:\GEM Opps\Toni\Barrick\PDF\Barrick_Q2_2023_MD-A.pdf"
dfs = tabula.read_pdf(pdf_path, stream=True, pages=5)

read_pdf returns list of DataFrames

print(len(dfs))
dfs[0]

Expected behavior

Not sure, I expect the table to show

Actual behavior

Error importing jpype dependencies. Fallback to subprocess.
No module named 'org.apache'
1

Related issues

Can I also ask if tabula can read to local drives. I'm not sure if this contributes to the reason but this is for a project so an answer for this would be really helpful. Thank you

It's just a warning message. It doesn't relate to the fact that no table was extracted.

tabula-py supports to read PDFs on a local drive.