[BUG] Error importing jpype dependencies. Fallback to subprocess. No module named 'org.apache'
tgallimore9 opened this issue · 1 comments
Summary
I'm trying to read a file from my local drive. I've checked my Java Environment and it seems to be fine. This Org.Apache issue seems to occur on both my mac and windows PC
Did you read the FAQ?
- I have read the FAQ
Did you search GitHub Discussions?
- I have searched the discussions
(Optional) PDF URL
No response
About your environment
Python version:
3.8.18 (default, Sep 11 2023, 13:39:12) [MSC v.1916 64 bit (AMD64)]
Java version:
java version "1.8.0_411"
Java(TM) SE Runtime Environment (build 1.8.0_411-b09)
Java HotSpot(TM) 64-Bit Server VM (build 25.411-b09, mixed mode)
tabula-py version: 2.9.0
platform: Windows-10-10.0.17763-SP0
uname:
uname_result(system='Windows', node='vm-uks-dsvm13', release='10', version='10.0.17
763', machine='AMD64', processor='Intel64 Family 6 Model 85 Stepping 7, GenuineIntel')
linux_distribution: ('', '', '')
mac_ver: ('', ('', '', ''), '')
What did you do when you faced the problem?
Made sure my Java environment was set correctly
checked all different articles for Tabula-py
Code
import tabula
import pandas
pdf_path = "X:\GEM Opps\Toni\Barrick\PDF\Barrick_Q2_2023_MD-A.pdf"
dfs = tabula.read_pdf(pdf_path, stream=True, pages=5)
read_pdf returns list of DataFrames
print(len(dfs))
dfs[0]
Expected behavior
Not sure, I expect the table to show
Actual behavior
Error importing jpype dependencies. Fallback to subprocess.
No module named 'org.apache'
1
Related issues
Can I also ask if tabula can read to local drives. I'm not sure if this contributes to the reason but this is for a project so an answer for this would be really helpful. Thank you
It's just a warning message. It doesn't relate to the fact that no table was extracted.
tabula-py supports to read PDFs on a local drive.