chezou/tabula-py

FutureWarning: errors='ignore' is deprecated and will raise in a future version.

Siddhesh-Agarwal opened this issue · 3 comments

Summary

Getting a warning

Did you read the FAQ?

  • I have read the FAQ

Did you search GitHub Discussions?

  • I have searched the discussions

(Optional) PDF URL

No response

About your environment

- Python 3.10.13
- openjdk version "11.0.16.1" 2022-08-12 LTS
- OpenJDK Runtime Environment Microsoft-40648 (build 11.0.16.1+1-LTS)
- OpenJDK 64-Bit Server VM Microsoft-40648 (build 11.0.16.1+1-LTS, mixed mode)
- Windows 11

What did you do when you faced the problem?

I just rand the read_pdf() function

Code

from tabula import read_pdf

read_pdf(<PATH TO PDF>, pages="all")

Expected behavior

NA

Actual behavior

/lib/site-packages/tabula/io.py:1045: FutureWarning: errors='ignore' is deprecated and will raise in a future version. Use to_numeric without passing errors and catch exceptions explicitly instead

df[c] = pd.to_numeric(df[c], errors="ignore")

Related issues

No response

So, It seems from the error message that this line is the cause of the warning.

Edit: It seems pandas is removing it very soon in the future

@Siddhesh-Agarwal Thank you for reporting. I made a patch to suppress the warning. Please have a try with master branch.