AbsaOSS/cobrix

DataFrame with some columns in EBCDIC

Opened this issue · 1 comments

I have this code:

val df = spark
.read
.format("za.co.absa.cobrix.spark.cobol.source")
.option("ebcdic_code_page", "common_extended")
.option("copybook", copybook)
.load("ruta"
.select("Campo")

but get this error:

za.co.absa.cobrix.cobol.parser.encoding.EBCDIC$ cannot be cast to za.co.absa.cobrix.cobol.parser.encoding.Encoding

if i read the parquet, with spark reading, the column contains infromation such as "1b64b427", i need convert to ascii

the file copyboock i`ve wirtten like

01 Campo String

And

01 Campo X(76)

But get the same error. What its no correct?

Hi, I can't reproduce the issue. Could you please, paste the exact code snipped you are using, and the exact copybook?

Also, which Cobrix version are you using?