Column values are not read correctly
nishi1708 opened this issue · 2 comments
Hi,
In my wide row column family, i am storing a byte[]. I have this CF exported in json format using Aegisthus to my HDFS. I have another MR job where i'm trying to read these column values. for some when i converted byte[](text to byte[] conversion), i'm not getting the same byte stream. So i'm not able to read the column values correctly. I'm getting following exception
Error: java.io.EOFException
at java.io.DataInputStream.readFully(DataInputStream.java:197)
at java.io.DataInputStream.readUTF(DataInputStream.java:609)
at java.io.DataInputStream.readUTF(DataInputStream.java:564)
at com.csr.lts.analytics.mapred.transformation.json.RawDataTransformationReducer.reduce(RawDataTransformationReducer.java:79)
Thanks,
Nishi
@nishi1708 I'm not sure why the bytes encoding would be wrong. Were you able to figure anything else out about this?
I know this doesn't help you a lot but in the next release we will let you use any of the Cassandra converters for the column value instead of always using BytesType.
Hi Daniel,
Thanks for your response. I'm closing this issue as i could read the column values now. I wasn't aware of the correct serialized format that why i got into this issue.
Thanks,
Nishi