Lzop Codec in Apache Spark
srinicodeit opened this issue · 1 comment
srinicodeit commented
Is there any documentation on how to use the LzopCodec codec in Apache Spark?
dain commented
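You will need to create a subclass of the codec under the exact class name Hadoop used for it, because Hadoop, unfortunately, encodes the codec's class name into its file formats. A reader looks the codec up by that recorded name, so the class has to exist under that name on the classpath. Here is how we did this in a Trino test: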
https://github.com/trinodb/trino/blob/master/lib/trino-hive-formats/src/test/java/com/hadoop/compression/lzo/LzopCodec.java#L18
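
For illustration only, here is a minimal, self-contained sketch of why the empty subclass works. It does not use Hadoop or this library: `Codec`, `ModernLzopCodec`, `LegacyLzopCodec`, and `codecFromHeader` are hypothetical stand-ins. In a real setup the subclass would be declared as `com.hadoop.compression.lzo.LzopCodec` (the name in the linked file's path) and would extend this library's Lzop codec, which is an assumption you should check against the linked source.

```java
// Hypothetical stand-in for a codec interface; this is NOT Hadoop's CompressionCodec.
interface Codec {
    String describe();
}

// Stand-in for the implementation you actually want to run.
class ModernLzopCodec implements Codec {
    public ModernLzopCodec() {}

    @Override
    public String describe() {
        return "modern lzop implementation";
    }
}

// Empty subclass: its only job is to exist under the name that existing files recorded.
class LegacyLzopCodec extends ModernLzopCodec {
    public LegacyLzopCodec() {}
}

public class CodecNameDemo {
    // Mimics a reader that finds a codec class name in a file header
    // and instantiates that class reflectively.
    static Codec codecFromHeader(String classNameInHeader) throws ReflectiveOperationException {
        Class<?> type = Class.forName(classNameInHeader);
        return (Codec) type.getDeclaredConstructor().newInstance();
    }

    public static void main(String[] args) throws Exception {
        // In a real file this string comes from the header, not from the class literal.
        String recordedName = LegacyLzopCodec.class.getName();
        Codec codec = codecFromHeader(recordedName);
        System.out.println(recordedName + " resolves to: " + codec.describe());
    }
}
```

The lookup succeeds only because a class with the recorded name exists, and all behaviour is inherited from the parent. For Spark, the same idea would presumably apply as long as the subclass is on the classpath of the driver and the executors.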