airlift/aircompressor

Lzop Codec in Apache Spark

srinicodeit opened this issue · 1 comment

Is there any documentation on using the LzopCodec codec in Apache Spark?

dain commented

You will need to create a subclass of the aircompressor codec under the exact class name Hadoop used for the codec, because Hadoop, unfortunately, encodes the codec class name into its file formats. Here is how we did this in a Trino test:
https://github.com/trinodb/trino/blob/master/lib/trino-hive-formats/src/test/java/com/hadoop/compression/lzo/LzopCodec.java#L18
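
For anyone who wants to see why the empty subclass is enough, here is a rough, self-contained sketch of the mechanism. It does not use the real Hadoop or aircompressor classes: `Codec`, `AircompressorLzop`, `LegacyNamedLzop` and `load` are hypothetical stand-ins made up for illustration. In a real project the subclass would be declared in the package `com.hadoop.compression.lzo` with the simple name `LzopCodec` (as in the linked Trino test) and would extend aircompressor's LZOP codec.

```java
import java.lang.reflect.Constructor;

public class CodecNameLookupSketch {
    /** Hypothetical stand-in for a Hadoop-style codec interface (not Hadoop's API). */
    interface Codec {
        String defaultExtension();
    }

    /** Hypothetical stand-in for the aircompressor class that does the real work. */
    static class AircompressorLzop implements Codec {
        @Override
        public String defaultExtension() {
            return ".lzo";
        }
    }

    /**
     * Empty subclass whose only job is to exist under the name that files record.
     * It inherits all behavior from the implementation it extends.
     */
    static class LegacyNamedLzop extends AircompressorLzop {}

    /** Resolves a codec by class name, roughly as a reader would with a name taken from a file header. */
    static Codec load(String recordedClassName) throws ReflectiveOperationException {
        Class<?> type = Class.forName(recordedClassName);
        Constructor<?> constructor = type.getDeclaredConstructor();
        return (Codec) constructor.newInstance();
    }

    public static void main(String[] args) throws Exception {
        // The "recorded" name plays the role of the class name stored in the file.
        String recorded = LegacyNamedLzop.class.getName();
        Codec codec = load(recorded);
        System.out.println(recorded + " -> " + codec.defaultExtension());
    }
}
```

For Spark, the real subclass and the aircompressor jar would presumably both need to be on the driver and executor classpath so that the name lookup can succeed; I have not verified a specific Spark setup.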