It possible to prevent decompression and/or splitting in local or inline mode
Opened this issue · 0 comments
anjackson commented
I'm dealing with some block-gzipped files that I would like to pass directly into my MyJob code, without the runner decompressing or splitting them. I've got this working under the hadoop
running, but the local
and inline
runner always force decompression and always force file splitting (defined in SimMRJobRunner
I think).
Is there a configuration option that would make it possible to prevent splitting, and ideally also another that prevents decompression?