Yelp/mrjob

It possible to prevent decompression and/or splitting in local or inline mode

Opened this issue · 0 comments

I'm dealing with some block-gzipped files that I would like to pass directly into my MyJob code, without the runner decompressing or splitting them. I've got this working under the hadoop running, but the local and inline runner always force decompression and always force file splitting (defined in SimMRJobRunner I think).

Is there a configuration option that would make it possible to prevent splitting, and ideally also another that prevents decompression?