Issues
Python 3.12 support
#2222 opened by dotlambda - 1
Read Specific Column From csv file
#2219 opened by Hetaksh - 1
total sort
#2217 opened by heckboy-star - 0
Failure to run mrjob on dataproc
#2216 opened by BradHolmes - 2
trying to run mr job python script
#2213 opened by iitspratham - 0
Hadoop counter in mrjob
#2212 opened by ShunyangLi - 1
Error when running on hadoop "Found 2 unexpected arguments on the command line"
#2201 opened by nadavdor15 - 0
running local mode error
#2168 opened by logique233 - 1
ignore unrecognized arguments
#2210 opened by dhuy237 - 0
Assign tags on EMR creation in single API call
#2207 opened by mgmarino - 0
Can I write map and reduce in many different classes?
#2206 opened by dhuy237 - 0
Is it possible to prevent decompression and/or splitting in local or inline mode?
#2205 opened by anjackson - 0
add_passthru_arg on hadoop
#2204 opened by lobequadrat - 1
NameError: argments is not defined
#2190 opened by azzedineA - 2
max_clusters_in_pool option
#2192 opened by coyotemarin - 0
add pool_timeout_minutes option
#2199 opened by coyotemarin - 0
pool_wait_minutes shouldn't wait if pool is empty
#2198 opened by coyotemarin - 1
add pool_jitter_seconds option
#2200 opened by coyotemarin - 1
integrate describe_cluster() calls with cluster cache
#2186 opened by coyotemarin - 2
join pooled clusters based on yarn cluster metrics
#2191 opened by coyotemarin - 0
concurrent steps on EMR clusters
#2185 opened by coyotemarin - 3
support docker on EMR 6.x AMIs
#2179 opened by coyotemarin - 1
How to launch more than one reducer to execute a job?
#2181 opened by ParadoxZW - 0
Spark harness is not populating counters when counter-output-dir is not an S3 path
#2176 opened by 88manpreet - 14
put most pooling info in cluster name
#2160 opened by coyotemarin - 8
lock clusters with EMR tags, not S3
#2161 opened by coyotemarin - 1
tags should start with "mrjob:" not "__mrjob"
#2173 opened by coyotemarin - 1
cluster locks are never released
#2162 opened by coyotemarin - 1
don't list steps when pooling
#2159 opened by coyotemarin - 2
default to 'python2.7', not 'python' when on Python 2
#2151 opened by coyotemarin - 1
Support Python 3.8
#2150 opened by coyotemarin - 0
extra_cluster_params should merge dictionaries
#2154 opened by coyotemarin - 2
newer PyYAML doesn't work with Python 3.4
#2149 opened by coyotemarin - 1
Suggestion: Add an option to accept empty lines in json
#2148 opened by trisch-me - 0
Support --conf-path when using mrjob spark-submit
#2147 opened by mj3c - 0
Support not waiting on job completion
#2145 opened by mgmarino - 2
Override SparkStep's input_path
#2144 opened by mj3c - 0
How to use mrjob to read pyspark rdds or dataframe?
#2140 opened by Alxe1 - 1