MAP REDUCE: Hints
Notes for when you need to write a MapReduce (MR) job
Hadoop example
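A minimal sketch of the kind of job this covers, assuming Hadoop Streaming with Python scripts that exchange tab-separated `key<TAB>value` lines. The function names (`map_words`, `reduce_counts`, `run_locally`) are illustrative and not part of any Hadoop API. The local runner only imitates the shuffle with an in-memory sort.

```python
from itertools import groupby


def map_words(lines):
    """Mapper: emit one 'word<TAB>1' record per word."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reduce_counts(sorted_records):
    """Reducer: sum the counts for each run of identical keys.

    Assumes records arrive sorted by key, as they do after the shuffle.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_locally(lines):
    """Simulate map -> sort -> reduce in one process for quick testing."""
    return list(reduce_counts(sorted(map_words(lines))))
```

On a cluster, the mapper and reducer would each read `sys.stdin` and print their records; running the same logic locally first is a cheap way to catch parsing bugs.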
Python mrjob on YARN example (put this in run.sh in the project)
Specifying number of reducers in streaming jar options
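As a sketch, the reducer count can be passed with the `mapreduce.job.reduces` property. The helper below only builds the argument list and does not launch anything; the function name, the jar path, and the script names are placeholders. Generic options such as `-D` have to come before the streaming options (`-input`, `-output`, `-mapper`, `-reducer`).

```python
def streaming_command(jar, input_path, output_path, mapper, reducer, num_reducers):
    """Build an argv list for a Hadoop Streaming job with a fixed reducer count."""
    if num_reducers < 0:
        raise ValueError("num_reducers must be >= 0")
    return [
        "hadoop", "jar", jar,
        # Generic options first: -D sets a job configuration property.
        "-D", f"mapreduce.job.reduces={num_reducers}",
        # Streaming options follow the generic ones.
        "-input", input_path,
        "-output", output_path,
        "-mapper", mapper,
        "-reducer", reducer,
    ]
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` on a machine where the `hadoop` binary is available.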
Specifying key comparator in streaming jar options
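A sketch of the `-D` options involved, assuming the `KeyFieldBasedComparator` that ships with Hadoop and the current (`mapreduce.*`) property names. The helper name `comparator_options` and its parameters are illustrative. A sort spec such as `-k2,2nr` means: sort on the second key field, numerically, in reverse order.

```python
COMPARATOR_CLASS = (
    "org.apache.hadoop.mapreduce.lib.partition.KeyFieldBasedComparator"
)


def comparator_options(sort_spec, key_fields, separator=None):
    """Return -D arguments that sort map output keys by selected fields.

    sort_spec  : Unix-sort style spec, e.g. "-k2,2nr"
    key_fields : how many leading fields of each line form the key
    separator  : field separator; None keeps Hadoop's default (tab)
    """
    props = {
        "mapreduce.job.output.key.comparator.class": COMPARATOR_CLASS,
        "stream.num.map.output.key.fields": str(key_fields),
        "mapreduce.partition.keycomparator.options": sort_spec,
    }
    if separator is not None:
        props["stream.map.output.field.separator"] = separator
        props["mapreduce.map.output.key.field.separator"] = separator
    args = []
    for name, value in props.items():
        args += ["-D", f"{name}={value}"]
    return args
```

These arguments belong with the other generic options, ahead of `-input` and `-output`.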
Streaming jar options via MRJob
Dynamic jobconf
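One way to read "dynamic" is: compute the jobconf values at launch time instead of hard-coding them. The sketch below derives a reducer count from the input size and renders it as `--jobconf` arguments for an mrjob script. The sizing heuristic (one reducer per GiB, capped at 200) and both function names are assumptions for illustration. Inside a job class, mrjob also lets you override the `jobconf()` method to achieve a similar effect.

```python
import math


def dynamic_jobconf(input_bytes, bytes_per_reducer=1 << 30, max_reducers=200):
    """Pick a reducer count from the input size (illustrative heuristic)."""
    wanted = math.ceil(input_bytes / bytes_per_reducer)
    reducers = max(1, min(max_reducers, wanted))
    return {"mapreduce.job.reduces": str(reducers)}


def as_mrjob_args(jobconf):
    """Render a jobconf dict as repeated --jobconf NAME=VALUE arguments."""
    args = []
    for name in sorted(jobconf):
        args += ["--jobconf", f"{name}={jobconf[name]}"]
    return args
```

For example, `as_mrjob_args(dynamic_jobconf(5 * (1 << 30)))` yields the arguments to append to a `python job.py -r hadoop ...` command line.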
Outputting CSV or SequenceFile
Enable Snappy compression of job's output (Hadoop Streaming options)
-D mapred.output.compress=true
-D mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec
-D mapred.output.compression.type=BLOCK
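The `mapred.*` names above are the older property names; newer Hadoop releases still accept them but log a deprecation warning. The sketch below maps each one to its current `mapreduce.*` equivalent and renders the options as `-D` arguments. The helper name `snappy_output_args` is illustrative. The compression type setting only affects SequenceFile output.

```python
# Old property name -> current property name.
RENAMED_PROPERTIES = {
    "mapred.output.compress":
        "mapreduce.output.fileoutputformat.compress",
    "mapred.output.compression.codec":
        "mapreduce.output.fileoutputformat.compress.codec",
    "mapred.output.compression.type":
        "mapreduce.output.fileoutputformat.compress.type",
}

SNAPPY_SETTINGS = {
    "mapred.output.compress": "true",
    "mapred.output.compression.codec":
        "org.apache.hadoop.io.compress.SnappyCodec",
    "mapred.output.compression.type": "BLOCK",
}


def snappy_output_args(use_current_names=True):
    """Return -D arguments that enable Snappy compression of job output."""
    args = []
    for old_name, value in SNAPPY_SETTINGS.items():
        name = RENAMED_PROPERTIES[old_name] if use_current_names else old_name
        args += ["-D", f"{name}={value}"]
    return args
```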
Configuring options
Using distributed cache with Hadoop Streaming
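A sketch of both halves, assuming a small tab-separated lookup file. The generic `-files` option ships the file to every task, and the part after `#` is the name it gets in the task's working directory. The helper names, the example URI, and the alias are placeholders.

```python
def cache_file_args(uri, alias):
    """Build the generic -files option; the file appears as ./<alias> in tasks."""
    return ["-files", f"{uri}#{alias}"]


def load_lookup(lines):
    """Parse 'key<TAB>value' lines into a dict, skipping malformed lines."""
    table = {}
    for line in lines:
        key, sep, value = line.rstrip("\n").partition("\t")
        if sep:  # keep only lines that actually contain a tab
            table[key] = value
    return table
```

Inside the mapper the cached file is opened by its alias, for example `with open("lookup.tsv") as f: table = load_lookup(f)`. Like `-D`, the `-files` option has to come before the streaming options.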
Using distributed cache in mrjob