MultithreadedMapRunner (Apache Hadoop Main 3.4.1 API) (original) (raw)


@InterfaceAudience.Public
@InterfaceStability.Stable
public class MultithreadedMapRunner<K1,V1,K2,V2>
extends Object
implements MapRunnable<K1,V1,K2,V2>
Multithreaded implementation for MapRunnable.
It can be used instead of the default implementation, of MapRunner, when the Map operation is not CPU bound in order to improve throughput.
Map implementations using this MapRunnable must be thread-safe.
The Map-Reduce job has to be configured to use this MapRunnable class (using the JobConf.setMapRunnerClass method) and the number of threads the thread-pool can use with themapred.map.multithreadedrunner.threads property, its default value is 10 threads.