MultiFileSplit (Apache Hadoop Main 3.4.1 API) (original) (raw)


@InterfaceAudience.Public
@InterfaceStability.Stable
public class MultiFileSplit
extends CombineFileSplit
A sub-collection of input files. Unlike FileSplit, MultiFileSplit class does not represent a split of a file, but a split of input files into smaller sets. The atomic unit of split is a file.
MultiFileSplit can be used to implement RecordReader's, with reading one record per file.
See Also:
FileSplit, MultiFileInputFormat