sklearn.datasets.fetch_kddcup99 — scikit-learn 0.20.4 documentation (original) (raw)

subset : None, ‘SA’, ‘SF’, ‘http’, ‘smtp’

To return the corresponding classical subsets of kddcup 99. If None, return the entire kddcup 99 dataset.

data_home : string, optional

Specify another download and cache folder for the datasets. By default all scikit-learn data is stored in ‘~/scikit_learn_data’ subfolders. .. versionadded:: 0.19

shuffle : bool, default=False

Whether to shuffle dataset.

random_state : int, RandomState instance or None (default)

Determines random number generation for dataset shuffling and for selection of abnormal samples if subset=’SA’. Pass an int for reproducible output across multiple function calls. See Glossary.

percent10 : bool, default=True

Whether to load only 10 percent of the data.

download_if_missing : bool, default=True

If False, raise a IOError if the data is not locally available instead of trying to download the data from the source site.

return_X_y : boolean, default=False.

If True, returns (data, target) instead of a Bunch object. See below for more information about the data and target object.

New in version 0.20.