Characteristics
β Consists of .csv files:
β πΎπ·π·πππππ + with 125,973 data entries
β πΎπ·π·πππ π‘ + with 22,544 data entries
β 17.9% πππ‘π
β Difficulty levels: 21
β 49.66% of training set and 47.44% of test set are 21/21 level
β 43 columns: 1-41 β features, 42 β label, 43 β difficulty level
β Subsets: KDDTest-21, KDDTraini+_20Percent
β Their records are all included in the bigger datasets
10