Custom quantization borders and missing value modes

Contains

Custom quantization borders and the method for processing missing values for the dataset numerical features.

Format

• Each line contains information regarding a single border and optionally the missing values mode settings for the corresponding feature.

• Missing value modes are output for a feature if the following conditions are met at the same time:

• The chosen missing value mode for the feature differs from Forbidden.
• Missing values are present in the dataset.

The global missing value mode is specified in the --nan-mode (nan_mode) training parameter and can be overridden in the Custom quantization borders and missing value modes input file.

• Format of a single line:

<zero-based feature ID><\t><border value><\t><missing value mode>


Example

0<\t>0.25
0<\t>0.75
2<\t>0.3<\t>Max
2<\t>0.85<\t>Max