Model values

The results of applying the model on a dataset.

The output information and format depends on the machine learning problem being solved:

Regression

Contains
A number resulting from applying the model.
Header format

The first row in the output file contains a tab-separated description of data in the corresponding column.

Format:
[EvalSet:]SampleId<\t><Prediction type 1><\t>..<\t><Prediction type N>[<\t>Label]
  • EvalSet: is output for the evaluation file only if several validation datasets are input.
  • Prediction type is specified in the starting parameters and takes one or several of the following values:

    • Probability
    • Class
    • RawFormulaVal
  • Label is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.
Format

Each row starting from the second contains tab-separated information about a single object from the input dataset.

Format:
[<Validation dataset ID>:]<SampleId><\t><model value for prediction type 1><\t>..<\t><model value for prediction type N>[<\t><Label>]
  • Validation dataset ID is the serial number of the input validation dataset. The value is output if several validation datasets are input for model evaluation purposes.
  • SampleId is an alphanumeric ID of the object given in the Dataset description. If the identifiers are not set in the input data the objects are sequentially numbered, starting from zero.
  • model value for prediction type is the float number resulting from applying the model for the corresponding prediction type.
  • Label is the label value for the object. This value is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.
Example

The resulting file without alphanumeric IDs:

SampleId<\t>Probability<\t>Class
0<\t>0.8<\t>1
1<\t>0.3<\t>0

The resulting file for the cross-validation mode with alphanumeric IDs set:

SampleId<\t>Probability<\t>Label
LT<\t>75.1<\t>73.6
LV<\t>73.2<\t>72.15
PL<\t>78.22<\t>77.5

Classification

Contains

Depends on the selected output mode for approximated values of the formula:

  • RawFormulaVal —A number resulting from applying the model.
  • Probability — A number indicating the probability that the object belongs to the class (a sigmoid of the result of applying the model).
  • Class — The predicted class (output with the value “1” if the probability is higher than 0.5, otherwise “0”).
Header format

The first row in the output file contains a tab-separated description of data in the corresponding column.

Format:
[EvalSet:]SampleId<\t><Prediction type 1><\t>..<\t><Prediction type N>[<\t>Label]
  • EvalSet: is output for the evaluation file only if several validation datasets are input.
  • Prediction type is specified in the starting parameters and takes one or several of the following values:

    • Probability
    • Class
    • RawFormulaVal
  • Label is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.
Format

Each row in the output file contains tab-separated information about a single object from the input dataset.

Format:
[<Validation dataset ID>:]<SampleId><\t><model value>[<\t><Label>]
  • Validation dataset ID is the serial number of the input validation dataset. The value is output if several validation datasets are input for model evaluation purposes.
  • SampleId is an alphanumeric ID of the object given in the Dataset description. If the identifiers are not set in the input data the objects are sequentially numbered, starting from zero.
  • model value is the number resulting from applying the model for the corresponding prediction type.
  • Label is the label value for the object. This value is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.
Example

The resulting file for the RawFormulaVal cross-validation mode:

SampleId<\t>RawFormulaVal<\t>Label
0<\t>0.1685379577<\t>1
1<\t>0.2379356203<\t>1
2<\t>-0.04871954376<\t>1
The resulting file for the Probability cross-validation mode with alphanumeric IDs set for objects:
SampleId<\t>Probability<\t>Label
SampleId1<\t>0.5592048528<\t>1
SampleId2<\t>0.5595881735<\t>1
SampleId3<\t>0.5592048528<\t>1
The resulting file for the Class mode:
SampleId<\t>Class
0<\t>0
1<\t>1
2<\t>1
3<\t>s0

Multiclassification

Contains

Depends on the selected output mode for approximated values of the formula:

  • RawFormulaVal — A list of numbers resulting from applying the model. Values for the different classes are tab-separated.
  • Probability — A list of numbers indicating the probability that the object belongs to each of the classes. Values for the different classes are tab-separated.
  • Class —The number of the class that the object most likely belongs to.
Header format

The first row in the output file contains a tab-separated description of data in the corresponding column.

Format:

[EvalSet:]SampleId</t><PredictionType1>[:Class=<ClassID>]</t>..</t><PredictionTypeN>:Class=<ClassID>[<\t>Label]
  • EvalSet: is output for the evaluation file only if several validation datasets are input.
  • Prediction type is specified in the starting parameters and takes one or several of the following values:

    • Probability
    • Class
    • RawFormulaVal
  • ClassID is the identifier of the class being described in the column. It is omitted for the Class prediction type.
  • Label is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.

The number of “Prediction type–ClassID” pairs depends on the input parameters. It is always limited to one pair for the Class prediction type.

Format

Each row in the output file contains tab-separated information about a single object from the input dataset.

Format:
[Validation dataset ID:]<SampleId><\t><Model value 1>..<Model value N>[<\t><Label>]
  • Validation dataset ID is the serial number of the input validation dataset. The value is output if several validation datasets are input for model evaluation purposes.
  • SampleId is an alphanumeric ID of the object given in the Dataset description. If the identifiers are not set in the input data the objects are sequentially numbered, starting from zero.
  • Model value is a number or a list of numbers depending on the selected output mode for approximated values of the formula for the corresponding prediction type.
  • Label is the label value for the object. This value is only output for the validation dataset in training mode and the cross-validation dataset in cross-validation mode if it is specified in the input dataset.
Example

The resulting file for prediction in  Class mode with alphanumeric IDs set for objects:

SampleId<\t>Class
SampleId1<\t>2
SampleId2<\t>1
SampleId3<\t>2

The resulting file for the Probability cross-validation mode:

SampleId<\t>Probability:Class=0<\t>CProbability:Class=1<\t>Probability:Class=2<\t>Label
1<\t>0.3232259635</t>0.315456703</t>0.3613173334</t>2
2<\t>0.335771253</t>0.3247524917</t>0.3394762553</t>0
3<\t>0.3181931812</t>0.3242628483</t>0.3575439705</t>1

The resulting file for the RawFormulaVal cross-validation mode:

SampleId<\t>RawFormulaVal:Class=0<\t>RawFormulaVal:Class=1<\t>RawFormulaVal:Class=2<\t>Label
1<\t>0.001232427024</t>-0.04141999431</t>0.04018756728</t>2
2<\t>-0.04822847313</t>-0.05520994445</t>0.1034384176</t>2
3<\t>-0.05717915565</t>-0.06548867981</t>0.1226678355</t>2
The resulting file for prediction in RawFormulaVal and Probability modes:
SampleId<\t>Probability:Class=0<\t>Probability:Class=1<\t>RawFormulaVal:Class=0<\t>RawFormulaVal:Class=1
1<\t>0.01593276944<\t>0.02337982256<\t>-1.494255509<\t>-1.110760101
2<\t>0.4060707366<\t>0.09565861257<\t>0.4137085351<\t>-1.032033103
3<\t>0.006235130003<\t>0.01759049831<\t>-2.03020042<\t>-0.9930409613