Let's assume that it is required to solve a classification problem on a dataset with grouped objects. For example, it may be required to predict user clicks on a search engine results page.
Generally, this task can be solved by the Logloss function: Logloss=i=1∑Nwi1group∑obj_in_group∑wi(ti⋅log(pi)+(1−ti)⋅log(1−pi))
ti is the label value for the i-th object (from the input data for training). Possible values are in the range [0;1].
Since the internal structure of the data is known, it can be assumed that the predictions in various groups are different. This can be modeled by adding a shift_group to each formula prediction for a group: pˉi=σ(ai+group_shift)
The shift_group parameter is jointly optimized for each group during the training.
In this case, the Logloss formula for grouped objects takes the following form: Loglossgroup=i=1∑Nwi1group∑obj_in_group∑wi(ti⋅log(pˉi)+(1−ti)⋅log(1−pˉi))
The QueryCrossEntropy metric is calculated as follows: QueryCrossEntropy(α)=(1−α)⋅LogLoss+α⋅LogLossgroup
User-defined parameters
Parameter: alpha
Description
The coefficient used in quantile-based losses. Defines the rules for mixing the