Text processing

Text Processing is used to solve different tasks, including but not limited to:

  • Use text data by machine learning algorithms.

    Raw texts can not be handled by machine learning algorithms and therefore must be preprocessed.

  • Improve the quality of resulting models.

    The usage of different types of text preprocessing (i.e., excluding numbers from the text or replacing upper case words with lower-case ones) can lead to significant improvements in the quality of the resulting model.

CatBoost provides the following classes for text processing: