Please use this identifier to cite or link to this item: https://dspace.chmnu.edu.ua/jspui/handle/123456789/2909
Title: Approach to Identification of Anomalous Values in Analysis Tasks and Data Pre-processing
Authors: Kalinina, I.
Bidyuk, P.
Gozhyj, A.
Gozhyi, V.
Nechakhin, V.
Keywords: ensemble algorithms
iterative method
machine learning methods
methodology for detecting abnormal values
metric methods
model tests
pre-processing of data
statistical tests
task substitution methods
Issue Date: 2025
Publisher: Springer Science and Business Media Deutschland GmbH
Abstract: The article presents an approach to the identification of anomalous values at the stage of data analysis and pre-processing when solving machine learning problems. Detection and identification of anomalous values in the data are important for the efficiency of the preprocessing of the data set. A methodology for identifying and processing emissions in data sets has been developed. The methodology was developed within the framework of a systematic approach to solving tasks of analysis and pre-processing of data. It consists of three steps. In the first step, methods are used to detect outliers in the data set. In the second step, the causes of emissions are analyzed. In the third step, methods for processing abnormal values are chosen. The following methods of detecting and processing extreme values are considered in the work: statistical tests, model tests, metric methods, iterative methods, task substitution methods, machine learning methods, algorithm ensembles. Examples of the use of all methods of identifying emissions in the data sets described in the first step of the algorithm were analyzed. The importance of a systematic approach to the detection and processing of anomalous values at the stage of data analysis and pre-processing in machine learning tasks is proven.
Description: Kalinina, I., Bidyuk, P., Gozhyj, A., Gozhyi, V., & Nechakhin, V. (2025). Approach to Identification of Anomalous Values in Analysis Tasks and Data Pre-processing. In: Babichev, S., Lytvynenko, V. (Eds) Lecture Notes on Data Engineering and Communications Technologies, 244, 114–133. Springer, Cham. DOI: 10.1007/978-3-031-88483-2_6
URI: https://www.scopus.com/pages/publications/105010175280
https://link.springer.com/chapter/10.1007/978-3-031-88483-2_6
https://dspace.chmnu.edu.ua/jspui/handle/123456789/2909
ISBN: 978-3-031-88482-5 print
978-3-031-88483-2 online
ISSN: 23674512
Appears in Collections:Публікації науково-педагогічних працівників ЧНУ імені Петра Могили у БД Scopus

Files in This Item:
File Description SizeFormat 
Kalinina, I., Bidyuk, P., Gozhyj, A., Gozhyi, V., & Nechakhin, V..txt477 BTextView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.