Applying the Deep Learning Techniques to Solve Classification Tasks Using Gene Expression Data

Babichev, S.; Liakh, I.; Kalinina, I.

Please use this identifier to cite or link to this item: https://dspace.chmnu.edu.ua/jspui/handle/123456789/1879

Title:	Applying the Deep Learning Techniques to Solve Classification Tasks Using Gene Expression Data
Authors:	Babichev, S.; Liakh, I.; Kalinina, I.
Keywords:	Convolution neural network; LSTM recurrent neural network; GRU recurrent neural network; gene expression data classification; hybrid model; classification quality criteria; cancer disease
Issue Date:	2024
Publisher:	IEEE
Abstract:	This manuscript explores the application of deep learning (DL) techniques for classifying gene expression data. A key aspect of our research is the comparative analysis of various DL neural network architectures, including Convolution Neural Networks (CNN), Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) Recurrent Neural Networks (RNN), as well as hybrid models that combine these networks. We applied the Bayesian optimization algorithm with 5-fold cross-validation to tune the hyperparameters, which is crucial for DL algorithm performance. We have also advanced the methods for applying RNNs to gene expression data, focusing in particular on the LSTM and GRU types. Our study also introduces a novel hybrid quality criterion for data classification, calculated as a weighted sum of partial quality criteria and incorporating an integrated F1-score derived through the Harrington desirability method. Furthermore, we investigate hybrid models that leverage various DL methods, enhancing decision-making objectivity in sample identification. These hybrid models process information step by step, first applying different DL models to the gene expression data and then passing the results to a CART-based classifier for the final decision. Our experiments, performed on gene expression data from patients with eight cancer types and one subset of normal samples (without cancer), demonstrated that GRU-RNN-based models, particularly a two-layer GRU-RNN, achieved the highest classification efficacy, with an accuracy of 97.8% on the test dataset. The performance of this model exceeded that of the other models, whose accuracy varied between 96.6% and 97.3%. Comparative analysis with other studies in this field suggests that the proposed techniques are more effective than those in similar research on applying DL models to cancer-type diagnosis.
Description:	Babichev, S., Liakh, I., & Kalinina, I. (2024). Applying the Deep Learning Techniques to Solve Classification Tasks Using Gene Expression Data. IEEE Access, 12, 28437–28448. DOI: 10.1109/ACCESS.2024.3368070
URI:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186090110&doi=10.1109%2fACCESS.2024.3368070&partnerID=40&m https://ieeexplore.ieee.org/document/10440636 https://dspace.chmnu.edu.ua/jspui/handle/123456789/1879
ISSN:	e-ISSN: 2169-3536
Appears in Collections:	Publications of the research and teaching staff of Petro Mohyla Black Sea National University (ChNU) in the Scopus database
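
The abstract describes a hybrid quality criterion computed as a weighted sum of partial criteria, one of which is an integrated F1-score obtained with the Harrington desirability method. This record does not give the formulas, so the following is only a minimal sketch of one common form of that method: the one-sided desirability d = exp(-exp(-Y)) with a linear scaling of the raw value, aggregated by a geometric mean. The function names, the anchor points (low=0.5, high=0.95), and the weights are hypothetical and are not taken from the paper.

```python
import math


def harrington_desirability(value, low, high, d_low=0.2, d_high=0.8):
    """One-sided Harrington desirability d = exp(-exp(-Y)).

    The raw value is mapped linearly to Y so that `low` yields d_low and
    `high` yields d_high. These anchor points are assumptions made for
    illustration, not values from the paper.
    """
    y_low = -math.log(-math.log(d_low))
    y_high = -math.log(-math.log(d_high))
    slope = (y_high - y_low) / (high - low)
    y = y_low + slope * (value - low)
    return math.exp(-math.exp(-y))


def integrated_f1(per_class_f1, low=0.5, high=0.95):
    """Geometric mean of the per-class desirabilities (generalized desirability)."""
    ds = [harrington_desirability(f, low, high) for f in per_class_f1]
    return math.exp(sum(math.log(d) for d in ds) / len(ds))


def hybrid_criterion(partials, weights):
    """Weighted sum of partial quality criteria, with weights normalized to 1."""
    total = sum(weights)
    return sum(w * p for w, p in zip(weights, partials)) / total


# Hypothetical per-class F1 values and weights, for illustration only.
f1_by_class = [0.97, 0.95, 0.99]
accuracy = 0.978
score = hybrid_criterion([accuracy, integrated_f1(f1_by_class)], [0.5, 0.5])
```

The geometric mean penalizes a model that performs poorly on any single class more strongly than an arithmetic mean would, which is one reason a desirability-based aggregate can be preferred for multi-class data.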

Files in This Item:

File	Description	Size	Format
Applying_the_Deep_Learning_Techniques_to_Solve_Classification_Tasks_Using_Gene_Expression_Data.pdf		2.09 MB	Adobe PDF
Babichev, S., Liakh, I., Kalinina, I..pdf		59.43 kB	Adobe PDF
