Please use this identifier to cite or link to this item: https://dspace.chmnu.edu.ua/jspui/handle/123456789/1879
Title: Applying the Deep Learning Techniques to Solve Classification Tasks Using Gene Expression Data
Authors: Babichev, S.
Liakh, I.
Kalinina, I.
Keywords: Convolution neural network
LSTM recurrent neural network
GRU recurrent neural network
gene expression data
classification
hybrid model
classification quality criteria
cancer disease
Issue Date: 2024
Publisher: IEEE
Abstract: This manuscript explores the application of deep learning (DL) techniques for classifying gene expression data. A key aspect of our research is the comparative analysis of various DL neural network architectures, including Convolution Neural Networks (CNN), Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) Recurrent Neural Networks (RNN), as well as hybrid models that combine these networks. We applied the Bayesian optimization algorithm using 5-fold cross-validation for optimal hyperparameter tuning, which is crucial for DL algorithm performance. Significantly, we have advanced the methods for applying RNNs in processing gene expression data, particularly focusing on LSTM and GRU types. Our study introduces also a novel hybrid quality criterion for data classification, calculated as a weighted sum of partial quality criteria, incorporating an integrated F1-score derived through the Harrington desirability method. Furthermore, we investigate hybrid models that leverage various DL methods, enhancing decision-making objectivity in sample identification. This model uses a step-by-step information processing procedure, initially applying different DL models to gene expression data and subsequently processing these through a CART-based classifier for final decision-making. Our experiments, performed on gene expression data from patients with eight cancer types and one subset with normal samples (without cancer), demonstrated that GRU-RNN-based models, particularly a two-layer GRU-RNN, achieved the highest classification efficacy, with an accuracy of 97.8% on the test dataset. The performance of this model exceeded that of other models, whose accuracy varied between 96.6% and 97.3%. Comparative analysis with other studies in this field suggests that the proposed techniques demonstrate higher efficacy compared to similar research regarding the application of DL models for cancer-type diagnosis.
Description: Babichev, S., Liakh, I., & Kalinina, I. (2024). Applying the Deep Learning Techniques to Solve Classification Tasks Using Gene Expression Data. IEEE Access, 12, 28437–28448. DOI: 10.1109/ACCESS.2024.3368070
URI: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186090110&doi=10.1109%2fACCESS.2024.3368070&partnerID=40&m
https://ieeexplore.ieee.org/document/10440636
https://dspace.chmnu.edu.ua/jspui/handle/123456789/1879
ISSN: e-ISSN: 2169-3536
Appears in Collections:Публікації науково-педагогічних працівників ЧНУ імені Петра Могили у БД Scopus



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.