On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data

Citation

Ng, Grace Yee Lin and Tan, Shing Chiang and Ong, Chia Sui (2023) On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data. PLOS ONE, 18 (10). e0292961. ISSN 1932-6203

[img] Text
2.pdf - Published Version
Restricted to Repository staff only

Download (1MB)

Abstract

Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872.

Item Type: Article
Subjects: H Social Sciences > HF Commerce > HF5001-6182 Business > HF5548.7-5548.85 Industrial psychology
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 07 Dec 2023 01:59
Last Modified: 07 Dec 2023 01:59
URII: http://shdl.mmu.edu.my/id/eprint/11923

Downloads

Downloads per month over past year

View ItemEdit (login required)