Identification of genomic markers correlated with sensitivity in solid tumors to Dasatinib using sparse principal components

Hossain, Ahmed and Khan, Hafiz T.A. ORCID: https://orcid.org/0000-0002-1817-3730 (2016) Identification of genomic markers correlated with sensitivity in solid tumors to Dasatinib using sparse principal components. Journal of Applied Statistics, 43 (14). pp. 2538-2549. ISSN 0266-4763

[thumbnail of Dasatinib_AH.pdf]
Preview
PDF
Dasatinib_AH.pdf - Accepted Version

Download (2MB) | Preview

Abstract

Differential analysis techniques are commonly used to offer scientists a dimension reduction procedure and an interpretable gateway to variable selection, especially when confronting high-dimensional genomic data. Huang et al. used a gene expression profile of breast cancer cell lines to identify genomic markers which are highly correlated with in vitro sensitivity of a drug Dasatinib. They considered three statistical methods to identify differentially expressed genes and finally used the results from the intersection. But the statistical methods that are used in the paper are not sufficient to select the genomic markers. In this paper we used three alternative statistical methods to select a combined list of genomic markers and compared the genes that were proposed by Huang et al. We then proposed to use sparse principal component analysis (Sparse PCA) to identify a final list of genomic markers. The Sparse PCA incorporates correlation into account among the genes and helps to draw a successful genomic markers discovery. We present a new and a small set of genomic markers to separate out the groups of patients effectively who are sensitive to the drug Dasatinib. The analysis procedure will also encourage scientists in identifying genomic markers that can help to separate out two groups.

Item Type: Article
Identifier: 10.1080/02664763.2016.1142941
Additional Information: This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Applied Statistics on 12/02/16, available online: http://www.tandfonline.com/10.1080/02664763.2016.1142941
Keywords: Differential gene expression; area under receiver operating characteristic curve; principal component analysis; sparse principal component analysis, clustering.
Subjects: Medicine and health > Health promotion and public health
Medicine and health > Health promotion and public health > Healthcare education
Medicine and health
Social sciences
Depositing User: Hafiz T.A. Khan
Date Deposited: 10 Jun 2017 10:30
Last Modified: 04 Nov 2024 12:07
URI: https://repository.uwl.ac.uk/id/eprint/3418

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item

Menu