Repository: Freie Universität Berlin, Math Department

Epithelial Mesenchymal Transition Network-based Feature Engineering in Lung Adenocarcinoma Prognosis Prediction Using Multiple Omic Data

Shao, Borong and Carlo, Cannistraci and Conrad, T. O. F. (2017) Epithelial Mesenchymal Transition Network-based Feature Engineering in Lung Adenocarcinoma Prognosis Prediction Using Multiple Omic Data. Genomics and Computational Biology, 3 (3). ISSN 2365-7154

Full text not available from this repository.

Official URL: http://dx.doi.org/10.18547/gcb.2017.vol3.iss3.e57

Abstract

Epithelial mesenchymal transition (EMT) process has been shown as highly relevant to cancer prognosis. However, although different biological network-based biomarker identification methods have been proposed to predict cancer prognosis, EMT network has not been directly used for this purpose. In this study, we constructed an EMT regulatory network consisting of 87 molecules and tried to select features that are useful for prognosis prediction in Lung Adenocarcinoma (LUAD). To incorporate multiple molecular profiles, we obtained four types of molecular data including mRNA-Seq, copy number alteration (CNA), DNA methylation, and miRNA-Seq data from The Cancer Genome Atlas. The data were mapped to the EMT network in three alternative ways: mRNA-Seq and miRNA-Seq, DNA methylation, and CNA and miRNA-Seq. Each mapping was employed to extract five different sets of features using discretization and network-based biomarker identification methods. Each feature set was then used to predict prognosis with SVM and logistic regression classifiers. We measured the prediction accuracy with AUC and AUPR values using 10 times 10-fold cross validation. For a more comprehensive evaluation, we also measured the prediction accuracies of clinical features, EMT plus clinical features, randomly picked 87 molecules from each data mapping, and using all molecules from each data type. Counter-intuitively, EMT features do not always outperform randomly selected features and the prediction accuracies of the five feature sets are mostly not significantly different. Clinical features are shown to give the highest prediction accuracies. In addition, the prediction accuracies of both EMT features and random features are comparable as using all features (more than 17,000) from each data type.

Item Type:Article
Subjects:Mathematical and Computer Sciences > Mathematics > Applied Mathematics
Mathematical and Computer Sciences > Artificial Intelligence > Machine Learning
Medicine and Dentistry > Clinical Medicine
Divisions:Department of Mathematics and Computer Science > Institute of Mathematics
Department of Mathematics and Computer Science > Institute of Mathematics > BioComputing Group
Department of Mathematics and Computer Science > Institute of Mathematics > Comp. Proteomics Group
ID Code:1970
Deposited By: Admin Administrator
Deposited On:25 Oct 2016 14:30
Last Modified:13 Sep 2017 19:15

Repository Staff Only: item control page