-
Biomarker Selection and Classification of -Omics Data Using a Two-Step Bayes Classification Framework
- Back
Metadata
Document Title
Biomarker Selection and Classification of -Omics Data Using a Two-Step Bayes Classification Framework
Author
Assawamakin A, Prueksaaroon S, Kulawonganunchai S, Shaw PJ, Varavithya V, Ruangrajitpakorn T, Tongsima S
Name from Authors Collection
Affiliations
Mahidol University; Thammasat University; National Science & Technology Development Agency - Thailand; National Center Genetic Engineering & Biotechnology (BIOTEC); National Science & Technology Development Agency - Thailand; National Electronics & Computer Technology Center (NECTEC)
Type
Article
Source Title
BIOMED RESEARCH INTERNATIONAL
Year
2013
Volume
2013
Open Access
Green Published, Green Submitted, gold
Publisher
HINDAWI LTD
DOI
10.1155/2013/148014
Format
Abstract
Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naive Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naive Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naive Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naive Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time.
Funding Sponsor
National Electronics and Computer Technology Center (NECTEC); Faculty of Engineering, Thammasat University; National Center for Genetic Engineering and Biotechnology (BIOTEC) platform technology; National Science and Technology Development Agency (NSTDA); Thailand and TRF Career Development Grant [RSA5480026]; National Infrastructure program under National Science and Technology Development Agency (NSTDA) [C2-14]
License
CC BY
Rights
Authors
Publication Source
WOS