Title: Inferring pathway activity toward precise disease classification.
Authors: Lee, Eunjung; Chuang, Han-Yu; Kim, Jong-Won; Ideker, Trey; Lee, Doheon
Published In PLoS Comput Biol, (2008 Nov)
Abstract: The advent of microarray technology has made it possible to classify disease states based on gene expression profiles of patients. Typically, marker genes are selected by measuring the power of their expression profiles to discriminate among patients of different disease states. However, expression-based classification can be challenging in complex diseases due to factors such as cellular heterogeneity within a tissue sample and genetic heterogeneity across patients. A promising technique for coping with these challenges is to incorporate pathway information into the disease classification procedure in order to classify disease based on the activity of entire signaling pathways or protein complexes rather than on the expression levels of individual genes or proteins. We propose a new classification method based on pathway activities inferred for each patient. For each pathway, an activity level is summarized from the gene expression levels of its condition-responsive genes (CORGs), defined as the subset of genes in the pathway whose combined expression delivers optimal discriminative power for the disease phenotype. We show that classifiers using pathway activity achieve better performance than classifiers based on individual gene expression, for both simple and complex case-control studies including differentiation of perturbed from non-perturbed cells and subtyping of several different kinds of cancer. Moreover, the new method outperforms several previous approaches that use a static (i.e., non-conditional) definition of pathways. Within a pathway, the identified CORGs may facilitate the development of better diagnostic markers and the discovery of core alterations in human disease.
PubMed ID: 18989396
MeSH Terms: Artificial Intelligence; Biomarkers, Tumor/classification*; Biomarkers, Tumor/genetics*; Biomarkers, Tumor/metabolism; Diagnostic Techniques and Procedures*; Gene Expression Profiling/methods*; Gene Expression/physiology*; HeLa Cells; Humans; Neoplasms/classification; Neoplasms/genetics; Neoplasms/metabolism; Pattern Recognition, Automated/methods; Prognosis; Signal Transduction/genetics*; Systems Integration