Abstract
Genome-wide gene expression information has been very useful for understanding cancer at the molecular level. In particular, breast cancer has been widely studied by utilizing a large amount of transcriptome data. Although statistical selection of differentially expressed genes, e.g., PAM50, has been successful to classify breast cancer subtypes, understanding breast cancer in terms of biological functions or pathways is still limited. Thus, it is essential to develop a tailored model that unravels breast cancer mechanisms by identifying disease-specific functional units of biological pathways and apply the model for breast cancer prognosis. In this paper, a systematic characterization of breast cancer functional units or ‘subsystems’ is presented. We propose a novel concept of decomposing biological pathways into subsystems by utilizing protein interaction network, pathway information, and RNA-seq data. Subsystem activation score (SAS) was developed to measure the degree of activation for each subsystem and each patient. This method revealed distinctive genome-wide activation patterns or landscape of subsystems that are differentially activated among samples and among breast cancer subtypes. Then, we used SAS information for prognostic modeling by performing the classification and regression tree (CART) analysis. Eleven subgroups of patients, defined by 10 most significant subsystems, were identified with the maximal discrepancy in survival outcome. Our model not only defined patient subgroups with similar survival outcomes, but also provided patient-specific decision paths determined by subsystem activation status, suggesting functionally informative gene sets of breast cancer.
Original language | English |
---|---|
Pages (from-to) | 81-89 |
Number of pages | 9 |
Journal | Methods |
Volume | 110 |
DOIs | |
State | Published - Nov 1 2016 |
Externally published | Yes |
Funding
This research was supported by a grant of the Collaborative Genome Program for Fostering New Post-Genome industry through the National Research Foundation of Korea (NRF) funded by the Ministry of Science ICT & Future Planning (NRF-2014M3C9A3063541), a grant of Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (No.NRF-2012M3C4A7033341) and a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) funded by the Ministry of Health & Welfare, Republic of Korea (HI14C3405).
Funders | Funder number |
---|---|
Ministry of Health & Welfare | HI14C3405 |
Ministry of Science, ICT and Future Planning | NRF-2014M3C9A3063541 |
Korea Health Industry Development Institute | |
National Research Foundation of Korea |
Keywords
- Breast cancer
- Cohort stratification
- Prognostic modeling
- Protein interaction network
- Subsystem
- Subsystem activation score