TY - GEN
T1 - Finding Single and Multi-Gene Expression Patterns for Psoriasis Using Sub-Pattern Frequency Pruning
AU - Smith, Kenneth
AU - Lea, Jamie
AU - Climer, Sharlee
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Biomarker identification, such as gene expression, is used in several areas of medical research, including aiding in disease prediction and treatment. However, most gene expression analysis focuses on differently expressed genes, ignoring patterns in which the co-expression of non-differently expressed genes are associated with disease risk. In this manuscript, we make three contributions. First, we present an alternative definition for differential expression which captures associations that are missed using mean- or median-based methods, such as fold change. Second, we introduce an algorithm for identifying all patterns of analytes associated with a given phenotype within a given threshold of optimal by extensively pruning the solution space. Third, our demonstration on psoriasis gene expression data yields 6320 highly significant gene expression patterns associated with this common disease that are comprised of 2334 unique genes worthy of further exploration. Interestingly, these genes include 1021 genes that are not differentially expressed when examined in isolation. Our approach is computationally efficient and our open-source software is freely available. This method holds potential for biomarker discovery for diverse phenotypes and is also applicable for identifying patterns hidden within non-biological real-valued data sets.
AB - Biomarker identification, such as gene expression, is used in several areas of medical research, including aiding in disease prediction and treatment. However, most gene expression analysis focuses on differently expressed genes, ignoring patterns in which the co-expression of non-differently expressed genes are associated with disease risk. In this manuscript, we make three contributions. First, we present an alternative definition for differential expression which captures associations that are missed using mean- or median-based methods, such as fold change. Second, we introduce an algorithm for identifying all patterns of analytes associated with a given phenotype within a given threshold of optimal by extensively pruning the solution space. Third, our demonstration on psoriasis gene expression data yields 6320 highly significant gene expression patterns associated with this common disease that are comprised of 2334 unique genes worthy of further exploration. Interestingly, these genes include 1021 genes that are not differentially expressed when examined in isolation. Our approach is computationally efficient and our open-source software is freely available. This method holds potential for biomarker discovery for diverse phenotypes and is also applicable for identifying patterns hidden within non-biological real-valued data sets.
KW - biomarkers
KW - co-expression analysis
KW - gene expression
KW - psoriasis
UR - http://www.scopus.com/inward/record.url?scp=85125194583&partnerID=8YFLogxK
U2 - 10.1109/BIBM52615.2021.9669803
DO - 10.1109/BIBM52615.2021.9669803
M3 - Conference contribution
AN - SCOPUS:85125194583
T3 - Proceedings - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021
SP - 2322
EP - 2329
BT - Proceedings - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021
A2 - Huang, Yufei
A2 - Kurgan, Lukasz
A2 - Luo, Feng
A2 - Hu, Xiaohua Tony
A2 - Chen, Yidong
A2 - Dougherty, Edward
A2 - Kloczkowski, Andrzej
A2 - Li, Yaohang
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021
Y2 - 9 December 2021 through 12 December 2021
ER -