TY - GEN
T1 - Multi-stage framework to infer protein functional modules from mass spectrometry pull-down data with assessment of biological relevance
AU - Park, Byung Hoon
AU - Zhang, Bing
AU - Karpinets, Tatiana
AU - Samatova, Nagiza F.
PY - 2007
Y1 - 2007
N2 - Protein functional modules are fundamental units in protein interaction networks. High-throughput Mass Spectrometry (MS) technology has become valuable for discovery of protein functional modules. Yet, their computational inference from MS pull-down data and biological significance evaluation are still challenging. This paper introduces an integrated multi-step framework for (1) assessing protein-protein interaction affinities, (2) constructing a genome-wide protein association map, (3) finding putative protein functional modules, and (4) evaluating their biological relevance. The protein affinity score utilizes copurification pattern of two proteins and adopts an information theoretic-approach to build the protein affinity map. Putative protein modules are then derived using a graph-theoretical approach. A two-stage statistical procedure assesses biological relevance of identified modules. On Saccharomyces cerevisiae's pull-down data (Nature, vol. 415, pp. 141-7, 2002), the scoring scheme outperformed other methods by at least 10% in F1-measure, and statistical tests identified 489 protein modules enriched in all of three general GO categories with p-values less than 0.05.
AB - Protein functional modules are fundamental units in protein interaction networks. High-throughput Mass Spectrometry (MS) technology has become valuable for discovery of protein functional modules. Yet, their computational inference from MS pull-down data and biological significance evaluation are still challenging. This paper introduces an integrated multi-step framework for (1) assessing protein-protein interaction affinities, (2) constructing a genome-wide protein association map, (3) finding putative protein functional modules, and (4) evaluating their biological relevance. The protein affinity score utilizes copurification pattern of two proteins and adopts an information theoretic-approach to build the protein affinity map. Putative protein modules are then derived using a graph-theoretical approach. A two-stage statistical procedure assesses biological relevance of identified modules. On Saccharomyces cerevisiae's pull-down data (Nature, vol. 415, pp. 141-7, 2002), the scoring scheme outperformed other methods by at least 10% in F1-measure, and statistical tests identified 489 protein modules enriched in all of three general GO categories with p-values less than 0.05.
UR - http://www.scopus.com/inward/record.url?scp=49049104160&partnerID=8YFLogxK
U2 - 10.1109/BIBM.2007.14
DO - 10.1109/BIBM.2007.14
M3 - Conference contribution
AN - SCOPUS:49049104160
SN - 0769530311
SN - 9780769530314
T3 - Proceedings - 2007 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2007
SP - 223
EP - 229
BT - Proceedings - 2007 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2007
T2 - 2007 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2007
Y2 - 2 November 2007 through 4 November 2007
ER -