Abstract
Functionally similar IDPs (intrinsically disordered proteins) often have little sequence similarity. This is in stark contrast to folded proteins and poses a challenge for the inverse problem, functional classification of IDPs using sequence alignment. The problem is further compounded because of the lack of structure in IDPs, preventing structural alignment as an alternate tool for classification. Recent advances in heteropolymer theory unveiled a powerful set of sequence-patterning metrics bridging molecular interaction with chain conformation. Focusing only on charge patterning, these set of metrics yield a sequence charge decoration matrix (SCDM). SCDMs can potentially identify functionally similar IDPs not apparent from sequence alignment alone. Here, we illustrate how these information-rich “molecular blueprints” encoded in SCDMs can be used for functional classification of IDPs with specific application in three protein families—Ste50, PSC, and RAM—in which electrostatics is known to be important. For both the Ste50 and PSC protein family, the set of metrics appropriately classifies proteins in functional and nonfunctional groups in agreement with experiment. Furthermore, our algorithm groups synthetic variants of the disordered RAM region of the Notch receptor protein—important in gene expression—in reasonable accordance with classification based on experimentally measured binding constants of RAM and transcription factor. Taken together, the novel classification scheme reveals the critical role of a high-dimensional set of metrics—manifest in self-interaction maps and topology—in functional annotation of IDPs even when there is low sequence homology, providing the much-needed alternate to a traditional sequence alignment tool.
Original language | English |
---|---|
Pages (from-to) | 1860-1868 |
Number of pages | 9 |
Journal | Biophysical Journal |
Volume | 120 |
Issue number | 10 |
DOIs | |
State | Published - May 18 2021 |
Externally published | Yes |
Funding
We acknowledge support from the National Institutes of Health ( R15GM128162-01A1 and R01GM138901 ) and Knoebel Institute for Health Aging at the University of Denver .
Funders | Funder number |
---|---|
Knoebel Institute for Health Aging at the University of Denver | |
National Institutes of Health | R01GM138901 |
National Institute of General Medical Sciences | R15GM128162 |