Learning protein folding energy functions

Wei Guan, Arkadas Ozakin, Alexander Gray, Jose Borreguero, Shashi Pandit, Anna Jagielska, Liliana Wroblewska, Jeffrey Skolnick

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

A critical open problem in ab initio protein folding is protein energy function design, which pertains to defining the energy of protein conformations in a way that makes folding most efficient and reliable. In this paper, we address this issue as a weight optimization problem and utilize a machine learning approach, learning-to-rank, to solve this problem. We investigate the ranking-via-classification approach, especially the RankingSVM method and compare it with the state-of-theart approach to the problem using the MINUIT optimization package. To maintain the physicality of the results, we impose non-negativity constraints on the weights. For this we develop two efficient non-negative support vector machine (NNSVM) methods, derived from L2-norm SVM and L1-norm SVMs, respectively. We demonstrate an energy function which maintains the correct ordering with respect to structure dissimilarity to the native state more often, is more efficient and reliable for learning on large protein sets, and is qualitatively superior to the current state-of-the-art energy function.

Original languageEnglish
Title of host publicationProceedings - 11th IEEE International Conference on Data Mining, ICDM 2011
Pages1062-1067
Number of pages6
DOIs
StatePublished - 2011
Externally publishedYes
Event11th IEEE International Conference on Data Mining, ICDM 2011 - Vancouver, BC, Canada
Duration: Dec 11 2011Dec 14 2011

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference11th IEEE International Conference on Data Mining, ICDM 2011
Country/TerritoryCanada
CityVancouver, BC
Period12/11/1112/14/11

Keywords

  • Ab initio protein folding
  • Energy function
  • Learningto-rank
  • Non-negativity constrained SVM optimization
  • Support vector machine

Fingerprint

Dive into the research topics of 'Learning protein folding energy functions'. Together they form a unique fingerprint.

Cite this