Data-driven, data-intensive computing for modelling and analysis of biological networks: Application to bioethanol production

Byung Hoon Park, Nagiza F. Samatova, Tatiana Karpinets, Andrew Jallouk, Scott Molony, Scott Horton, Steven Arcangeli

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Modelling biological networks is inherently data-driven and data-intensive. The combinatorial nature of this type of modelling, however, requires new methods capable of dealing with the enormous size and irregularity of the search. Searching via "backtracking" is one possible solution that avoids exhaustive searches by constraining the search space to the subspace of feasible solutions. Despite its wide use in many combinatorial optimization problems, there are currently few parallel implementations of backtracking capable of effectively dealing with the memory-intensive nature of the process and the extremely unbalanced loads present. In this paper, a parallel, scalable, and memory-efficient backtracking algorithm within the context of maximal clique enumeration is presented, and its applicability to large-scale biological networks aimed at studying the mechanisms for efficient bioethanol production is discussed.

Original languageEnglish
Article number012061
JournalJournal of Physics: Conference Series
Volume78
Issue number1
DOIs
StatePublished - Jul 1 2007

Fingerprint

Dive into the research topics of 'Data-driven, data-intensive computing for modelling and analysis of biological networks: Application to bioethanol production'. Together they form a unique fingerprint.

Cite this