An SVM-based algorithm for identification of photosynthesis-specific genome features

G. X. Yu, G. Ostrouchov, A. Geist, N. F. Samatova

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

34 Scopus citations

Abstract

This paper presents a novel algorithm for identification and functional characterization of "key" genome features responsible for a particular biochemical process of interest. The central idea is that individual genome features are identified as "key" features if the discrimination accuracy between two classes of genomes with respect to a given biochemical process is sufficiently affected by the inclusion or exclusion of these features. In this paper, genome features are defined by high-resolution gene functions. The discrimination procedure utilizes the support vector machine classification technique. The application to the oxygenic photosynthetic process resulted in 126 highly confident candidate genome features. While many of these features are well-known components in the oxygenic photosynthetic process, others are completely unknown, even including some hypothetical proteins. It is obvious that our algorithm is capable of discovering features related to a targeted biochemical process.

Original languageEnglish
Title of host publicationProceedings of the 2003 IEEE Bioinformatics Conference, CSB 2003
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages235-243
Number of pages9
ISBN (Electronic)0769520006, 9780769520001
DOIs
StatePublished - 2003
Event2nd International IEEE Computer Society Computational Systems Bioinformatics Conference, CSB 2003 - Stanford, United States
Duration: Aug 11 2003Aug 14 2003

Publication series

NameProceedings of the 2003 IEEE Bioinformatics Conference, CSB 2003

Conference

Conference2nd International IEEE Computer Society Computational Systems Bioinformatics Conference, CSB 2003
Country/TerritoryUnited States
CityStanford
Period08/11/0308/14/03

Funding

FundersFunder number
National Science Foundation

    Keywords

    • genome comparative analysis
    • key genome features
    • oxygenic photosynthetic process
    • support vector machines

    Fingerprint

    Dive into the research topics of 'An SVM-based algorithm for identification of photosynthesis-specific genome features'. Together they form a unique fingerprint.

    Cite this