Identification of candidate genes in Arabidopsis and Populus cell wall biosynthesis using text-mining, co-expression network analysis and comparative genomics

Xiaohan Yang, Chu Yu Ye, Anjali Bisaria, Gerald A. Tuskan, Udaya C. Kalluri

Research output: Contribution to journal, Article, peer-reviewed

29 Scopus citations

Abstract

Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical to reducing biomass recalcitrance, a major hindrance to the efficient generation of biofuels from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes with experimental evidence supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database, and additional genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and reconstruct a network that captured additional network neighbors, expanding the set to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high-likelihood candidates for functional characterization in relation to cell wall biosynthesis.
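The two-round network expansion described in the abstract (bait genes, then their neighbors, then the neighbors of that enlarged set) can be illustrated with a minimal sketch. This is not the authors' pipeline: the edge list, the gene names, and the helper functions `build_network`, `expand_once`, and `expand_bait_genes` are all hypothetical, and a real analysis would query a co-expression database with its own correlation thresholds rather than a hand-written toy graph.

```python
from collections import defaultdict


def build_network(edges):
    """Build an undirected adjacency map from (gene_a, gene_b) pairs.

    Each pair stands in for a co-expression link; how links are called
    (e.g. correlation cutoff) is outside the scope of this sketch.
    """
    network = defaultdict(set)
    for a, b in edges:
        network[a].add(b)
        network[b].add(a)
    return network


def expand_once(network, genes):
    """Return the input genes plus their direct network neighbors."""
    expanded = set(genes)
    for gene in genes:
        expanded |= network.get(gene, set())
    return expanded


def expand_bait_genes(network, bait_genes, rounds=2):
    """Expand a bait set for a fixed number of rounds.

    Returns the final gene set and the set size after each round.
    """
    current = set(bait_genes)
    sizes = []
    for _ in range(rounds):
        current = expand_once(network, current)
        sizes.append(len(current))
    return current, sizes


# Toy data (hypothetical gene names, not real Arabidopsis identifiers).
toy_edges = [
    ("bait1", "g1"),
    ("bait2", "g2"),
    ("g1", "g3"),
    ("g3", "g4"),
    ("g5", "g6"),  # disconnected from the baits, so never reached
]
network = build_network(toy_edges)
genes, sizes = expand_bait_genes(network, {"bait1", "bait2"}, rounds=2)
print(sorted(genes))
print(sizes)
```

In this toy graph the first round adds the direct neighbors of the baits and the second round adds one more layer, mirroring in miniature the 121 to 548 to 694 growth reported above. Fixing the number of rounds, rather than expanding until no new genes appear, keeps the candidate set close to the experimentally supported baits.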

Original language: English
Pages (from-to): 675-687
Number of pages: 13
Journal: Plant Science
Volume: 181
Issue number: 6
State: Published - Dec 2011

Funding

We would like to thank T.J. Tschaplinski and T. Li for thoughtful and insightful comments on the manuscript. This research was supported by the U.S. DOE BioEnergy Science Center. The BioEnergy Science Center is a U.S. Department of Energy Bioenergy Research Center supported by the Office of Biological and Environmental Research in the DOE Office of Science. Oak Ridge National Laboratory is managed by UT-Battelle, LLC for the U.S. Department of Energy under Contract Number DE-AC05-00OR22725.

Funders (funder number, where given):
BioEnergy Science Center
DOE BioEnergy Science Center
DOE Office of Science
Office of Biological and Environmental Research
U.S. Department of Energy Bioenergy Research Center
U.S. Department of Energy (DE-AC05-00OR22725)
Oak Ridge National Laboratory

    Keywords

    • Arabidopsis
    • Bioinformatics
    • Biomass
    • Cell wall
    • Gene expression
    • Populus
