Abstract
Biologists hope to address grand scientific challenges by exploring the abundance of data made available through microarray analysis and other high-throughput techniques. However, the impact of this large volume of data is limited unless researchers can effectively assimilate the entirety of this complex information and integrate it into their daily research; interactive visualization tools are called for to support the effort. Specifically, typical studies of gene coexpression can make use of novel visualization tools that enable the dynamic formulation and fine-tuning of hypotheses to aid the process of evaluating sensitivity of key parameters and achieving data reduction. These tools should allow biologists to develop an intuitive understanding of the structure of biological networks and discover genes that reside in critical positions in networks and pathways. By using a graph as a universal data representation of correlation in gene expression data, our novel visualization tool employs several techniques that when used in an integrated manner provide innovative analytical capabilities. Our tool for interacting with gene coexpression data integrates techniques such as graph layout, qualitative subgraph extraction through a novel 2D user interface, quantitative subgraph extraction using graph-theoretic algorithms or by querying an optimized B-tree, dynamic level-of-detail graph abstraction, and template-based fuzzy classification using neural networks. We demonstrate our system using a real-world workflow from a large-scale systems genetics study of mammalian gene coexpression.
| Original language | English |
|---|---|
| Article number | 4492772 |
| Pages (from-to) | 1081-1094 |
| Number of pages | 14 |
| Journal | IEEE Transactions on Visualization and Computer Graphics |
| Volume | 14 |
| Issue number | 5 |
| DOIs | |
| State | Published - Sep 2008 |
Funding
The authors gratefully acknowledge Robert Williams and colleagues for making their BXD brain gene expression data public and Shawn Ericson of GeNetViz for the three additional data sets used in Table 1. The authors wish to thank Dr. Michael Langston et al. for the paraclique algorithm used for processing the biological data, Yihua Bai for her collaboration with the BTD code, Stephen Pitts for coexpres-sion network analysis, and Arne Frick for the GEM3D source. The work is funded in part by US National Science Foundation (NSF) CNS-0437508 and through the US Department of Energy (DOE) SciDAC Institute of Ultra-Scale Visualization under DOE DE-FC02-06ER25778. Elissa J. Chesler is supported by NIH/NIAAA Integrative Neuroscience Initiative on Alcoholism (U01AA13499, U24AA13513, U01AA016662), NIH/NIDA R01DA020677, and NICHD R01HD052472-01.
Keywords
- Bioinformatics visualization
- Focus+context techniques
- Graph and network visualization
- Life sciences and engineering
- Visualization in physical sciences