Editorials |
From The Center for Cardiovascular Bioinformatics and Modeling and The Whitaker Biomedical Engineering Institute, Johns Hopkins University School of Medicine and Whiting School of Engineering, Baltimore, MD.
Correspondence to Raimond L. Winslow, Rm 201B Clark Hall, The Johns Hopkins University, 3400 N. Charles St., Baltimore MD 21218. E-mail rwinslow{at}bme.jhu.edu
See related article, pages 617625
Key Words: microarrays gene expression
One of the most exciting new experimental technologies to emerge in recent years has been methods for obtaining genome-wide mRNA expression data using oligonucleotide1 and cDNA microarrays2 (for review, see Cook and Rosenzweig3). Application of gene expression profiling was limited initially by cost, which in turn imposed severe constraints on the number of hybridizations that could be performed and thus the statistical significance of experimental results. This barrier has been reduced as array fabrication techniques have advanced and it is now common for studies to investigate differential gene expression at relatively large sample numbers, see for example, Singh et al4 and Margulis et al5. Improved statistical methods for inferring regulated genes6 as well as for grouping genes on the basis of related expression patterns7 have also been developed and refined.
Application of these methods to analysis of differential gene expression in cardiovascular disease typically reveals large numbers of genes with altered expression. These numbers can range from hundreds8 to thousands5 and they pose a new challengehow best do we sift through these lists to identify those genes having relevance to disease development and progression (referred to as "candidate genes")? The most common approach is to perform a manual or semi-automated9 search of publicly available databases to evaluate gene function on the basis of annotation, and to then select as candidate genes those with function deemed most relevant to the disease process. While useful, such approaches are subjective and limited by the availability and accuracy of functional annotations. The study of Yagil et al presented in this issue of Circulation Research10 represents an alternative, experimentally-driven approach to the problem of identifying a restricted, biologically relevant set of candidate genes influencing the disease phenotype of hypertension.
In prior work, Yagil et al developed the Sabra hypertension prone (SBH/year) and resistant (SBN/year) rat model and used this animal model to perform QTL mapping of salt-induced hypertension susceptibility.11 They identified 2 QTLs (SS1a and SS1b) on chromosome 1 in males as well as an additional QTL (SS17) on chromosome 17 in females accounting for salt loading-induced variance of blood pressure.12 To provide functional evidence of the identified QTLs, Yagil et al13 constructed congenic strains of the salt-sensitivity QTLs SS1a and SS1b by replacing the QTLs in SBH/year with the chromosomal counterparts from SBN/year and found that blood pressure decreased, confirming that SS1a and SS1b are associated with salt susceptibility.
In this study, Yagil et al10 have gone a step beyond this approach to integrate gene expression analysis with prior information from QTL mapping to identify candidate genes responsible for salt-induced hypertension in rat. Gene expression profiling was performed using kidney tissue samples from four different populations consisting of male SBN/year and SBH/year rats fed with either a normal or salt-loaded diet. They first identified a total of 2470 genes that were differentially expressed across the 4 groups. Using 2-way analysis of variance and clustering methods, they then grouped these genes into 9 different clusters. Two of these clusters were associated with rat strain, 2 with the effect of salt-loading, and 2 with rat strain and salt-loading. The authors then focused on analysis of genes in the latter 2 clusters, identifying 7 genes that were cluster members, and with chromosomal location mapping within the boundaries of the SS1a and SS1b QTLs identified previously on chromosome 1. These 7 genes therefore have the properties that: (a) they are located within QTLs previously determined as being related to salt-sensitive hypertension; and (b) their expression is regulated by both rat strain and salt-loaded diet. Yagil et al propose these as novel candidate genes for hypertension.
This work provides a starting point for further development of data-driven methods for identifying small numbers of candidate genes likely to have biological relevance to a disease phenotype. Microarray technology is advancing rapidly and chip-based assays capable of genotyping many thousands of single oligonucleotide polymorphisms in large populations are now under development.14 Computational methods for performing genome-wide association analysis using data from these arrays, such as the exhaustive allelic transmission disequilibrium test of Lin et al,15 are being developed. When combined with data from expression profiling, such whole-genome association analyses have the potential to support high-throughput discovery of candidate coding and noncoding genomic regions regulating disease phenotype.
Yagil et al have submitted their hybridization data sets to the European Bioinformatics Institute, where they are available for download and subsequent analysis. Doing so is of vital importance to the research community, as it enables others to not only reproduce published research results, but to undertake additional data-mining studies to discover new biological knowledge. However, archiving of data should be viewed only as a necessary first step toward a more broad-based approach to data dissemination, data analysis and quantitative computational modeling of cardiovascular function. Increasingly, this capability is being provided by "biogrids"distributed networks of computers and storage systems that can make both data, and data analysis methods and models available to geographically-dispersed user groups.16 A leading example of a biogrid is the Biomedical Informatics Research Network (BIRN).17 Supported by the National Institutes of Health, BIRN is currently a consortium of 19 universities and 26 research groups developing methods for the collection, dissemination and analysis of brain imagery relating to neurologic disorders. BIRN provides users access to data, computational resources and data analysis applications through web portals. With the large investment being made by the National Heart, Lung and Blood Institute into genomic, proteomic and clinical proteomics centers, it is time to follow the lead of the BIRN project and develop a cardiovascular research grid supporting the collection and dissemination of cardiovascular data, data analysis methods, and quantitative models of cardiovascular system function at biological levels of organization ranging from that of the gene to the transcriptome, proteome, cellular, tissue and organ level. Such a grid would provide a valuable resource to the national and international cardiovascular research community.
| Footnotes |
|---|
| References |
|---|
|
|
|---|
Related Article:
This article has been cited by other articles:
![]() |
E.K. Moses, E. Fitzpatrick, K.A. Freed, T.D. Dyer, S. Forrest, K. Elliott, M.P. Johnson, J. Blangero, and S.P. Brennecke Objective prioritization of positional candidate genes at a quantitative trait locus for pre-eclampsia on 2q22 Mol. Hum. Reprod., August 1, 2006; 12(8): 505 - 512. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Circulation Research Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2005 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |