Genetics and functional genomics of type 2 diabetes mellitus
© BioMed Central Ltd 2003
Published: 24 November 2003
Skip to main content
© BioMed Central Ltd 2003
Published: 24 November 2003
Genome-wide studies of transcription in the skeletal muscle of type 2 diabetic patients have identified coordinated changes in the expression of genes involved in oxidative phosphorylation, and have underlined the central role of the oxidative-phosphorylation regulator, PCG1α. These findings help unravel the complex pathogenesis and inheritance of polygenic type 2 diabetes mellitus.
Type 2 diabetes mellitus is a heterogeneous clinical entity, characterized by high blood sugar levels or hyperglycaemia, arising from a deteriorated tissue response to the biological effects of insulin (insulin resistance)  and impaired glucose-induced insulin secretion . It is a classical example of multifactorial disorder: the etiology of type 2 diabetes combines both genetic and environmental factors. The 'westernized lifestyle' is a likely cause of the recent rapid rise in disease incidence, which is projected to double worldwide by the year 2025 (of 150 million diabetics globally, approximately 90% live in western countries). The management of type 2 diabetes and its complications accounts for a significant share of national annual healthcare spending in all western countries. Together, these facts have galvanized international efforts to research the causes of type 2 diabetes mellitus and to develop effective preventive and therapeutic tools and strategies for managing the disease.
International genetic and genomic projects have delivered a wide range of tools and resources for genome-wide investigations to help the search for genetic factors involved in type 2 diabetes mellitus and have contributed to recent progress in our knowledge of the etiology of the disease (for the remainder of this article, when we say 'diabetes' we mean type 2 diabetes mellitus). Genetic markers and dense genetic maps allow extensive searches for gene variants that co-segregate with the disease. Although many genetic loci have been described that are linked to either diabetes or associated pathophysiological markers , CALPAIN10 remains the only known gene underlying polygenic type 2 diabetes mellitus .
The vast amount of information derived from the annotation of the human genome and the genomes of model organisms provide opportunities for using new strategies to identify genes underlying diabetes. Knowledge of the human transcriptome and proteome provides unparalleled power for searching comprehensively for the causes and consequences of the disease, and diagnostic and therapeutic targets and agents relevant to it. In theory it is possible to characterize gene-expression patterns (in terms of transcription, translation and post-translational changes) in any cell type and to generate a broad functional picture of the molecular mechanisms involved in health and disease. Differences in gene expression between diabetic patients and controls are indicative of a relationship between a specific gene or series of genes and the disease in a specific context.
The transcriptome forms the crucial intermediate between the genetic-code template and the protein products that constitute the structural components and operational machinery of organisms. The technology required to array large numbers of transcripts or oligonucleotides and detect differences in the amount of individual transcripts is obviously more straightforward than that required for high-throughput genome-wide protein screening. Fast and efficient genome-wide transcription profiling, using microarrays, can reveal an overall picture of genes potentially involved in the causes and consequences of a disease, and this technology has been successfully used in various fields of biomedical research.
Skeletal muscle constitutes about 40% of mammalian body mass and is, with fat and liver tissues, a key site of insulin stimulated glucose disposal . Impaired insulin-mediated glucose disposal by muscle is a key feature of diabetes  and skeletal muscles are, therefore, key tissue targets for gene-expression studies of diabetes. Two recent landmark studies [7, 8] have used microarray analyses to highlight the central role in the human diabetic-muscle phenotype of two key regulators of oxidative phosphorylation: the peroxisome proliferation-activated receptor-γ (PPARγ) coactivator 1 (PGC1α) and nuclear respiratory factor (NRF1). These studies were based on determination of the expression levels of large sets of 7,129  and 22,000 genes , respectively, in muscle biopsies of diabetic patients, prediabetics and controls.
Novel analytical methods used to exploit the microarray data generated in the two studies clearly underline the importance of considering modest but coordinated changes in genome-wide transcription patterns, rather than individual changes that meet an arbitrary threshold. None of the genes examined in either study was differentially expressed between the different groups tested (diabetic patients, prediabetics and controls) following robust statistical correction for false discovery during multiple hypothesis testing. Both studies examined differential covariation - between test subject groups or classes - of sets of genes (within gene group covariation). Genes were labeled by ontology classification, then ranked by differential expression, and this sequential procedure facilitated the detection of pathway-dependent co-regulated genes. Mootha et al.  computed a Kolmogorov-Smirnov test statistic (termed Gene Sets Enrichment Approach, or GSEA) for each gene, which was then evaluated against an empirically determined (permutation test) threshold for declaring association between a gene set and diabetes. In contrast, Patti et al.  examined only genes that were differentially expressed before correction for multiple testing (p < 0.05).
Remarkably, both studies found that the transcription levels of a class of genes involved in oxidative phosphorylation mechanisms were consistently lower in diabetics than in controls, even following robust correction for multiple testing, and both therefore suggested that the downregulation of mitochondrial oxidative phosphorylation genes may precede diabetes onset. The results of real-time quantitative PCR demonstrated that the expression of NRF1, which regulates the expression of nuclear-encoded mitochondrial genes, was decreased by 29% in diabetics compared to non-diabetics . Both studies [7, 8] also found that PGC1α expression was significantly lower (22-36%) in diabetics than in other test groups, and Patti et al.  showed that PGC1β expression was also significantly decreased (46%) in diabetics compared to non-diabetic subjects.
Poor replicability of significant genetic linkage and association to a disease is often attributed to a number of factors, including, for example, ethnic stratification, population-specific linkage disequilibrium between markers and causal variants, and gene-by-gene and gene-by-environment interactions. The possible importance of some of these factors has been addressed in an attempt to replicate previously published associations in 16 candidate genes for type 2 diabetes mellitus. The gene encoding PPARγ was the only one for which the study could confirm an association between common variants and a significant increase in diabetes risk . The existence in a non-diabetic population of disease susceptibility gene variants that are not expressed phenotypically, for example because of the late onset of diabetes, is also a factor that reduces the power of genetic-linkage and association analyses for this disease. Remarkably, although Mootha et al.  and Patti et al.  independently investigated patient samples from different populations (Mexican-Americans and Caucasians, respectively) using different microarrays and different data-analysis tools, the results from both studies converged at the same gene pathway. Using biological samples from patients selected for homogenous clinical phenotypes may have minimized the effect of genetic variability on gene transcription. It was, however, reported that reduced oxidative phosphorylation gene transcription was also found in 'prediabetics' .
Population size is also a key factor affecting the detection of genetic linkage and association. Results from association studies in type l diabetes mellitus provided support for the requirement of very large population and family datasets to improve the statistical robustness and reliability of genetic-association studies of complex traits . On the other hand, screening a very large number of single nucleotide polymorphisms (SNPs) in a relatively modest panel of patients was sufficient to identify a functional sequence variant associated with susceptibility to myocardial infarction . It is becoming apparent that, in contrast to traditional genetic association studies, gene-expression profiling studies can provide robust and functionally valid results with samples from a very limited number of well-selected patients and controls [7, 8].
But in contrast to gene-expression profiling, genetic-linkage mapping provides statistical evidence for the involvement of chromosomal regions co-segregating with a disease or disease-associated phenotypes. It is essentially a strategy to define etiologically important factors underlying multifactorial disorders, in which potential biases are either minimized or well defined. Unless all pathophysiological variables associated with diabetes - glucose intolerance, altered insulin secretion and action, obesity, dyslipidemia and high blood pressure - and its complications are comprehensively characterized in both patients and controls of a large cohort, results from linkage analysis can only identify chromosomal regions involved in the control of one phenotype. This strategy removes a number of the constraints and limitations of functional studies, including for example tissue- and developmental-stage-specific gene expression. Most importantly, linkage analysis can point to sequence elements, including pseudogenes , non polyadenylated RNAs and non-coding sequences  that may play an important unrecognized role in complex diseases but cannot be directly tested by current microarray-based transcription-profiling technologies.
Determining the causes and consequences of a complex disease is the most challenging issue that genetic and functional genomic studies currently attempt to address . It is pertinent to consider conceptually how decreased oxidative phosphorylation might result in diabetes. A large body of existing scientific literature supports the hypothesis of Patti et al. : decreased expression of PGC1α coupled with reduced expression of NRF1 and PPARγ eventually result in decreased oxidative phosphorylation and lipid oxidation, accumulation of lipid in skeletal muscle and ultimately diabetes. Many organs, including endocrine pancreas, liver and fat, are important in maintaining glucose homeostasis and energy balance. One must therefore consider whether differential transcription of components of the oxidative-phosphorylation pathway in diabetes is limited to muscle and how it relates to the whole-organism functional picture of diabetes. For example, ATP generated from oxidative phosphorylation is required for normal glucose-stimulated insulin release. The downregulation of oxidative-phosphorylation genes seen in diabetic muscle would be expected to be mirrored in pancreatic islets and may account for a lack of insulin related to beta-cell dysfunction in advanced diabetes .
In conclusion, transcriptomics is well advanced for technological reasons and will deliver vast amounts of information on the complex regulation of tissue- and time-specific gene expression in disease onset and progression. One of the important perspectives of genome-wide transcriptomic studies lies in the possibility to integrate genes and expressed sequence tags (ESTs) of unknown or poorly defined roles into functional pathways, thus improving functional genome annotation. Although GSEA is a significant advance for interpreting microarray data, it relies on our current understanding of the regulation of metabolic and hormonal networks. Results from the investigation of the diabetic muscle transcriptome [7, 8] suggest that gene-transcription profiling on its own may have a limited impact on the enrichment of system biology maps. Gene transcription, however, represents only a single dimension in functional genome maps. In particular, as exemplified by Epstein et al. , post-translational modification is a crucial mechanism for gene regulation that is not necessarily discernible by studying the transcriptome. Metabonomics and metabolomics are also genome-wide technologies that will enrich genome annotation in health and disease [17, 18]. Global integrative analyses of multidimensional genomic parameters exploiting the concept of a biological atlas advanced by Vidal et al.  probably represents the most optimal use of resources to isolate variables that are relevant to complex diseases in the whole organism, and to describe their interplay.
D.G. holds a Wellcome Senior Fellowship in Basic Biomedical Science (057733). The authors acknowledge support from the Wellcome Trust Functional Genomics Initiatives BAIR (Biological Atlas of Insulin Resistance) (066786) and CFG (Cardiovascular Functional Genomics) (066780).