Evolutionary rates and centrality in the yeast gene regulatory network
© Jovelin and Phillips; licensee BioMed Central Ltd. 2009
Received: 9 October 2008
Accepted: 9 April 2009
Published: 9 April 2009
Transcription factors play a fundamental role in regulating physiological responses and developmental processes. Here we examine the evolution of the yeast transcription factors in the context of the structure of the gene regulatory network.
In contrast to previous results for the protein-protein interaction and metabolic networks, we find that the position of a gene within the transcription network affects the rate of protein evolution such that more central transcription factors tend to evolve faster. Centrality is also positively correlated with expression variability, suggesting that the higher rate of divergence among central transcription factors may be due to their role in controlling information flow and may be the result of adaptation to changing environmental conditions. Alternatively, more central transcription factors could be more buffered against environmental perturbations and, therefore, less subject to strong purifying selection. Importantly, the relationship between centrality and evolutionary rates is independent of expression level, expression variability and gene essentiality.
Our analysis of the transcription network highlights the role of network structure on protein evolutionary rate. Further, the effect of network centrality on nucleotide divergence is different among the metabolic, protein-protein and transcriptional networks, suggesting that the effect of gene position is dependant on the function of the specific network under study. A better understanding of how these three cellular networks interact with one another may be needed to fully examine the impact of network structure on the function and evolution of biological systems.
Understanding of the function and evolution of any specific gene or protein requires knowledge of the context in which that gene operates, because change in any single component of a complex system can have ramifications for all other components. This system-orientated view, largely enabled by the omics revolution, has sparked increasing interest in the investigation of biological networks and has yielded promising results in the understanding of cellular , developmental  and ecological  processes. A major challenge within this area is to determine how the various parts of a system interact in order for the system as a whole to function. With a more global understanding of system function in hand, a larger question then emerges: in what ways does the structure of the network influence the evolution of the components of that network? For example, in the yeast protein-protein interaction (PPI) and metabolic networks, central and highly connected proteins tend to evolve more slowly than peripheral genes [4–7]. Is this a global feature of all such networks, or does the specific function of a given network have a strong influence on its evolutionary properties? Here, we address these questions by analyzing the evolution of the yeast transcription factors in the context of the structure of the transcriptional regulatory network.
The premise that biological systems are more than the sum of their parts implies that such systems possess emergent properties that cannot be captured by a purely reductionist approach. For a network, one such emergent property is its topology. Comparisons of entirely different types of networks, including social, technological and biological networks, have revealed intriguing shared topological properties, such as an overall hierarchical organization, similar node-degree distributions, and a tendency toward a small-world structure in which most nodes are connected by only a few other intervening nodes . The observation that both metabolic and PPI networks display approximately scale-free topologies, with a few highly connected nodes and a majority of nodes with only a few connections, leads to the proposal that network structure may be the result of selection, perhaps as a means of providing mutational robustness . This hypothesis remains uncertain, however, because networks with node connectivity following a power-law distribution can be assembled without natural selection  and because natural selection is very weak on second order network properties such as robustness . Further, networks with similar power-law distributions can have different fine-scale architectures, which may be functionally important .
In this study we examine the evolution of the yeast transcription factors and ask whether fine differences in network structure and function lead to different evolutionary impacts on the elements of those networks. Gene regulatory networks are of particular interest because they allow the cell to modify its physiology, cycle and shape in response to environmental or developmental demands . Metabolic and gene regulatory networks have a different level of complexity than PPI networks because they are directed and explicitly model the flow of information passing through the nodes. Moreover, even though all three cellular networks are characterized by having a small number of highly connected nodes, these networks differ in their node-degree distribution . The yeast transcription regulatory network consists of a mixed scale-free and exponential topology: only the number of target genes follows a power-law distribution whereas the number of regulators is exponential . These structural and functional differences may result in different effects on the evolution of network components. For instance, underlying the power-law distribution of target genes is a distributed architecture that may cause the apparent independence between connectivity and the retention of regulatory proteins across genomes .
Overall, we show that network structure does indeed lead to different evolutionary dynamics that depends more specifically on the overall function of the network. Therefore, understanding the relationship between network structure and the evolution of network components will depend on a deeper knowledge of gene function.
Central transcription factors tend to evolve faster
We obtained node statistics, specifically the number of regulatory inputs (in-degree, k in ), the number of target genes (out-degree, k out ), and betweenness, measuring the centrality of a gene in the network, from two separately derived representations of the yeast transcriptional network. The first dataset (YTN1)  includes 286 transcription factors, 3,369 target genes and 8,372 regulatory interactions. The second dataset (YTN2)  includes 157 transcription factors, 4,410 target genes and 12,873 regulatory interactions. Only transcription factors clearly identified as orthologs in the yeast genome database (Saccharomyces Genome Database (SGD)) were retained for analysis of evolutionary rates, leading to the retention of a set of 256 genes for YTN1 and a set of 138 genes for YTN2. Because the first network contains 85% more transcription factors than the second, we have much more power to detect significant effects using the first network and therefore focus most of our discussion on that dataset. Nevertheless, both datasets yield qualitatively similar results for each of our major conclusions.
Large-scale analyses have shown that multiple genomic variables have an effect on the rate of protein evolution [16, 17]. Among them, expression level has been shown to correlate strongly with a gene's evolutionary rate [18–22], leading to a wide debate about the importance of other genomic variables such as essentiality [21–25] and connectivity [7, 21, 24, 26–28]. Therefore, we first examine the separate effects of expression, function and network related variables on rates of transcription factor sequence evolution in turn, and then tease apart their independent effects using a multivariate approach.
To further investigate the impact of functional constraints on sequence evolution, we used the number of Gene Ontology (GO) terms [30, 31] as a proxy for a gene's pleiotropic effects. GO describe a gene's properties and functions by assigning attributes under the categories 'cellular component', 'biological process' and 'molecular function'. There is a correlation between the number of GO terms and essentiality (YTN1: Spearman's ρ = 0.171, P = 0.007), indicating that pleiotropy has direct fitness consequences. Accordingly, transcription factors with more GO terms tend to evolve more slowly (Figure 1), presumably because mutations arising in genes with larger pleiotropic effects are more likely to be deleterious and are thus selected against.
Finally, the position of a gene within the network, or its centrality, has a significant influence on its evolutionary rate (Figure 1). Previous studies have determined that central metabolic enzymes and central proteins in the PPI network are under stronger selective constraints and evolve slower [4, 6]. In contrast, we find that for the transcriptional network, protein evolution is positively correlated with betweenness, indicating that transcription factors that occupy a more central position in the network tend to evolve faster (Figure 1). Similarly, contrary to metabolic and PPI networks, protein sequence divergence correlates positively with connectivity. However, the relationship between out-degree and evolutionary rate differs between the two network datasets (Figure 1).
The effect of centrality on protein sequence evolution is independent of other genomic variables
Interpretation of these simple correlation patterns is complicated by the fact that different genetic properties are correlated with one another and so any single correlation between two characteristics might actually be generated by a shared correlation with a third causal element. To correct for this, we examined the relative contribution of function, network and expression-associated constraints on transcription factor evolution using multivariate analysis.
Multiple regression of genomic variables and protein evolutionary rates
Relationships between evolutionary rates and six predictor variables
Relationships between evolutionary rates and seven predictor variables
Multiple regression of genomic variables and rates of synonymous changes
Relationships between evolutionary rates and six predictor variables
Relationships between evolutionary rates and seven predictor variables
We find no significant correlation between in-degree and essentiality (k in : YTN1: Spearman's ρ = -0.082, P = 0.2; YTN2: ρ = 0.033, P = 0.7), although the relationship between out-degree and essentiality differs between the two datasets (k out : YTN1: Spearman's ρ = -0.071, P = 0.26; YTN2: ρ = 0.27, P = 0.002). However, when growth rate is measured under different conditions, transcription factors with numerous target genes in YTN2 are not enriched in essential genes . Nevertheless, the correlation between the number of target genes and protein sequence divergence is fairly weak, as multiple regression analysis failed to disentangle the effect of out-degree from the causal effect of other predictor variables (Table 1). Therefore, contrary to the PPI [4, 8] and metabolic [5, 6] networks, there is no significant correlation between connectivity and essentiality, while in-degree is in fact positively correlated with protein sequence divergence.
Importantly, this analysis also shows that the contribution of network centrality to protein divergence is independent of expression and function-related variables (Table 1). Thus, a striking difference among cellular networks lies in the influence of the position of a gene within the network on its rate of evolution. However, transcription factors that are more central in the network do tend to show higher variability in their expression level under changing conditions (YTN1: Spearman's ρ = 0.178, P = 0.004), but centrality is not correlated with expression level (YTN1: Spearman's ρ = 0.006, P = 0.924) and essentiality (YTN1: Spearman's ρ = -0.022, P = 0.735).
Principal component regression analysis: principal components PC1 to PC4
Percent variance explained by each PC
Effect of PCs on response variables
Contribution of predictor variables to each PC
Principal component regression analysis: principal components PC5 to PC8
Percent variance explained by each PC
Effect of PCs on response variables
Contribution of predictor variables to each PC
Results of multiple regression analysis on composite variables
Percent of variance explained by PC1
In summary, our results on the yeast transcription network and previous work on the yeast metabolic and PPI networks [4–6] show that the structure of cellular networks influences the evolution of proteins within these networks. However, the system-level pattern of selective constraints at individual nodes differ despite the three networks having grossly similar topologies, perhaps in relation with the function and the nature of the network.
Genomic information generated in recent years has not only offered new insights into biological processes at various levels of organization [1–3, 32], but has also enabled a shift from studying the evolution of single or few genes to a system-level view of molecular evolution that integrates interactions among genes within their cellular context. A first consequence of this new perspective is the recognition that several factors in addition to protein function control rate divergence in coding sequences [16, 17], with expression level having a strong effect [20, 21].
A second consequence of this systems molecular evolution perspective is that it yields novel insights into how cellular networks and their components evolve. Previous studies have noted that metabolic enzymes with high degree are no more essential than those with low degree, perhaps because rerouting of metabolic fluxes in highly connected regions circumvents loss of function mutations at a given node . The absence of correlation between connectivity and essentiality observed here may be the consequence of a similar mechanism of genetic robustness achieved through rerouting of information flow through the transcriptional network. This hypothesis is further suggested by a recent study showing that the mean sequence divergence among intermediate regulators between a top regulator and its target gene increases with the number of alternative pathways between the regulator-target gene pair .
We obtain qualitatively similar results from our analysis of both representations of the transcriptional network [14, 15]. This is especially true if we account for the overall correlation structure among the variables within the network, function, and expression classes (Table 5). Many more transcription factors are represented in the first network, however, which makes it much easier to detect significant evolutionary associations. It is clear, therefore, that completeness of the network will influence conclusions from global analyses such as that conducted here. Nevertheless, the fact that similar results are obtained from different network datasets, which undoubtedly capture different levels of network complexity, suggests that the results presented here are somewhat robust to overall sampling issues.
Our results on the yeast transcription network and previous work on the yeast metabolic and PPI networks [4–6] show that the structure of cellular networks influences selective constraints at individual nodes, but that these system-level constraints differ despite the three cellular networks having similar, although not identical, topological properties [1, 13]. These differences may ultimately be due to the nature of the networks and how they function. Highly connected proteins in the PPI and metabolic networks are subject to stronger purifying selection, presumably because of a larger fraction of sites involved in interactions and because of kinetic constraints due to highly used metabolites, respectively [5, 7].
In contrast, transcription networks play fundamental roles in regulating cell state during developmental processes and during physiological adjustment to changing environmental conditions . For instance, changes in growth conditions lead Escherichia coli to regulate transcript and protein levels to maximize growth rate and maintain stable metabolite levels, whereas when enzymes of the carbon metabolism network are disrupted, system stability is achieved through redundancy and flux rerouting . In eukaryotes other than yeast, transcriptional variability (which might serve as an indicator of environmental sensitivity), rather than expression level per se, seems to correlate better with protein divergence . Here, transcription factors that are more central in the network tend to show higher variability in their expression level in changing conditions. At a local scale, expression variability within a regulatory motif also depends on network structure . However, we do not find a significant effect of expression variation on transcription factor evolution (Table 1). The influence of centrality on the rate of protein evolution in the yeast transcription factor network is therefore not a secondary effect of selection acting directly on transcriptional variability. Because central transcription factors have rapid access to many regions of the network and may act to control the flow of information across the network, they may be important components of sensory systems that transduce environmental changes and coordinate the response of the regulatory network. It is possible that the higher level of amino acid change seen in central transcription factors is therefore the result of historical adaptation to changing environmental conditions. An alternative hypothesis is that more central transcription factors are instead more buffered from outside influences and therefore less subject to strong purifying selection.
Although the relationship between centrality and evolutionary rate is somewhat unexpected, examination of the fine scale structure of other networks indicates that this may be a general property of control systems. For example, although highly connected proteins (hubs) in the yeast PPI network evolve slowly [4, 7], intermodule hubs (those that display temporal variation in their connections) are more divergent than intramodule hubs (those displaying static patterns of interactions) . Similarly, directional selection has recently been inferred at controlling, branch-point enzymes in four out of five metabolic pathways converging to glucose-6-phosphate in Drosophila . Thus, proteins that exert some control in flux distribution, information processing or in connecting various protein complexes may, in general, be the target of adaptation because mutations arising in these proteins have the potential to affect the entire system and may, therefore, be more exposed to natural selection.
The system-level pattern of evolutionary rates is different from that observed in the protein-protein interaction and metabolic networks: central transcription factors tend to evolve faster. This suggests that the higher nucleotide rate divergence in central transcription factors may result from the role that these proteins play in controlling the flow of information and may be the result of adaptation to changing environmental conditions. The conclusions derived from network level analyses of molecular evolution can clearly vary depending on the functional role played by the components of that network. In the same way that we have shown that the particular function of a network can influence how one interprets the impact of its structure on protein evolution, it is clear that we must begin to link all of these networks (regulatory, protein-protein, and metabolic) together so that the complete nature and consequences of network structure on the function and evolution of biological systems can be examined.
Materials and methods
We used two distinct datasets of the yeast transcriptional network. The first dataset , YTN1, includes 286 transcription factors, 3,369 target genes and 8,372 regulatory interactions. The second dataset , YTN2, includes 157 transcription factors, 4,410 target genes and 12,873 regulatory interactions. The two networks were derived from largely independent genetic, biochemical and ChIP-chip experiments. Node statistics, including in-degree (k in ), out-degree (k out ) and betweenness, were obtained for each dataset using the tYNA platform .
Protein sequences of orthologous genes from Saccharomyces cerevisiae  and S. paradoxus, the most closely related species  having its genome sequenced , were retrieved from the SGD , aligned , and subsequently used to generate codon-based DNA sequence alignments. Maximum likelihood estimates of the rates of amino acid replacements (dN) and synonymous changes (dS) were computed in CODEML . In addition, we computed the rate of synonymous changes corrected for selection at silent sites (dS') . We also attempted to correct for the correlation between dN and dS by using the residuals of the regression between dN with dS in our analyses.
Essentiality was defined by a lethal phenotype in deletion strains . For a quantitative measure of a gene's essentiality we used growth rates measured in deletion mutants . The number of GO terms [30, 31] used as a proxy for a gene's pleiotropic effect was obtained from the SGD. Protein and mRNA abundance have been used as estimates of gene expression in studies of evolutionary rates in yeast [18–22, 24, 25, 37]. We obtained protein  and mRNA  abundance from the literature. However, in our sample faster evolving genes are more likely to be missing from the mRNA abundance (YTN1: N = 206; YTN2: N = 108) and protein abundance (YTN1: N = 195; YTN2: N = 96) datasets, leading to an obvious bias (YTN1: protein abundance: dN/dS: Wilcoxon two-sample P = 0.004; mRNA abundance: dN/dS: Wilcoxon two-sample P = 0.04; YTN2: protein abundance: dN: Wilcoxon two-sample P = 0.03; mRNA abundance: dN/dS: Wilcoxon two-sample P = 0.06). Nevertheless, the translational robustness hypothesis suggests that the frequency of translation events is a better indicator of evolutionary rate than the number of proteins per cell . Therefore, we used the codon adaptation index (CAI) , which measures synonymous codon usage bias and correlates with mRNA abundance , as well as direct measures of expression level, as substitutes for other abundance measures. CAI was computed  using the reference gene set defined by Carbone et al. . Expression level is the average level of expression across 198 microarrays from a wide range of conditions . Expression variation is measured by the coefficient of variation defined as the mean over the standard deviation .
Statistical analyses were performed using JMP 4.0.4 (SAS Institute Inc., Cary, NC, USA). The number of GO terms and k out were natural-logarithmic transformed to approximate a normal distribution. One unit was added to betweenness and k in , as well as k out in YTN2, prior to the natural logarithmic transformation because of null values for these variables. All variables, including predictor and response variables, were standardized to a mean of 0 and 1 standard deviation unit. In addition to Spearman's rank correlations and multiple regression analysis, we also performed principal component regression analysis, first using single predictor variables together and then by defining a new set of principal components separately from the expression, network and function-related variables. These composite variables were obtained from the first principal component of expression (CAI, expression level, expression variability), network (betweenness, k in , k out ) and function (GO, essentiality) related variables. Principal component analyses were performed on correlations.
codon adaptation index
- dN :
rate of nonsynonymous changes
- dS :
rate of synonymous changes
- dS' :
rate of synonymous changes corrected for selection at silent sites
- k in :
- k out :
Saccharomyces Genome Database.
Support was provided by a Doctoral Dissertation Improvement Grant (NSF DEB-0710378), the UO IGERT program in Evolution, Development and Genomics (NSF DGE-9972830) and by grants from the National Science Foundation (DEB-0441066) and the National Institutes of Health (AG029377).
- Barabasi AL, Oltvai ZN: Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004, 5: 101-113. 10.1038/nrg1272.PubMedView ArticleGoogle Scholar
- Davidson EH, Rast JP, Oliveri P, Ransick A, Calestani C, Yuh CH, Minokawa T, Amore G, Hinman V, Arenas-Mena C, Otim O, Brown CT, Livi CB, Lee PY, Revilla R, Rust AG, Pan Z, Schilstra MJ, Clarke PJ, Arnone MI, Rowen L, Cameron RA, McClay DR, Hood L, Bolouri H: A genomic regulatory network for development. Science. 2002, 295: 1669-1678. 10.1126/science.1069883.PubMedView ArticleGoogle Scholar
- Proulx SR, Promislow DE, Phillips PC: Network thinking in ecology and evolution. Trends Ecol Evol. 2005, 20: 345-353. 10.1016/j.tree.2005.04.004.PubMedView ArticleGoogle Scholar
- Hahn MW, Kern AD: Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol Biol Evol. 2005, 22: 803-806. 10.1093/molbev/msi072.PubMedView ArticleGoogle Scholar
- Vitkup D, Kharchenko P, Wagner A: Influence of metabolic network structure and function on enzyme evolution. Genome Biol. 2006, 7: R39-10.1186/gb-2006-7-5-r39.PubMedPubMed CentralView ArticleGoogle Scholar
- Lu C, Zhang Z, Leach L, Kearsey MJ, Luo ZW: Impacts of yeast metabolic network structure on enzyme evolution. Genome Biol. 2007, 8: 407-10.1186/gb-2007-8-8-407.PubMedPubMed CentralView ArticleGoogle Scholar
- Fraser HB, Hirsh AE, Steinmetz LM, Scharfe C, Feldman MW: Evolutionary rate in the protein interaction network. Science. 2002, 296: 750-752. 10.1126/science.1068696.PubMedView ArticleGoogle Scholar
- Jeong H, Mason SP, Barabasi AL, Oltvai ZN: Lethality and centrality in protein networks. Nature. 2001, 411: 41-42. 10.1038/35075138.PubMedView ArticleGoogle Scholar
- Wagner A: How the global structure of protein interaction networks evolves. Proc Biol Sci. 2003, 270: 457-466. 10.1098/rspb.2002.2269.PubMedPubMed CentralView ArticleGoogle Scholar
- Proulx SR, Phillips PC: The opportunity for canalization and the evolution of genetic networks. Am Nat. 2005, 165: 147-162. 10.1086/426873.PubMedView ArticleGoogle Scholar
- Keller EF: Revisiting "scale-free" networks. Bioessays. 2005, 27: 1060-1068. 10.1002/bies.20294.PubMedView ArticleGoogle Scholar
- Davidson EH: The Regulatory Genome. 2006, Amsterdam, Boston, Heidelberg, London, New York, Oxford, Paris, San Diego, San Francisco, Singapore, Sydney, Tokyo: Academic PressGoogle Scholar
- Guelzim N, Bottani S, Bourgine P, Kepes F: Topological and causal structure of the yeast transcriptional regulatory network. Nat Genet. 2002, 31: 60-63. 10.1038/ng873.PubMedView ArticleGoogle Scholar
- Balaji S, Iyer LM, Aravind L, Babu MM: Uncovering a hidden distributed architecture behind scale-free transcriptional regulatory networks. J Mol Biol. 2006, 360: 204-212. 10.1016/j.jmb.2006.04.026.PubMedView ArticleGoogle Scholar
- Yu H, Gerstein M: Genomic analysis of the hierarchical structure of regulatory networks. Proc Natl Acad Sci USA. 2006, 103: 14724-14731. 10.1073/pnas.0508637103.PubMedPubMed CentralView ArticleGoogle Scholar
- Rocha EP: The quest for the universals of protein evolution. Trends Genet. 2006, 22: 412-416. 10.1016/j.tig.2006.06.004.PubMedView ArticleGoogle Scholar
- Pál C, Papp B, Lercher MJ: An integrated view of protein evolution. Nat Rev Genet. 2006, 7: 337-348. 10.1038/nrg1838.PubMedView ArticleGoogle Scholar
- Pál C, Papp B, Hurst LD: Highly expressed genes in yeast evolve slowly. Genetics. 2001, 158: 927-931.PubMedPubMed CentralGoogle Scholar
- Wall DP, Hirsh AE, Fraser HB, Kumm J, Giaever G, Eisen MB, Feldman MW: Functional genomic analysis of the rates of protein evolution. Proc Natl Acad Sci USA. 2005, 102: 5483-5488. 10.1073/pnas.0501761102.PubMedPubMed CentralView ArticleGoogle Scholar
- Drummond DA, Bloom JD, Adami C, Wilke CO, Arnold FH: Why highly expressed proteins evolve slowly. Proc Natl Acad Sci USA. 2005, 102: 14338-14343. 10.1073/pnas.0504070102.PubMedPubMed CentralView ArticleGoogle Scholar
- Drummond DA, Raval A, Wilke CO: A single determinant dominates the rate of yeast protein evolution. Mol Biol Evol. 2006, 23: 327-337. 10.1093/molbev/msj038.PubMedView ArticleGoogle Scholar
- Pál C, Papp B, Hurst LD: Genomic function: Rate of evolution and gene dispensability. Nature. 2003, 421: 496-497. 10.1038/421496b. discussion 497-498.PubMedView ArticleGoogle Scholar
- Hirsh AE, Fraser HB: Protein dispensability and rate of evolution. Nature. 2001, 411: 1046-1049. 10.1038/35082561.PubMedView ArticleGoogle Scholar
- Plotkin JB, Fraser HB: Assessing the determinants of evolutionary rates in the presence of noise. Mol Biol Evol. 2007, 24: 1113-1121. 10.1093/molbev/msm044.PubMedView ArticleGoogle Scholar
- Kim SH, Yi SV: Understanding relationship between sequence and functional evolution in yeast proteins. Genetica. 2007, 131: 151-156. 10.1007/s10709-006-9125-2.PubMedView ArticleGoogle Scholar
- Bloom JD, Adami C: Apparent dependence of protein evolutionary rate on number of interactions is linked to biases in protein-protein interactions data sets. BMC Evol Biol. 2003, 3: 21-10.1186/1471-2148-3-21.PubMedPubMed CentralView ArticleGoogle Scholar
- Fraser HB, Hirsh AE: Evolutionary rate depends on number of protein-protein interactions independently of gene expression level. BMC Evol Biol. 2004, 4: 13-10.1186/1471-2148-4-13.PubMedPubMed CentralView ArticleGoogle Scholar
- Bloom JD, Adami C: Evolutionary rate depends on number of protein-protein interactions independently of gene expression level: response. BMC Evol Biol. 2004, 4: 14-10.1186/1471-2148-4-14.PubMedPubMed CentralView ArticleGoogle Scholar
- Giaever G, Chu AM, Ni L, Connelly C, Riles L, Veronneau S, Dow S, Lucau-Danila A, Anderson K, Andre B, Arkin AP, Astromoff A, El-Bakkoury M, Bangham R, Benito R, Brachat S, Campanaro S, Curtiss M, Davis K, Deutschbauer A, Entian KD, Flaherty P, Foury F, Garfinkel DJ, Gerstein M, Gotte D, Güldener U, Hegemann JH, Hempel S, Herman Z, et al: Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002, 418: 387-391. 10.1038/nature00935.PubMedView ArticleGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038/75556.PubMedPubMed CentralView ArticleGoogle Scholar
- Dwight SS, Harris MA, Dolinski K, Ball CA, Binkley G, Christie KR, Fisk DG, Issel-Tarver L, Schroeder M, Sherlock G, Sethuraman A, Weng S, Botstein D, Cherry JM: Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO). Nucleic Acids Res. 2002, 30: 69-72. 10.1093/nar/30.1.69.PubMedPubMed CentralView ArticleGoogle Scholar
- Sauer U, Heinemann M, Zamboni N: Genetics. Getting closer to the whole picture. Science. 2007, 316: 550-551. 10.1126/science.1142502.PubMedView ArticleGoogle Scholar
- Wagner A, Wright J: Alternative routes and mutational robustness in complex regulatory networks. Biosystems. 2007, 88: 163-172. 10.1016/j.biosystems.2006.06.002.PubMedView ArticleGoogle Scholar
- Ishii N, Nakahigashi K, Baba T, Robert M, Soga T, Kanai A, Hirasawa T, Naba M, Hirai K, Hoque A, Ho PY, Kakazu Y, Sugawara K, Igarashi S, Harada S, Masuda T, Sugiyama N, Togashi T, Hasegawa M, Takai Y, Yugi K, Arakawa K, Iwata N, Toya Y, Nakayama Y, Nishioka T, Shimizu K, Mori H, Tomita M: Multiple high-throughput analyses monitor the response of E. coli to perturbations. Science. 2007, 316: 593-597. 10.1126/science.1132067.PubMedView ArticleGoogle Scholar
- Choi JK, Kim SC, Seo J, Kim S, Bhak J: Impact of transcriptional properties on essentiality and evolutionary rate. Genetics. 2007, 175: 199-206. 10.1534/genetics.106.066027.PubMedPubMed CentralView ArticleGoogle Scholar
- Promislow D: A regulatory network analysis of phenotypic plasticity in yeast. Am Nat. 2005, 165: 515-523. 10.1086/429161.PubMedView ArticleGoogle Scholar
- Fraser HB: Modularity and evolutionary constraint on proteins. Nat Genet. 2005, 37: 351-352. 10.1038/ng1530.PubMedView ArticleGoogle Scholar
- Flowers JM, Sezgin E, Kumagai S, Duvernell DD, Matzkin LM, Schmidt PS, Eanes WF: Adaptive evolution of metabolic pathways in Drosophila. Mol Biol Evol. 2007, 24: 1347-1354. 10.1093/molbev/msm057.PubMedView ArticleGoogle Scholar
- Yip KY, Yu H, Kim PM, Schultz M, Gerstein M: The tYNA platform for comparative interactomics: a web tool for managing, comparing and mining multiple networks. Bioinformatics. 2006, 22: 2968-2970. 10.1093/bioinformatics/btl488.PubMedView ArticleGoogle Scholar
- Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, Galibert F, Hoheisel JD, Jacq C, Johnston M, Louis EJ, Mewes HW, Murakami Y, Philippsen P, Tettelin H, Oliver SG: Life with 6000 genes. Science. 1996, 274: 546-567. 10.1126/science.274.5287.546.PubMedView ArticleGoogle Scholar
- Rokas A, Williams BL, King N, Carroll SB: Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature. 2003, 425: 798-804. 10.1038/nature02053.PubMedView ArticleGoogle Scholar
- Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES: Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature. 2003, 423: 241-254. 10.1038/nature01644.PubMedView ArticleGoogle Scholar
- Cherry JM, Adler C, Ball C, Chervitz SA, Dwight SS, Hester ET, Jia Y, Juvik G, Roe T, Schroeder M, Weng S, Botstein D: SGD: Saccharomyces Genome Database. Nucleic Acids Res. 1998, 26: 73-79. 10.1093/nar/26.1.73.PubMedPubMed CentralView ArticleGoogle Scholar
- Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.Google Scholar
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13: 555-556.PubMedGoogle Scholar
- Hirsh AE, Fraser HB, Wall DP: Adjusting for selection on synonymous sites in estimates of evolutionary distance. Mol Biol Evol. 2005, 22: 174-177. 10.1093/molbev/msh265.PubMedView ArticleGoogle Scholar
- Ghaemmaghami S, Huh WK, Bower K, Howson RW, Belle A, Dephoure N, O'Shea EK, Weissman JS: Global analysis of protein expression in yeast. Nature. 2003, 425: 737-741. 10.1038/nature02046.PubMedView ArticleGoogle Scholar
- Holstege FC, Jennings EG, Wyrick JJ, Lee TI, Hengartner CJ, Green MR, Golub TR, Lander ES, Young RA: Dissecting the regulatory circuitry of a eukaryotic genome. Cell. 1998, 95: 717-728. 10.1016/S0092-8674(00)81641-4.PubMedView ArticleGoogle Scholar
- Sharp PM, Li WH: The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15: 1281-1295. 10.1093/nar/15.3.1281.PubMedPubMed CentralView ArticleGoogle Scholar
- Coghlan A, Wolfe KH: Relationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae. Yeast. 2000, 16: 1131-1145. 10.1002/1097-0061(20000915)16:12<1131::AID-YEA609>3.0.CO;2-F.PubMedView ArticleGoogle Scholar
- Wu G, Culley DE, Zhang W: Predicted highly expressed genes in the genomes of Streptomyces coelicolor and Streptomyces avermitilis and the implications for their metabolism. Microbiology. 2005, 151: 2175-2187. 10.1099/mic.0.27833-0.PubMedView ArticleGoogle Scholar
- Carbone A, Zinovyev A, Kepes F: Codon adaptation index as a measure of dominating codon bias. Bioinformatics. 2003, 19: 2005-2015. 10.1093/bioinformatics/btg272.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.