- Open Access
Relationship between the tissue-specificity of mouse gene expression and the evolutionary origin and function of the proteins
© Freilich et al.; licensee BioMed Central Ltd. 2005
- Received: 17 February 2005
- Accepted: 11 May 2005
- Published: 29 June 2005
The combination of complete genome sequence information with expression data enables us to characterize the relationship between a protein's evolutionary origin or functional category and its expression pattern. In this study, mouse proteins were assigned into functional and phyletic groups and the gene expression patterns of the different protein groupings were examined by microarray analysis in various mouse tissues.
Our results suggest that the proteins that are universally distributed in all tissues are predominantly enzymes and transporters. In contrast, the tissue-specific set is dominated by regulatory proteins (signal transduction and transcription factors). An increased tendency to tissue-specificity is observed for metazoan-specific proteins. As the composition of the phyletic groups highly correlates with that of the functional groups, the data were tested in order to determine which of the two factors - function or phyletic age - is dominant in shaping the expression profile of a protein. The observed differences in expression patterns of genes between functional groups were found mainly to reflect their different phyletic origin. The connection between tissue specificity and phyletic age cannot be explained by the recent rate of evolution. Finally, although metazoan-specific proteins tend to be tissue-specific compared with phyletically conserved proteins present in all domains of life, many such 'universal' proteins are also tissue-specific.
The minimal cellular transcriptome of the metazoan cell differs from that of the ancestral unicellular eukaryote: new functions were added (metazoan-specific proteins), whilst other functions became specialized and no longer took place in all cells (tissue-specific pre-metazoan proteins).
- Additional Data File
- Signal Transduction Protein
- Mouse Protein
- Universal Protein
- Observe Test Statistic
Higher animals are characterized by differentiated tissue types, where each tissue has its own unique cellular composition and physiological function. Comparative genomic studies have shown that the evolution of the metazoan lineage involves the expansion of those specific protein families known to participate in cellular communication and transcriptional regulation [1, 2]. However, at the cellular level, it is not yet clear how processes taking place in specific tissues relate to similar processes that took place in the ancestral unicellular species. The recent availability of fully sequenced genomes, together with analysis platforms capable of generating 'global' profiles, enables us not only to identify those proteins that are unique to multicellular species but also to examine their contribution to tissue diversity. We can now study the protein content of a mammalian tissue in comparison with the protein content of unicellular organisms. To what extent does the differentiation process involve gaining new functions and to what extent does it involve specialization of pathways that existed in a unicellular ancestor? Will 'young' proteins (that is, proteins that are unique to multicellular species) exhibit a different expression pattern than 'ancient' or universal proteins?
Recent studies have related several characteristics of a protein to its expression profile. Subramanian and Kumar  have shown a connection between a protein's phyletic age and the intensity of expression, as measured by the number of expressed sequence tags. Lehner and Fraser  showed that protein domains differ in their tendency to be specifically or widely expressed and that many of the tissue-specific domains are metazoan-specific. Tissue-specific genes evolve more rapidly than broadly expressed ones [5–7]. We have studied the relationship between the phyletic age of a protein and its expression profile, and related this to the function of the protein. The term 'phyletic age' used here describes an estimated point in time when a protein integrated into the mouse genome. The universal and eukaryotic specific phyletic groups include proteins that are estimated to be found in the ancestral mouse genome before the transition from unicellularity to multicellularity. The metazoan-specific and mammalian-specific protein groups describe those proteins that are estimated to be integrated into the mouse genome after the transition. As the phyletic protein groups differ in their functions we wanted to determine whether a protein's expression profile better reflects function or age.
Finally, we wanted to verify that the phyletic age of a protein is indeed a major factor in shaping its expression profile rather than merely a reflection of the level of conservation in a protein - a factor that has already been shown to play a role in determining expression [5–7]. To rule out the possibility that the connection between age and expression is spurious due to the misclassification of rapidly evolving genes and the connection between tissue expression and recent rate of evolution, Subramanian and Kumar  showed that the connection still exists in a slowly evolving set of data. However, the assumption that the slow rate of evolution of the genes assumes a correct age classification may not hold if there has been a change in rate during their evolutionary history, for example, diversifying selection followed by conservation after a gene duplication . In this paper, we propose a direct test to show that the connection between phyletic age and tissue expression of a gene cannot be explained by the connection between rate and tissue expression alone, a test which does not assume homotachy and makes use of all the available data.
Comparing expression patterns within different tissues
Comparing expression patterns within functional and phyletic categories
We further studied the contribution of different functional and phyletic groups to tissue variation. Are some functional categories more tissue-specific than others? We examined the expression profile of proteins from the four functional categories within 14 different mouse tissues. For each group, we calculated the fraction of its protein members expressed in one tissue, two tissues, and so on (Figure 3b). Surprisingly, less than one-third of the enzymes and transporters are ubiquitously expressed in all tissues examined. The fraction is even lower for the other functional groups where only about one-tenth of the transcription factors and signal transduction proteins are expressed in all tissues examined. Two different patterns of expression can be observed: the relative abundance of enzymes and transporter proteins is higher among proteins that are ubiquitously expressed; in contrast, a larger fraction of transcription factors and signal transduction proteins are tissue-specific.
The distribution of function within the phyletic groups
Phyletic groups/functional groups
Total functionally annotated proteins (%)
Transcription factors, %
Signal transduction, %
The inter-relationship between function, 'phyletic age' and expression
In order to identify whether a protein's expression pattern better reflects function or age, the inter-relationship between these three factors was compared statistically. After phyletic age was taken into account, only a weak dependence between function and tissue specificity was detected (test statistic 264.7, p value 0.04), suggesting that most of the relationship observed between function and tissue specificity is accounted for by the age of the gene. The relationship between phyletic age and tissue specificity is not explained by a gene's function (test statistic 339.1, p < 0.0001), nor is the relationship between phyletic age and function explained by the tissue specificity (test statistic 967.2, p << 0.0001). Therefore, the results imply that enzymes tend to be ubiquitously expressed mainly due to their phyletic universal origin (rather than due to their functional classification).
From the expression distribution of metabolic proteins (enzymes and transporters, Figure 4a), one can observe (as can be inferred from the statistical test reported above) obvious differences between the 'older' pre-metazoan proteins (universal and eukaryote-specific groups) and the more recent metazoan proteins. Differences can be observed for the regulatory proteins as well (Figure 4b): metazoan-specific proteins tend to be more tissue-specific compared with pre-metazoan ones, regardless of their functional class. A notable difference between the expression patterns in Figure 4a and 4b occurs for specifically expressed pre-metazoan proteins, where the proportion of regulatory proteins is much higher than metabolic proteins, and it is not significantly different from the fraction of metazoan-specific proteins. This confirms that the function has some influence on expression independent of age but suggests that the effect is stronger in specifically expressed pre-metazoan proteins.
Yet, although pre-metazoan proteins tend to be more widely expressed, less than one-third of the pre-metazoan metabolic proteins are expressed in all tissues. Ldhc (testis-specific lactate dehydrogenase) is one example of a universal enzyme whose expression is limited to few cell types in mammals. Ldh participates in anaerobic glycolysis - a nearly universal pathway that converts glucose into pyruvate. The sequence of reactions in the pathway is similar in all organisms and in all cell types. In contrast, the fate of pyruvate is variable. In a variety of microorganisms, lactate is normally formed from pyruvate in a reaction catalyzed by Ldh. In higher organisms, most cells do not convert pyruvate to lactate and the reaction is limited to few tissues . In germ cells, where lactate is a preferred energy source , we observe specific expression of Ldhc (testis-specific expression). The expression of Ldhc is an example of a function occurring in the ancestral unicellular cell that becomes tissue-specific in multicellular species.
The testis-specific expression of two other universal enzymes in our dataset - glucose-6-phosphate dehydrogenase 2 (G6pd-2) and phosphoglycerate kinase 2 (Pgk-2) - provides a different example for a specific expression of universal enzymes. G6pd-2 and Pgk-2 are believed to arise from their isoenzymes, G6pd and Pgk-1, respectively, by a gene duplication event. G6pd and Pgk-1 are essential, widely expressed, X chromosome-encoded genes. The absence of those two enzymes during the inactivation of the X chromosome in postmeiotic spermatogenic cells is compensated for by the expression of their autosomal testis-specific isoenzymes G6pd-2 and Pgk-2 [11, 12]. Duplication events can therefore explain some of the cases where universal enzymes are specifically expressed.
The inter-relationship between 'phyletic age', evolutionary rate and expression
Tissue-specific genes tend to evolve more rapidly than broadly expressed ones [5–7]. Therefore, difficulties might arise in identifying their distant homologs, leading to a correlation between age and rate of evolution. We wanted to verify that the expression patterns observed here cannot be explained purely in terms of variation in the recent rate of evolution, and so we have studied the expression profile of different phyletic groups in a subset of the data where all phyletic groups have evolved at approximately the same rate. For each protein in our dataset we calculated an evolutionary rate by measuring the Ka/Ks ratio with its ortholog in rat (see Methods). The chi-squared test statistic for the independence of age and tissue expression given rate in our data was 226.5, whereas the maximum observed statistic in 10,000 random draws, generated as described in Methods, was 84.3. The connection between phyletic age and tissue specificity that we observed in our data cannot be explained purely in terms of both factors' mutual correlation with the recent rate of evolution.
It is important to remember that our analysis is based only on those proteins that are present on the Affymetrix chip and have GO annotation. Our dataset covers approximately one-quarter of mouse proteins. Clearly, a better coverage for the expression and annotation of proteins is desirable and could change the conclusion presented below. In order to decrease the probability that our results are arbitrary, we repeated the experiment with a different set of tissues (seven components of the gastrointestinal tract). The results obtained are compatible with the observations we report here (data not shown).
We show here that multicellular specific proteins tend to be more tissue-specific than 'ancient' universal proteins. Most of the 'late' evolutionary proteins are transcription factors and signal transduction proteins, categories that have previously been suggested to play a crucial role in tissue differentiation. However, our analysis suggests that more recent enzymes and transporters also contribute to tissue diversity as many of them are tissue-specific (Figure 4a). The selective expression pattern of recent genes implies that a new protein is often selected to perform a tissue-specific function rather than a global one. A greater evolutionary flexibility of tissue-specific proteins is compatible with previous studies suggesting that tissue-specific proteins evolve more rapidly [5–7] due to less strict functional constraints compared with broadly expressed proteins [5, 13].
Despite this trend, many metazoan-specific proteins are ubiquitous and many universal proteins are tissue-specific. The minimal cellular transcriptome of the metazoan cell differs from that of the ancestral unicellular eukaryote: new functions were added (metazoan-specific proteins), whilst other functions became specialized and no longer took place in all cells (tissue-specific pre-metazoan proteins). The extent of the cellular specialization can be implied from the observation that only one-third of the proteins are expressed in all tissues examined. In some of these cases, functions occurring in the unicellular cell become tissue-specific in multicellular species. In other cases, universal genes that have been duplicated become specific to a tissue whilst a second copy maintains its original expression pattern. Only about one-third of the pre-metazoan metabolic enzymes are expressed in all tissues. Tissue differentiation is at least in part achieved by tissue specialization of metabolism - either by differentially expressing two-thirds of the ancient metabolic proteins, or by encoding new metabolic proteins. Presumably, the additional transcription-related proteins provide the necessary control. We aim to further characterize the expression patterns of processes that exist in the ancestral metazoa and those that are specific to metazoa. In particular, we are interested in studying the contribution of function differentiation versus gene duplication to tissue diversity in multicellular species.
Expression profile determination
Tissues were dissected and snap frozen from 8-12-week old C57/BL6 male mice with the exception of the ovaries, which were taken from females. Tissue was pooled from between four to six animals, the RNA extracted and 10 mg of total RNA was labeled and then hybridized to the Affymetrix U74AV2 GeneChip using standard protocols (Affymetrix Inc, CA, USA); complete experimental details for each of these stages are given at . Expression values and presence/absence flags were generated from the CEL files using the Microarray Suite 5.0 package (Affymetrix MAS 5.0) and its default settings. The global scaling normalization method operated with a target value setting of 100. Expression data flagged with a marginal call was excluded from further analysis. The quality of the microarray data was assessed first using the parameters defined in the report file generated by MAS 5.0, next through recording the percentage of outliers reported by dCHIP on calculating expression values  and finally by using the Affy package in the BioConductor suite of microarray analysis programs to generate degradation plots . Chips failing the quality control parameters recommended by the authors of these programs were omitted from further analysis. The microarray data (accession ID = E-HGMP-2) used in this study is now available to download from ArrayExpress .
A subset of 14 samples representing distinct non-redundant organs was chosen out of the complete dataset. The tissues are listed in Figure 2. The use of different cut-offs within the range of 0.025 <P value < 0.075 for absent/present flags labeling has no effect on the analysis. Only absent/present calls were used to define tissue-specificity and expression levels have not been a factor in this analysis.
Mapping probe sets to mouse proteins
Out of 12,487 probe sets, 8,218 were mapped into EnsEmbl mouse transcripts using Ensmart  (version 13.1, ). Mapping of EnsEmbl transcripts to SWISS-PROT  (release 41.25) and TrEmbl proteins (Release 24.13) were obtained from the International Protein Index (IPI, ). Proteins represented by more than a single probe set were discarded in order to avoid re-counting. Different proteins sharing the same probe sets were also eliminated (with the exception of splice variants). A single and unique probe set therefore represents each of the 6,242 remaining proteins.
Mouse protein functional annotation
A total of 4,918 proteins were assigned with a GO annotation . For simplicity and in order to avoid overlaps between the functional categories, we studied the tissue distribution of proteins assigned to four main categories from the highest hierarchy level of the functional classification: enzymatic activity, transporters, transcription regulation and signal transduction. Proteins assigned to more than a single category were discarded, leaving 2,686 proteins distributed as follows: 1,400 enzymes, 384 transporters, 617 proteins involved in signal transduction and 285 proteins that regulate transcription. The functional assignments are available from .
Mouse protein phyletic assignment
We used four categories to describe the evolutionary origin of mouse proteins: universal proteins, that is, ubiquitous in the three domains of life (bacteria, archaea and eukaryotes), eukaryote-specific proteins, metazoan-specific proteins and mammalian-specific proteins. The 6,242 mouse proteins were classified into the phyletic categories according to the results of a BLAST  search against 146 fully sequenced species. A protein could only be assigned to a single category. The classification process is hierarchical: proteins with hits to more than five prokaryote species are classified as universal; the remaining mouse proteins with at least a single hit to non-metazoan eukaryotes are classified as eukaryote-specific; the remaining mouse proteins with at least a single hit to non-mammalian metazoa are classified as metazoan-specific; and, finally, proteins recognizing only other mammalian proteins are classified as mammalian-specific. The cut-off used was BLAST e-score < 1e-3. Genomes were downloaded from the COGENT  database (release 152).
The observations reported here are maintained using different cut-offs within the range of 1e-10 < e-score < 1e-1. The observations are also maintained when a universal protein is defined as a protein with a hit in at least a single prokaryote species or when it defined as a protein with a hit in at least ten prokaryote species (examined under e-value cut-off of e-score < 1e-3). Additional tests were performed in order to assure that the phyletic distribution truly describes a complete sequence distribution rather than domain distribution. The classification of a protein to a phyletic group was done when additional filters were added. This was in order to discard those cases where a match between query and hit is based only on recognition of a conserved domain rather than a complete sequence (e-score <1e-3). Firstly, pfam domain composition: all query-hit pairs that do not share an identical pfam domain composition were discarded from our dataset. Secondly, full coverage: all query-hit pairs where the alignment does not cover the full length (80%) of both proteins were discarded from our dataset.
When repeating the analysis with the filtered data, the results confirm that the trends reported here are maintained (not shown). Similar results were also obtained when using the homologous clusters database STRING  for a phyletic classification (universal proteins have to be recognized in at least five prokaryote species). The 6,242 proteins are distributed in phyletic categories as follows: 1,428 universal proteins, 2,088 eukaryote specific proteins, 1,567 metazoan proteins, 1,123 mammalian-specific proteins; and 36 unclassified proteins. The phyletic assignments are available from .
Mouse and rat 1:1 ortholog pairs were obtained from EnsEmbl . In those cases where a mouse protein had more than a single rat ortholog it was discarded from the analysis unless one of the ortholog pairs was annotated as 'best reciprocal hit' (BRH). The ratio of Ka (the number of non-synonymous substitutions per non-synonymous site) to Ks (the number of synonymous substitutions per synonymous site) was calculated using the codeml program from the PAML 3.13d package . Two sequences had one or fewer nucleotide mutations and so their Ka/Ks ratio could not be reliably estimated. These sequences were discarded from the analysis. In total, the Ka/Ks ratio was calculated for 4,056 mouse proteins from the 5,501 proteins classified to a phyletic category and expressed in at least a single tissue (as shown in Figure 3c).
By grouping the genes into equal bins of similar recent rates of evolution (Ka/Ks values) and then shuffling the phyletic ages within each group, sample sets of data can be created which have no connection between phyletic age and tissue expression other than through their common connection to recent rate of evolution. The connection between rate and tissue specificity in each of these sample sets is identical to the observed data and, because all genes in each bin have a similar rate of evolution, the connection between age and rate is similar to the observed data. By generating random samples in the manner as described above, the expected contingency table of age/expression dependence and the null distribution of the chi-squared test statistic can be estimated. Using the estimated expected table, the chi-squared statistic for the observed data can be calculated. The significance of the observation was assessed by comparing the observed test statistic with those from 10,000 sets of data, of equal size to the observed data, randomly generated according to the expected contingency table (and so satisfying the null hypothesis).
The relationship between tissue specificity and phyletic age and function was investigated using a contingency table test under the null hypothesis that function and specificity are independent of given age. Conceptually, the genes are divided up according to age and a separate contingency table for specificity and function is formed for each group. The chi-squared test statistic  for independence between function and specificity is calculated for each table and then pooled, weighted by the proportion of genes of each age, to give the test statistic for independence between function and specificity given age. The dependence between specificity and age given function, and age and function given specificity, were calculated similarly. The tables analyzed had cells expected to contain a small number of observations, so it was inappropriate to assess the significance of the test statistic using tables of pre-calculated critical values. Instead, 10,000 sets of data, of equal size to that observed, were generated in accordance to the expected contingency table and the test statistics of these were used to form an estimate of their distribution under the null hypothesis, to which the observed test statistic can be compared.
The following additional data are available with the online version of this paper. Additional data file 1 contains the functional assignments of the proteins used in the analysis. Additional data file 2 contains the phyletic assignments of the proteins used in the analysis. Additional data file 3 contains the sequences of the proteins used in the analysis.
We thank Eric Blanc for his suggestions on microarray data analysis and Christian Von Mering for help with using the STRING database. Shiri Freilich is supported by EMBL fellowship. Tim Massingham is supported by BBRC grant 721/BEP17055. Tom Freeman, Paul Lyons and Sumit Bhattacharyya are supported by the UK MRC.
- Chervitz SA, Aravind L, Sherlock G, Ball CA, Koonin EV, Dwight SS, Harris MA, Dolinski K, Mohr S, Smith T, et al: Comparison of the complete protein sets of worm and yeast: orthology and divergence. Science. 1998, 282: 2022-2028. 10.1126/science.282.5396.2022.PubMedPubMed CentralView ArticleGoogle Scholar
- Aravind L, Subramanian G: Origin of multicellular eukaryotes - insights from proteome comparisons. Curr Opin Genet Dev. 1999, 9: 688-694. 10.1016/S0959-437X(99)00028-3.PubMedView ArticleGoogle Scholar
- Subramanian S, Kumar S: Gene expression intensity shapes evolutionary rates of the proteins encoded by the vertebrate genome. Genetics. 2004, 168: 373-381. 10.1534/genetics.104.028944.PubMedPubMed CentralView ArticleGoogle Scholar
- Lehner B, Fraser AG: Protein domains enriched in mammalian tissue-specific or widely expressed genes. Trends Genet. 2004, 20: 468-472. 10.1016/j.tig.2004.08.002.PubMedView ArticleGoogle Scholar
- Duret L, Mouchiroud D: Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol Biol Evol. 2000, 17: 68-74.PubMedView ArticleGoogle Scholar
- Winter EE, Goodstadt L, Ponting CP: Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. Genome Res. 2004, 14: 54-61. 10.1101/gr.1924004.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang L, Li WH: Mammalian housekeeping genes evolve more slowly than tissue-specific genes. Mol Biol Evol. 2004, 21: 236-239. 10.1093/molbev/msh010.PubMedView ArticleGoogle Scholar
- Hughes ALP: Adaptive evolution of genes and genomes. 1999, New York: Oxford University PressGoogle Scholar
- Stryer L: Biochemistry. 1995, New York, NY: Freeman, 4Google Scholar
- Goddard I, Florin A, Mauduit C, Tabone E, Contard P, Bars R, Chuzel F, Benahmed M: Alteration of lactate production and transport in the adult rat testis exposed in utero to flutamide. Mol Cell Endocrinol. 2003, 206: 137-146. 10.1016/S0303-7207(02)00433-1.PubMedView ArticleGoogle Scholar
- Hendriksen PJ, Hoogerbrugge JW, Baarends WM, de Boer P, Vreeburg JT, Vos EA, van der Lende T, Grootegoed JA: Testis-specific expression of a functional retroposon encoding glucose-6-phosphate dehydrogenase in the mouse. Genomics. 1997, 41: 350-359. 10.1006/geno.1997.4673.PubMedView ArticleGoogle Scholar
- Boer PH, Adra CN, Lau YF, McBurney MW: The testis-specific phosphoglycerate kinase gene pgk-2 is a recruited retroposon. Mol Cell Biol. 1987, 7: 3107-3112.PubMedPubMed CentralView ArticleGoogle Scholar
- Hastings KE: Strong evolutionary conservation of broadly expressed protein isoforms in the troponin I gene family and other vertebrate gene families. J Mol Evol. 1996, 42: 631-640.PubMedView ArticleGoogle Scholar
- Affymetrix GeneChip® probe array methods. [http://www.hgmp.mrc.ac.uk/Research/Microarray/Affymetrix_Genechip/protocols_affymetrix.jsp]
- Li C, Wong WH: Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc Natl Acad Sci USA. 2001, 98: 31-36. 10.1073/pnas.011404098.PubMedPubMed CentralView ArticleGoogle Scholar
- Bioconductor. [http://www.bioconductor.org]
- ArrayExpress. [http://www.ebi.ac.uk/arrayexpress]
- Kasprzyk A, Keefe D, Smedley D, London D, Spooner W, Melsopp C, Hammond M, Rocca-Serra P, Cox T, Birney E: EnsMart: a generic system for fast and flexible access to biological data. Genome Res. 2004, 14: 160-169. 10.1101/gr.1645104.PubMedPubMed CentralView ArticleGoogle Scholar
- Ensembl MartView. [http://www.ensembl.org/Multi/martview]
- Bairoch A, Apweiler R: The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000, 28: 45-48. 10.1093/nar/28.1.45.PubMedPubMed CentralView ArticleGoogle Scholar
- International Protein Index. [http://www.ebi.ac.uk/IPI/IPIhelp.html]
- Camon E, Magrane M, Barrell D, Binns D, Fleischmann W, Kersey P, Mulder N, Oinn T, Maslen J, Cox A, Apweiler R: The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro. Genome Res. 2003, 13: 662-672. 10.1101/gr.461403.PubMedPubMed CentralView ArticleGoogle Scholar
- Supplemental data. [http://www.ebi.ac.uk/~shirigo/gb_sup]
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMedPubMed CentralView ArticleGoogle Scholar
- Janssen P, Enright AJ, Audit B, Cases I, Goldovsky L, Harte N, Kunin V, Ouzounis CA: COmplete GENome Tracking (COGENT): a flexible data environment for computational genomics. Bioinformatics. 2003, 19: 1451-1452. 10.1093/bioinformatics/btg161.PubMedView ArticleGoogle Scholar
- von Mering C, Huynen M, Jaeggi D, Schmidt S, Bork P, Snel B: STRING: a database of predicted functional associations between proteins. Nucleic Acids Res. 2003, 31: 258-261. 10.1093/nar/gkg034.PubMedPubMed CentralView ArticleGoogle Scholar
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13: 555-556.PubMedGoogle Scholar
- Howell DC: Statistical methods for psychology. 1992, Belmont, CA: Duxbury Press, 4thGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/2.0