Iterative orthology prediction uncovers new mitochondrial proteins and identifies C12orf62 as the human ortholog of COX14, a protein involved in the assembly of cytochrome coxidase
- Radek Szklarczyk†1Email author,
- Bas FJ Wanschers†1, 2,
- Thomas D Cuypers1, 3,
- John J Esseling4,
- Moniek Riemersma2,
- Mariël AM van den Brand2,
- Jolein Gloerich5,
- Edwin Lasonder1,
- Lambert P van den Heuvel2,
- Leo G Nijtmans2 and
- Martijn A Huynen1Email author
© Szklarczyk et al.; licensee BioMed Central Ltd. 2012
Received: 29 November 2011
Accepted: 22 February 2012
Published: 22 February 2012
Orthology is a central tenet of comparative genomics and ortholog identification is instrumental to protein function prediction. Major advances have been made to determine orthology relations among a set of homologous proteins. However, they depend on the comparison of individual sequences and do not take into account divergent orthologs.
We have developed an iterative orthology prediction method, Ortho-Profile, that uses reciprocal best hits at the level of sequence profiles to infer orthology. It increases ortholog detection by 20% compared to sequence-to-sequence comparisons. Ortho-Profile predicts 598 human orthologs of mitochondrial proteins from Saccharomyces cerevisiae and Schizosaccharomyces pombe with 94% accuracy. Of these, 181 were not known to localize to mitochondria in mammals. Among the predictions of the Ortho-Profile method are 11 human cytochrome c oxidase (COX) assembly proteins that are implicated in mitochondrial function and disease. Their co-expression patterns, experimentally verified subcellular localization, and co-purification with human COX-associated proteins support these predictions. For the human gene C12orf62, the ortholog of S. cerevisiae COX14, we specifically confirm its role in negative regulation of the translation of cytochrome c oxidase.
Divergent homologs can often only be detected by comparing sequence profiles and profile-based hidden Markov models. The Ortho-Profile method takes advantage of these techniques in the quest for orthologs.
From the publication of the first genome sequences, the identification of orthologs has been a central theme in comparative genomics . Functional genomics as well as genome annotation have greatly benefited from the wealth of experimental data available for model species. To formulate hypotheses about gene functions in remaining organisms, including human, it is necessary to unambiguously resolve the phylogenetic relationships among homologs . The detection of homology, and therewith also orthology, can be crippled by the lack of detectable sequence similarity. Large evolutionary distances, high rates of sequence evolution, low complexity regions and short protein length can preclude homology detection by pairwise sequence similarity approaches such as FASTA or BLAST [3, 4]. More sensitive methods can detect remote homologs by replacing general amino acid similarity matrices with position-specific vectors of amino acid frequencies in a profile-to-sequence comparison (PSI-BLAST)  or in a profile-to-profile comparison . Profile-based hidden Markov models (HMM) additionally contain information about insertions and deletions and enable the detection of even more remote homologs , especially in HMM-to-HMM comparisons .
Homology is widely used to transfer information on protein function from model species. For example, homologs of yeast mitochondrial proteins have been used to predict mitochondrial proteins in human , and homology-based presence-absence patterns of genes have been applied to subcellular localization prediction . However, assigning subcellular localization based on solely the homology criterion leads to a high false discovery rate of 38% . For larger evolutionary distances (homology with proteins from Rickettsia prowazekii, a bacterial relative of mitochondria) inferring subcellular localization based on the homology criterion yields an estimated 73% false positives , rendering homology of limited value for localization prediction. Additionally, evolutionary events such as gene duplications often prompt a change of subcellular localization, while one-to-one orthologs tend to localize to the same compartment . This suggests that orthology relationships are more reliable to infer the localization of proteins than just homology relationships. Indeed, manual analyses of orthology relationships between mitochondrial protein complexes from yeast and human [13–17] and automated analyses of complex membership in general  have confirmed that orthologous proteins remain involved in the same protein complexes. Importantly, profile-based methods have detected homology between proteins from the same mitochondrial complex in various species that went undetected by pairwise sequence comparison methods. For example, profile-based methods were crucial in the detection of a number of subunits of the NADH:ubiquinone oxidoreductase (complex I) [13, 14, 17, 19, 20], the mitochondrial ribosome [16, 21] and the mitochondrial Holliday junction resolvase domain . Such ad hoc procedures have, however, not been systematically assessed for their quantitative contribution and qualitative reliability in the large-scale detection of orthology relationships.
To include profiles in large-scale orthology inference, we introduce a three-phase procedure (Ortho-Profile) that applies reciprocal best hits at the sequence-to-sequence, the profile-to-sequence and finally the profile-based HMM-to-HMM level. To test the quality of our orthology assignment, we use protein subcellular localization, an important aspect of protein function that has been established experimentally in a number of species and is amenable to large-scale analysis. Mitochondrial localization has been established on a genome-wide scale (as well as in small-scale experiments) for proteins in Saccharomyces cerevisiae  and Schizosaccharomyces pombe . The mitochondrial proteins of these distant eukaryotic relatives have previously been used as models for mammalian mitochondrial proteins and for systematic predictions of human mitochondrial disease genes .
In the analysis presented here the fungal mitochondrial proteins serve as a starting point for large-scale orthology prediction in human. Of the one-to-one orthologs predicted between fungal mitochondrial proteins and human, 181 proteins have to date not been shown to localize to mitochondria in human (Table S6 in Additional file 1). For 15 proteins we find corroborating evidence for their mitochondrial localization using a probabilistic analysis of genome-wide data from Pagliarini and co-workers .
Cytochrome c oxidase (COX) is a 13-subunit enzyme complex in mammals that catalyzes the terminal step of the mitochondrial respiratory chain, accepting electrons from cytochrome c and passing them to molecular oxygen, producing water. Early biochemical analyses of COX defects in human have suggested that most COX deficiencies stem from decreased stability or failure to complete assembly of the holoenzyme [26, 27]. Defects in the assembly process cause severe neuromuscular disorders in human, the so-called mitochondrial encephalomyopathies . The identification of human orthologs of yeast COX assembly factors has helped to identify pathogenic mutations in human , including the first mutations in a nuclear gene involved in human COX deficiency, SURF1 [30, 31]. The Ortho-Profile method contributed to the recent identification of a mutation in the COA5 (previously C2orf64) gene that leads to COX deficiency , but causal genes for many other disease cases are still not known. In this work we predict 11 candidates for COX assembly factors and confirm the mitochondrial localization of four human candidate proteins. The high co-expression of the majority of the candidates with mammalian oxidative respiratory complexes as well as co-purifications of the candidates with COX-associated proteins give additional weight to our predictions. We experimentally confirm the role of C12orf62 as a COX assembly factor binding to COX1 and acting as its translation regulator.
Results and discussion
In the sequence-to-sequence phase  the method uses BLAST to search for a homolog of a S. cerevisiae protein in human and tests whether the human protein is also the reciprocally best (most similar and statistically significant) homolog in S. cerevisiae. If no homolog is found in the sequence-to-sequence phase (see Materials and methods for details), a profile search is initiated to increase the sensitivity. In the profile-to-sequence search PSI-BLAST  is employed, using the nr database of protein sequences (Materials and methods). If no statistically significant (E < 0.01) human homolog of a S. cerevisiae mitochondrial protein has been found among the PSI-BLAST hits in the first iteration, subsequent iterations extend the profile with (non-human) homologs that have been found (inclusion threshold 0.005). Up to three iterations are carried out to find a statistically significant human homolog. If a human homolog of the S. cerevisiae mitochondrial protein has been identified, a reciprocal search is carried out. The reciprocal search starts with the human protein to find a yeast homolog, again in up to three PSI-BLAST iterations. To satisfy the bi-directional best hit criterion, the first statistically significant S. cerevisiae gene to be encountered in the reciprocal search phase should be the original query gene. Finally, if no bi-directional best hit has been detected in the profile-to-sequence phase, a profile-to-profile search is carried out to increase sensitivity even further. The HMM phase operates on the databases of HMMs built for each protein sequence from fungal and human genomes using homologs in a wide range of species (see Materials and methods for details). The profile-representing HMM for the S. cerevisiae protein is retrieved from an HMM database. Subsequently, the database of HMMs that represent human proteins is searched for a homologous HMM using HHsearch . Analogous to the first phases, if a homologous best hit is found, a reciprocal HMM search is carried out, by comparing the HMM that contains the human protein with the S. cerevisiae HMM database. The same iterative procedure is carried out for S. pombe mitochondrial proteins.
Subcellular localization of human orthologs of yeast mitochondrial proteins
Accuracy of the pipeline
A number of benchmarks indicate the high quality of the orthology prediction. Firstly, the method recovers all but one manually annotated human ortholog of the small and the large subunits of the fungal mitochondrial ribosome, 51 proteins in total. Also, for all but one S. pombe mitoribosomal fusion protein, orthology relationships were resolved correctly when compared to the phylogeny-based orthology prediction . Benchmarking with a manually curated ortholog inventory of S. pombe and S. cerevisiae  shows that orthologs of human proteins in the two yeasts are consistent with the curated inventory in 95% cases, that is, the manually curated S. pombe-S. cerevisiae orthologs are orthologous to the same human protein (see Materials and methods for details). A domain composition analysis using PFAM  reveals that 84% of the predicted orthologs have an identical domain composition in human and fungi (504/598, including 5% of orthologs that have no detectable protein domains), corroborating the orthology prediction. However, domain composition data on their own, without inferred orthology, are not a strong predictor of subcellular localization (Materials and methods).
There are 417 human orthologs of fungal mitochondrial proteins that localize to mitochondria according to annotations based on experimental data in human and mouse or are part of a compendium of mammalian mitochondrial proteins that is based on integrated experimental data and sequence-based predictions  (Table 1; Table S6 in Additional file 1). This encompasses 70% of the complete set of 598 orthologs that we identified, with 192 proteins (32%) corroborated by both human and mouse localization data. Among the 181 proteins (30%) that are not annotated as mitochondrial in mammals, 92 proteins (15% of all orthologs) are annotated with another subcellular compartment (Table 1). The non-mitochondrial localization may, at least for some proteins, be an indication of dual localization, a phenomenon not uncommon in eukaryotes . Only 20 proteins (3%) have been found in the same non-mitochondrial compartment in both human and mouse (Table 1). The limited number of non-mitochondrial proteins among mammalian orthologs of fungal mitochondrial proteins demonstrates the predictive power of orthology prediction with respect to subcellular localization.
We tested if the reciprocal best hit as well as the homology criteria are actually both necessary for a correct prediction of subcellular localization. For human homologs of fungal mitochondrial proteins that are not reciprocal best hits (212 proteins), and therewith not one-to-one orthologs, only 38 are mitochondrial (18% of non-orthologs compared to 70% orthologs). Out of 212 non-orthologous proteins, 75 are known to localize to other subcellular compartments (35% of the non-orthologous homologs compared to 3% of the orthologs). Among orthologs of fungal mitochondrial proteins there are 4.5 more mitochondrial than non-mitochondrial human proteins (Table 1). For homologs there are two times less mitochondrial than non-mitochondrial ones (Table S7 in Additional file 2), implying that a homology relationship on its own does not predict localization as accurately as orthology. High conservation of localization also holds for more divergent orthologs (detectable only with profile and HMM methods), where homology has limited predictive power (Figure S4 in Additional file 2).
We also evaluated the reciprocal best hit criterion, without homology or a statistically significant similarity required. Protein pairs that fall outside the significance threshold in the sequence-to-sequence comparison (E ≥ 0.01) might still be reciprocal best hits based on their raw BLOSUM similarity scores. Among these reciprocal best hits only 23% of human proteins were annotated as mitochondrial in human. Thus, both homology and reciprocal best criteria are important for high-quality localization prediction.
Orthologs of yeast proteins involved in cytochrome coxidase assembly
Candidate COX assembly factors
Negative translation regulation of COX1 translation
Proteolytic processing of Cox2p and its assembly into COX
Cytochrome oxidase assembly
Required for accumulation of spliced COX1 mRNA
Cytochrome oxidase assembly
Negative regulation of COX1 subunit
UTR translation COX1 regulation
Assembly of COX
Assembly of COX
Assembly of COX
Translation activator of COX1
Putative protein of unknown function
Four predicted COX assembly factors are targeted to, and reside in, mitochondria
COX-associated proteins co-purify with predicted COX assembly factors
Proteins co-purified with candidate COX assembly factors
C12orf62 overexpression reduces COX protein levels
C12orf62 is complexed to COX1
The C12orf62-TAP affinity purification was carried out and analyzed with SDS-PAGE and western blotting. Based on the observation that Cox1 in yeast is found in a complex with COX14 [40, 43], we also tested for a possible co-purification of the human COX1 protein with C12orf62, detecting COX1 in the C12orf62-TAP eluate. Despite low C12orf62-TAP protein levels (possibly caused by limited accessibility of the TAP-tag) the eluate revealed specific interaction with COX1 (Figure 4d).
We introduce the Ortho-Profile method that identifies orthologs in the sequence homology 'twilight zone', where short proteins, high rates of sequence evolution and composition biases make genes' evolutionary relationships difficult to infer. Ortho-Profile, owing to the iterative approach combined with the high sensitivity of profile-to-sequence and HMM-to-HMM searches, allows detection of even remote orthologs and thus complements other large-scale orthology prediction systems. In-paranoid , Ortho-MCL  and phylogeny-based orthology determination [46, 47] are applicable for orthology reconstruction when homology is detectable at the protein sequence level. For more divergent proteins, homology detection based on sequence-profiles is sometimes used (including the presence of PFAM domains) or overlooked. We show that homology does not predict subcellular localization as accurately as orthology, and that orthology confidently predicts localization, even when it is inferred even for very divergent sequences. With no detectable homology in the sequence-to-sequence comparisons, reciprocal best hits at the profile-to-sequence and HMM-to-HMM levels enable orthology inference. Conserved subcellular localization indicates that the quality of inferred orthologs does not reduce significantly for very divergent genes in the 'orthology twilight zone' (Table 1 and Figure S4 in Additional file 2), confirming the accuracy of the presented approach.
We employ subcellular localization, an essential aspect of protein function, to evaluate the quality of orthology prediction. Localization has been established experimentally on the complete proteome-scale in an unbiased manner, independently in both human and fungi, and serves as a proxy for conservation of protein function that is amenable for large-scale analysis. We show that the identification of orthologs is instructive for establishing protein localization and the role of proteins in the cell. The Ortho-Profile method derives 181 new orthology relations between fungal mitochondrial proteins and human (including 59 from profile and HMM phases; Table 1) that have not been previously linked with mitochondria in mammals. As knowledge about the human mitochondrial proteome is not yet complete, many of these orthologs may localize to the organelle, their detection obscured by limited tissue distribution, low protein expression or absence from gene catalogs. These candidates were re-analyzed using a Bayesian framework, integrating the orthology data with co-expression, targeting signal prediction and proteomics data [9, 11]. The analysis suggests 15 additional candidate proteins for the inclusion in the human mitochondrial proteome (Materials and methods; Table S3 in Additional file 2). This is an underestimate of the real number of novel mitochondrial proteins, as even proteins for which we confirm the mitochondrial localization by GFP tagging (C7orf44 and PET117) do not receive strong support from other types of genome-wide data and do not reach the threshold that corresponds to a 10% false discovery rate.
We predict the human COX assembly candidates based on orthology with S. cerevisiae proteins and provide experimental validation for their subcellular localization (Table 3). An important difference between mitochondrial COX genes of mammals and of yeast is that the latter include introns, and a number of COX assembly factors in yeast are actually involved in splicing. Consistently, we do not detect orthologs of these splicing genes in the human genome (Supplemental Methods in Additional file 2). In contrast there appears to be more conservation at the level of translation. SURF1, the human ortholog of yeast SHY1, is a known COX assembly factor [30, 31] that participates in COX1 translation. The Ortho-Profile method identifies orthologs of multiple genes that control the COX1 translation process in fungi (COA1-C7orf44, COA3-CCDC56, COX14-C12orf62; Table 2). The proposed role of the human orthologs in COX1 translation is corroborated by their mitochondrial localization (Table 3) and the observed negative effect of C12orf62 overexpression on COX1 translation, as well as the physical association of the latter two proteins (Figure 4). Additional genes have been implicated in COX1 translation in human: TACO1  and PET309's homolog pentatricopeptide repeat-containing LRPPRC [48–50]. Our method identifies pentatricopeptide repeat-rich protein PTCD1 as an ortholog of the fungal PET309 gene. PTCD1 has been recently implicated in negative regulation of leucine tRNA levels, as well as negative regulation of mitochondria-encoded proteins and COX activity .
Recently, the identification of human orthologs of yeast COX assembly factors allowed us to prioritize C2orf64/COA5 (Table 2) as a candidate gene for a neonatal cardiomyopathy . Additionally, while this work was under review, a report on neonatal lactic acidosis was published  that supports our prediction and experimental confirmation of C12orf62 as a COX assembly factor. Of note, the authors argue that C12orf62 is a vertebrate-specific protein, while we show that it is orthologous to COX14. These discoveries signify the relevance of orthology prediction using profile-based approaches, such as Ortho-Profile, for biomedical research.
Materials and methods
The pipeline uses multiple homology detection methods to establish reciprocal best hits between a set of query genes (representing yeast mitochondrial proteomes) and human genes. The less divergent members of large protein families with multiple members per genome may hinder correct identification of orthologs if only the profile-based phases (profile-to-sequence or HMM-to-HMM) are used; thus, the pipeline was designed with three phases of increasing sensitivity, proceeding to a subsequent phase only if no orthology was detected in the previous phase.
In the first stage a BLAST search is performed on the human gene complement, using a yeast mitochondrial protein as the query sequence. If a significant similarity (E < 0.01) has been found, a reverse search is performed with the best hit (lowest E-value), to establish whether the human homolog is a statistically significant reciprocal best hit of the yeast mitochondrial protein (E < 0.01). In the case that no homolog is found, the search proceeds to the profile-to-sequence (PSI-BLAST) stage. In this second stage, three iterations of PSI-BLAST are run (E < 0.01, profile inclusion threshold 0.005), using the complete nr database for the construction of profiles. The first statistically significant homolog (E < 0.01) from the earliest PSI-BLAST iterations is selected, even if in the following iterations homologs with lower E-values are detected. In the last Ortho-Profile phase, a profile-based HMM that represents the query sequence is retrieved. Subsequently, the HMM is compared to an HMM database that represents the complete genome of the subject species. The best hit (based on the E-value) is used to establish reciprocity, analogously to the previous stages (E < 0.01).
HMM profile construction
The database of profiles for human and S. cerevisiae that were constructed using the HHPred toolkit, version 184.108.40.206  were downloaded from . For the profile construction of S. pombe, default options were used for the iterative multiple sequence alignment building stage PSI-BLAST (2.2.18) , running for eight cycles or until convergence. After each cycle of the standard PSI-BLAST algorithm, portions with insufficient similarity to the sequences in the multiple sequence alignment were pruned, in addition to trimming start and end portions of newly found matches, largely preventing the contamination with non-homologous extensions . These searches were performed against two subsets of the nr (non-redundant) database (downloaded from the NCBI website in July 2009), containing sequences filtered by CD-HIT  to a maximum pairwise sequence identity of 70% and 90%. To the final multiple sequence alignment, a representation of the predicted secondary structure, generated by the psipred program , is added to improve the profile-to-profile alignment.
Yeast mitochondrial proteomes
We collected the protein complement of the fission and budding yeast mitochondrion from the respective gene annotating consortia. Proteins with experimental evidence of mitochondrial localization were downloaded from GeneDB  (S. pombe), and the Saccharomyces Genome Database .
Evaluation of the Ortho-Profile method with the manually curated ortholog inventory
To evaluate the quality of orthology prediction, we took orthologs of human genes that were found in both S. pombe and S. cerevisiae (356 human genes), and for which at least one of the fungal proteins was known to be mitochondrial. For every human gene, their two orthologs in fungi were compared with the fungal ortholog inventory, manually curated by the S. pombe community  (obtained on June 2009); 95%, or 337 of the fungal orthologs had the same orthologous gene in human as inferred with Ortho-Profile. Additionally, 242 human genes had orthologs in only one fungal genome (95 in S. cerevisiae and 147 in S. pombe).
Protein domain analysis
Among orthologs determined with the Ortho-Profile method, 84% have an identical domain composition in human and fungi and for an additional 9% of orthologs (52 proteins) the human proteins contain extra domains compared to the fungal orthologs. Given the large number of proteins with identical domains, we decided to determine to what extent the domain composition data on their own can predict the subcellular localization. We found 1,627 human genes with the same PFAM  domain composition as yeast mitochondrial proteins (proteins without detectable domains were excluded). Of these genes, 34% (560 genes) encode proteins localized to mitochondria , compared to 67% for orthologs (see the Results section). This constitutes three-fold enrichment over 173 non-mitochondrial proteins, compared to 15-fold enrichment for orthologs determined by the Ortho-Profile method (see the Results section).
Selection of COX assembly factors
We collected 42 yeast genes from databases (Saccharomyces Genome Database) and literature that were previously implicated in COX transcription, translation, assembly, maintenance or regulation (Table S5 in Additional file 2). Additionally, we included YMR244C-A, a yeast gene of unknown function that has not been previously linked to COX in yeast, but that is a paralog of the COX12/COX6B subunit and has a respiratory-deficient knock-out phenotype  (Supplemental Methods in Additional file 2).
Integration of orthology data in the probabilistic framework
where P(orth|T mito) describes the probability that the ortholog of a yeast mitochondrial protein is an experimentally confirmed mitochondrial protein in human. Analogously, P(orth|T ~mito) reflects the probability that the ortholog is an experimentally confirmed non-mitochondrial human protein (based on the Gene Ontology annotation. As a result of the probabilistic integration, 15 of the 181 human proteins inferred with Ortho-Profile to be orthologous to fungal mitochondrial proteins that were previously not regarded to be mitochondrial  now received support from the framework at a 10% false discovery rate threshold (Table S3 in Additional file 2). Another 31 proteins of the 181 were not considered in the compendium at all. For example, PET100 and PET117 were not annotated as genes at the time when the compendium was prepared, precluding their detection in the proteomics experiment. With the inclusion of PET100 in the predicted gene set, the protein becomes identifiable in purified mitochondria.
Cloning of the predicted COX assembly factors and plasmid construction
The predicted COX assembly factors were PCR amplified without a stop codon from a human heart cDNA library with gene-specific primers adding Attb recombination sites (see Supplemental Methods in Additional file 2 for details).
Cell culture and transfection
T-REx™ Flp-In™ embryonic kidney 293 cells (HEK293; Invitrogen, Carlsbad, CA, USA) were maintained in DMEM (Biowhitaker, Verviers, Belgium) supplemented with 10% (v/v) fetal calf serum (FCS; PAA Laboratories, Pasching, Austria) and 1% (v/v) penicillin and streptomycin (GIBCO, Carlsbad, CA, USA), 5 μg/ml blasticidin (Invitrogen) and 300 μg/ml zeocin (Invitrogen) and grown at 37°C under an 5% CO2 atmosphere. For the generation of stable cell lines expressing HEK293 T-REx™ Flp-In™, cells were transfected with the GFP- and TAP-constructs together with the pOG44 recombinase expression vector using SuperFect transfection reagent (Qiagen, Hilden, Germany). Selection of stable transfectants was achieved by replacing the zeocin in the culture medium with hygromycin B (200 μg/ml; Calbiochem, Amsterdam, the Netherlands). Transgene expression was induced by adding doxycycline (Sigma, St Louis, MO, USA) to the culture medium (final concentration 1 μg/ml) for a minimum of 24 h.
BN-PAGE, SDS-PAGE, western blotting and immunodetection
BN-PAGE was done as described before . A total of 80 μg of protein was loaded per lane. Incubations with first antibodies were followed by incubations with secondary horse radish peroxidase conjugated goat-anti-mouse or goat-anti-rabbit IgGs and visualized using the enhanced chemiluminescence kit (see Supplemental Methods in Additional file 2).
Antibodies used in BN- and SDS-PAGE analysis
Antibodies used in BN- and SDS-PAGE analysis were rabbit anti-GFP antibody (dilution 1:5,000) , anti-CBP antibody (dilution 1:1,000; GenScript, Piscataway, NJ, USA) anti-ND1 (dilution 1:1,000; kindly provided by A Lombès, Unite de Recherche INSERM 153, Hospital de la Salpetriere, Paris, France ), mouse anti-SDHA (dilution 1:10,000), anti-COX1, anti-COX2, anti-COX4, anti-ATP5A1 (dilution 1,000) and anti-Core2 (1:5,000) (all from MitoSciences, Eugene, OR, USA), anti-TOM20 antibody (dilution 1:5000; BD Transduction Laboratories, Franklin Lakes, NJ, USA) anti-CK-B 21E10 antibody (dilution 1:2,000) .
Affinity purification and FT/MS analysis
T-REx™ Flp-In™ HEK293 cells were induced by doxycycline to express the TAP-tagged COX assembly factors. As a negative control unmodified HEK293 cells were used. After harvesting, cells were resuspended in lysis buffer (30 mM Tris-HCl pH 7.4, 150 mM NaCl, 1 mM EDTA, 1% (w/v) lauryl maltoside and protease inhibitor cocktail) and subjected to three cycles of freeze-thawing. The lysates were centrifuged for 10 minutes at 10,000xg after which the supernatant was incubated under rotation in the presence of Strep-tactin Superflow beads (IBA, Göttingen, Germany) for a minimum of 2 h at 4°C. After the incubation, beads were washed six times with washing buffer (30 mM Tris-HCl pH 7.4, 150 mM NaCl and 0.1% (w/v) lauryl maltoside). Retained proteins were eluted from the beads in washing buffer containing D-Desthiobiotin (IBA). Finally, the eluates were concentrated by passing them through a 3 kDa cutoff filter (Millipore, Cork, Ireland) and further processed for nanoLC-MS/MS analysis. The proteins were digested in-solution  and the nanoLC-MS/MS analysis was performed as described previously  (Supplemental Methods in Additional file 2).
Analysis of co-purified proteins
TAP uses the InterPlay mammalian TAP-system protocol (Agilent Technologies, Santa Clara, CA, USA), which contains a streptavidin and calmodulin binding part. Mitochondrial proteins co-purified with the five TAP-tagged constructs (C7orf44-TAP, PET100-TAP, PET117-TAP, C12orf62-TAP, AURKAIP1-TAP) were analyzed by nanoLC-MS/MS. We selected proteins that are expressed in the transfected cells solely following the doxycycline-stimulated expression, and not detectable without the treatment (Table S4 in Additional file 2). Additionally, we removed mitochondrial proteins that are non-specifically co-purified, based on the four additional control genes encoding mitochondrial proteins that are not directly functionally linked to respiratory chain complexes (GTPBP8, C10orf65, C7orf30 and BOLA1). Proteins co-purified with these control proteins (both upon doxycycline induction, as well as in non-induced cells, 119 proteins in total) were regarded as not specific to COX maintenance and/or assembly.
Mitochondrial translation assay
In vivo mitochondrial protein synthesis in cultured cells was analyzed as described previously . Briefly, cells overexpressing C12orf62 and the non-induced control were labeled for 60 minutes in L-methionine and L-cysteine free DMEM containing 10% dialyzed FCS, emetine (100 ug/ml) and 200 μCi/ml [35S]-methionine and [35S]-cysteine (Tran35S-Label; MP Biomedicals, Eindhoven, The Netherlands). After labeling, cells were chased for 10 minutes in regular DMEM with 10% FCS, harvested and resuspended in PBS containing 2% (w/v) lauryl maltoside. To remove insolubilized material the lysate was centrifuged for 10 minutes at 10,000xg. Next, equal amounts of protein were separated by SDS-PAGE on a 16% gel. To visualize labeled proteins the gel was dried and exposed to a Phosphorimager screen that was subsequently scanned with a FLA5100 scanner (Fujiimager, Tilburg, the Netherlands). Equal loading of proteins was confirmed by staining the gels with Coomassie Brilliant Blue G-250 after rehydration .
Coomassie Brilliant Blue G-250
calmodulin binding peptide: a part of the TAP tag
cytochrome c oxidase
Dulbecco's modified Eagle's medium
fetal calf serum
green fluorescent protein
hidden Markov model
liquid chromatography tandem mass spectrometry
polymerase chain reaction
tandem affinity purification
tetramethyl rhodamine methyl ester.
We would like to thank Michael Remmert and Johannes Söding for their help with preparing profiles and John van Dam and Robin van der Lee for critically reading the manuscript. We would also like to thank Karl R Clauser for support with the proteomics data analysis, Jack Fransen for excellent technical support, Anne Lombès for the ND1 antibody and Berdine Boks for technical support. This work was supported by the Netherlands Genomics Initiative (Horizon Programme) and the Centre for Systems Biology and Bioenergetics.
- Bork P, Koonin EV: Predicting functions from protein sequences - where are the bottlenecks?. Nat Genet. 1998, 18: 313-318. 10.1038/ng0498-313.View ArticlePubMedGoogle Scholar
- Gabaldón T, Dessimoz C, Huxley-Jones J, Vilella AJ, Sonnhammer EL, Lewis S: Joining forces in the quest for orthologs. Genome Biol. 2009, 10: 403-10.1186/gb-2009-10-9-403.PubMed CentralView ArticlePubMedGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.View ArticlePubMedGoogle Scholar
- Pearson WR, Lipman DJ: Improved tools for biological sequence comparison. Proc Natl Acad Sci USA. 1988, 85: 2444-2448. 10.1073/pnas.85.8.2444.PubMed CentralView ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Pietrokovski S: Searching databases of conserved sequence regions by aligning protein multiple-alignments. Nucleic Acids Res. 1996, 24: 3836-3845. 10.1093/nar/24.19.3836.PubMed CentralView ArticlePubMedGoogle Scholar
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14: 755-763. 10.1093/bioinformatics/14.9.755.View ArticlePubMedGoogle Scholar
- Söding J: Protein homology detection by HMM-HMM comparison. Bioinformatics. 2005, 21: 951-960. 10.1093/bioinformatics/bti125.View ArticlePubMedGoogle Scholar
- Calvo SE, Jain M, Xie X, Sheth SA, Chang B, Goldberger OA, Spinazzola A, Zeviani M, Carr SA, Mootha VK: Systematic identification of human mitochondrial disease genes through integrative genomics. Nat Genet. 2006, 38: 576-582. 10.1038/ng1776.View ArticlePubMedGoogle Scholar
- Marcotte EM, Xenarios I, van Der Bliek AM, Eisenberg D: Localizing proteins in the cell from their phylogenetic profiles. Proc Natl Acad Sci USA. 2000, 97: 12115-12120. 10.1073/pnas.220399497.PubMed CentralView ArticlePubMedGoogle Scholar
- Pagliarini DJ, Calvo SE, Chang B, Sheth SA, Vafai SB, Ong S-E, Walford GA, Sugiana C, Boneh A, Chen WK, Hill DE, Vidal M, Evans JG, Thorburn DR, Carr SA, Mootha VK: A mitochondrial protein compendium elucidates complex I disease biology. Cell. 2008, 134: 112-123. 10.1016/j.cell.2008.06.016.PubMed CentralView ArticlePubMedGoogle Scholar
- Szklarczyk R, Huynen MA: Expansion of the human mitochondrial proteome by intra- and inter-compartmental protein duplication. Genome Biol. 2009, 10: R135-10.1186/gb-2009-10-11-r135.PubMed CentralView ArticlePubMedGoogle Scholar
- Heazlewood JL, Howell KA, Millar AH: Mitochondrial complex I from Arabidopsis and rice: orthologs of mammalian and fungal components coupled with plant-specific subunits. Biochim Biophys Acta. 2003, 1604: 159-169. 10.1016/S0005-2728(03)00045-8.View ArticlePubMedGoogle Scholar
- Cardol P, Vanrobaeys F, Devreese B, Van Beeumen J, Matagne RF, Remacle C: Higher plant-like subunit composition of mitochondrial complex I from Chlamydomonas reinhardtii: 31 conserved components among eukaryotes. Biochim Biophys Acta. 2004, 1658: 212-224. 10.1016/j.bbabio.2004.06.001.View ArticlePubMedGoogle Scholar
- Gabaldón T, Rainey D, Huynen MA: Tracing the evolution of a large protein complex in the eukaryotes, NADH:ubiquinone oxidoreductase (Complex I). J Mol Biol. 2005, 348: 857-870. 10.1016/j.jmb.2005.02.067.View ArticlePubMedGoogle Scholar
- Smits P, Smeitink JAM, van den Heuvel LP, Huynen MA, Ettema TJG: Reconstructing the evolution of the mitochondrial ribosomal proteome. Nucleic Acids Res. 2007, 35: 4686-4703. 10.1093/nar/gkm441.PubMed CentralView ArticlePubMedGoogle Scholar
- Huynen MA, de Hollander M, Szklarczyk R: Mitochondrial proteome evolution and genetic disease. Biochim Biophys Acta. 2009, 1792: 1122-1129.View ArticlePubMedGoogle Scholar
- van Dam TJP, Snel B: Protein complex evolution does not involve extensive network rewiring. PLoS Comput Biol. 2008, 4: e1000132-10.1371/journal.pcbi.1000132.PubMed CentralView ArticlePubMedGoogle Scholar
- Videira A, Duarte M: From NADH to ubiquinone in Neurospora mitochondria. Biochim Biophys Acta. 2002, 1555: 187-191. 10.1016/S0005-2728(02)00276-1.View ArticlePubMedGoogle Scholar
- Cardol P: Mitochondrial NADH:ubiquinone oxidoreductase (complex I) in eukaryotes: A highly conserved subunit composition highlighted by mining of protein databases. Biochim Biophys Acta. 2011, 1807: 1390-1397. 10.1016/j.bbabio.2011.06.015.View ArticlePubMedGoogle Scholar
- Desmond E, Brochier-Armanet C, Forterre P, Gribaldo S: On the last common ancestor and early evolution of eukaryotes: reconstructing the history of mitochondrial ribosomes. Res Microbiol. 2011, 162: 53-70. 10.1016/j.resmic.2010.10.004.View ArticlePubMedGoogle Scholar
- Minczuk M, He J, Duch AM, Ettema TJ, Chlebowski A, Dzionek K, Nijtmans LGJ, Huynen MA, Holt IJ: TEFM (c17orf42) is necessary for transcription of human mtDNA. Nucleic Acids Res. 2011, 39: 4284-4299. 10.1093/nar/gkq1224.PubMed CentralView ArticlePubMedGoogle Scholar
- Reinders J, Zahedi RP, Pfanner N, Meisinger C, Sickmann A: Toward the complete yeast mitochondrial proteome: multidimensional separation techniques for mitochondrial proteomics. J Proteome Res. 2006, 5: 1543-1554. 10.1021/pr050477f.View ArticlePubMedGoogle Scholar
- Matsuyama A, Arai R, Yashiroda Y, Shirai A, Kamata A, Sekido S, Kobayashi Y, Hashimoto A, Hamamoto M, Hiraoka Y, Horinouchi S, Yoshida M: ORFeome cloning and global analysis of protein localization in the fission yeast Schizosaccharomyces pombe. Nat Biotechnol. 2006, 24: 841-847. 10.1038/nbt1222.View ArticlePubMedGoogle Scholar
- Steinmetz LM, Scharfe C, Deutschbauer AM, Mokranjac D, Herman ZS, Jones T, Chu AM, Giaever G, Prokisch H, Oefner PJ, Davis RW: Systematic screen for human disease genes in yeast. Nat Genet. 2002, 31: 400-404.PubMedGoogle Scholar
- Glerum DM, Tzagoloff A: Affinity purification of yeast cytochrome oxidase with biotinylated subunits 4, 5, or 6. Anal Biochem. 1998, 260: 38-43. 10.1006/abio.1998.2683.View ArticlePubMedGoogle Scholar
- Lombes A, Nakase H, Tritschler HJ, Kadenbach B, Bonilla E, DeVivo DC, Schon EA, DiMauro S: Biochemical and molecular analysis of cytochrome c oxidase deficiency in Leigh's syndrome. Neurology. 1991, 41: 491-498.View ArticlePubMedGoogle Scholar
- DiMauro S, Schon EA: Mitochondrial respiratory-chain diseases. N Engl J Med. 2003, 348: 2656-2668. 10.1056/NEJMra022567.View ArticlePubMedGoogle Scholar
- Zee JM, Glerum DM: Defects in cytochrome oxidase assembly in humans: lessons from yeast. Biochem Cell Biol. 2006, 84: 859-869. 10.1139/o06-201.View ArticlePubMedGoogle Scholar
- Tiranti V, Hoertnagel K, Carrozzo R, Galimberti C, Munaro M, Granatiero M, Zelante L, Gasparini P, Marzella R, Rocchi M, Bayona-Bafaluy MP, Enriquez JA, Uziel G, Bertini E, Dionisi-Vici C, Franco B, Meitinger T, Zeviani M: Mutations of SURF-1 in Leigh disease associated with cytochrome c oxidase deficiency. Am J Hum Genet. 1998, 63: 1609-1621. 10.1086/302150.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhu Z, Yao J, Johns T, Fu K, De Bie I, Macmillan C, Cuthbert AP, Newbold RF, Wang J, Chevrette M, Brown GK, Brown RM, Shoubridge EA: SURF1, encoding a factor involved in the biogenesis of cytochrome c oxidase, is mutated in Leigh syndrome. Nat Genet. 1998, 20: 337-343. 10.1038/3804.View ArticlePubMedGoogle Scholar
- Huigsloot M, Nijtmans LG, Szklarczyk R, Baars MJH, van den Brand MAM, Hendriksfranssen MGM, van den Heuvel LP, Smeitink JAM, Huynen MA, Rodenburg RJT: A mutation in c2orf64 causes impaired cytochrome C oxidase assembly and mitochondrial cardiomyopathy. Am J Hum Genet. 2011, 88: 488-493. 10.1016/j.ajhg.2011.03.002.PubMed CentralView ArticlePubMedGoogle Scholar
- Wood V: Schizosaccharomyces pombe comparative genomics; from sequence to systems. Topics Curr Genet. 2006, 15: 233-285. 10.1007/4735_97.View ArticleGoogle Scholar
- Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz H-R, Ceric G, Forslund K, Eddy SR, Sonnhammer ELL, Bateman A: The Pfam protein families database. Nucleic Acids Res. 2008, 36: D281-288. 10.1093/nar/gkn226.PubMed CentralView ArticlePubMedGoogle Scholar
- Menachem RB, Tal M, Pines O: A third of the yeast mitochondrial proteome is dual localized: a question of evolution. Proteomics. 2011, 11: 4468-4476. 10.1002/pmic.201100199.View ArticleGoogle Scholar
- Baughman JM, Nilsson R, Gohil VM, Arlow DH, Gauhar Z, Mootha VK: A computational screen for regulators of oxidative phosphorylation implicates SLIRP in mitochondrial RNA homeostasis. PLoS Genet. 2009, 5: e1000590-10.1371/journal.pgen.1000590.PubMed CentralView ArticlePubMedGoogle Scholar
- Glerum DM, Shtanko A, Tzagoloff A: Characterization of COX17, a yeast gene involved in copper metabolism and assembly of cytochrome oxidase. J Biol Chem. 1996, 271: 14504-14509. 10.1074/jbc.271.24.14504.View ArticlePubMedGoogle Scholar
- Oswald C, Krause-Buchholz U, Rödel G: Knockdown of human COX17 affects assembly and supramolecular organization of cytochrome c oxidase. J Mol Biol. 2009, 389: 470-479. 10.1016/j.jmb.2009.04.034.View ArticlePubMedGoogle Scholar
- Church , Goehring B, Forsha D, Wazny P, Poyton RO: A role for Pet100p in the assembly of yeast cytochrome c oxidase: interaction with a subassembly that accumulates in a pet100 mutant. J Biol Chem. 2005, 280: 1854-1863.View ArticlePubMedGoogle Scholar
- Barrientos A, Zambrano A, Tzagoloff A: Mss51p and Cox14p jointly regulate mitochondrial Cox1p expression in Saccharomyces cerevisiae. EMBO J. 2004, 23: 3472-3482. 10.1038/sj.emboj.7600358.PubMed CentralView ArticlePubMedGoogle Scholar
- Fornuskova D, Stiburek L, Wenchich L, Vinsova K, Hansikova H, Zeman J: Novel insights into the assembly and function of human nuclear-encoded cytochrome c oxidase subunits 4, 5a, 6a, 7a and 7b. Biochem J. 2010, 428: 363-374. 10.1042/BJ20091714.View ArticlePubMedGoogle Scholar
- Weraarpachai W, Antonicka H, Sasarman F, Seeger J, Schrank B, Kolesar JE, Lochmüller H, Chevrette M, Kaufman BA, Horvath R, Shoubridge EA: Mutation in TACO1, encoding a translational activator of COX I, results in cytochrome c oxidase deficiency and late-onset Leigh syndrome. Nat Genet. 2009, 41: 833-837. 10.1038/ng.390.View ArticlePubMedGoogle Scholar
- Perez-Martinez X, Butler CA, Shingu-Vazquez M, Fox TD: Dual functions of Mss51 couple synthesis of Cox1 to assembly of cytochrome c oxidase in Saccharomyces cerevisiae mitochondria. Mol Biol Cell. 2009, 20: 4371-4380. 10.1091/mbc.E09-06-0522.PubMed CentralView ArticlePubMedGoogle Scholar
- O'Brien KP, Remm M, Sonnhammer ELL: Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res. 2005, 33: D476-480.PubMed CentralView ArticlePubMedGoogle Scholar
- Li L, Stoeckert CJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13: 2178-2189. 10.1101/gr.1224503.PubMed CentralView ArticlePubMedGoogle Scholar
- van der Heijden RTJM, Snel B, van Noort V, Huynen MA: Orthology prediction at scalable resolution by phylogenetic tree analysis. BMC Bioinformatics. 2007, 8: 83-10.1186/1471-2105-8-83.PubMed CentralView ArticlePubMedGoogle Scholar
- Huerta-Cepas J, Bueno A, Dopazo J, Gabaldón T: PhylomeDB: a database for genome-wide collections of gene phylogenies. Nucleic Acids Res. 2008, 36: D491-496. 10.1093/nar/gkn241.PubMed CentralView ArticlePubMedGoogle Scholar
- Mootha VK, Lepage P, Miller K, Bunkenborg J, Reich M, Hjerrild M, Delmonte T, Villeneuve A, Sladek R, Xu F, Mitchell GA, Morin C, Mann M, Hudson TJ, Robinson B, Rioux JD, Lander ES: Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. Proc Natl Acad Sci USA. 2003, 100: 605-610. 10.1073/pnas.242716699.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu F, Morin C, Mitchell G, Ackerley C, Robinson BH: The role of the LRPPRC (leucine-rich pentatricopeptide repeat cassette) gene in cytochrome oxidase assembly: mutation causes lowered levels of COX (cytochrome c oxidase) I and COX III mRNA. Biochem J. 2004, 382: 331-336. 10.1042/BJ20040469.PubMed CentralView ArticlePubMedGoogle Scholar
- Tavares-Carreón F, Camacho-Villasana Y, Zamudio-Ochoa A, Shingú-Vázquez M, Torres-Larios A, Pérez-Martínez X: The pentatricopeptide repeats present in Pet309 are necessary for translation but not for stability of the mitochondrial COX1 mRNA in yeast. J Biol Chem. 2008, 283: 1472-1479.View ArticlePubMedGoogle Scholar
- Rackham O, Davies SMK, Shearwood A-MJ, Hamilton KL, Whelan J, Filipovska A: Pentatricopeptide repeat domain protein 1 lowers the levels of mitochondrial leucine tRNAs in cells. Nucleic Acids Res. 2009, 37: 5859-5867. 10.1093/nar/gkp627.PubMed CentralView ArticlePubMedGoogle Scholar
- Weraarpachai W, Sasarman F, Nishimura T, Antonicka H, Auré K, Rötig A, Lombès A, Shoubridge EA: Mutations in C12orf62, a factor that couples COX I synthesis with cytochrome c oxidase assembly, cause fatal neonatal lactic acidosis. Am J Hum Genet. 2012, 90: 142-151. 10.1016/j.ajhg.2011.11.027.PubMed CentralView ArticlePubMedGoogle Scholar
- HHPred toolkit. [ftp://toolkit.lmb.uni-muenchen.de/HHsearch/]
- Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006, 22: 1658-1659. 10.1093/bioinformatics/btl158.View ArticlePubMedGoogle Scholar
- McGuffin LJ, Bryson K, Jones DT: The PSIPRED protein structure prediction server. Bioinformatics. 2000, 16: 404-405. 10.1093/bioinformatics/16.4.404.View ArticlePubMedGoogle Scholar
- Hertz-Fowler C, Peacock CS, Wood V, Aslett M, Kerhornou A, Mooney P, Tivey A, Berriman M, Hall N, Rutherford K, Parkhill J, Ivens AC, Rajandream M-A, Barrell B: GeneDB: a resource for prokaryotic and eukaryotic organisms. Nucleic Acids Res. 2004, 32: D339-343. 10.1093/nar/gkh007.PubMed CentralView ArticlePubMedGoogle Scholar
- Fisk DG, Ball CA, Dolinski K, Engel SR, Hong EL, Issel-Tarver L, Schwartz K, Sethuraman A, Botstein D, Cherry JM: Saccharomyces cerevisiae S288C genome annotation: a working hypothesis. Yeast. 2006, 23: 857-865. 10.1002/yea.1400.PubMed CentralView ArticlePubMedGoogle Scholar
- Ugalde C, Vogel R, Huijbens R, Van Den Heuvel B, Smeitink J, Nijtmans L: Human mitochondrial complex I assembles through the combination of evolutionary conserved modules: a framework to interpret complex I deficiencies. Hum Mol Genet. 2004, 13: 2461-2472. 10.1093/hmg/ddh262.View ArticlePubMedGoogle Scholar
- Cuppen E, Wijers M, Schepens J, Fransen J, Wieringa B, Hendriks W: A FERM domain governs apical confinement of PTP-BL in epithelial cells. J Cell Sci. 1999, 112: 3299-3308.PubMedGoogle Scholar
- Procaccio V, Mousson B, Beugnot R, Duborjal H, Feillet F, Putet G, Pignot-Paintrand I, Lombès A, De Coo R, Smeets H, Lunardi J, Issartel JP: Nuclear DNA origin of mitochondrial complex I deficiency in fatal infantile lactic acidosis evidenced by transnuclear complementation of cultured fibroblasts. J Clin Invest. 1999, 104: 83-92. 10.1172/JCI6184.PubMed CentralView ArticlePubMedGoogle Scholar
- Sistermans EA, de Kok YJ, Peters W, Ginsel LA, Jap PH, Wieringa B: Tissue- and cell-specific distribution of creatine kinase B: a new and highly specific monoclonal antibody for use in immunohistochemistry. Cell Tissue Res. 1995, 280: 435-446. 10.1007/BF00307817.View ArticlePubMedGoogle Scholar
- Wessels HJCT, Gloerich J, van der Biezen E, Jetten MSM, Kartal B: Liquid chromatography-mass spectrometry-based proteomics of nitrosomonas. Meth Enzymol. 2011, 486: 465-482.View ArticlePubMedGoogle Scholar
- Wessels HJCT, Vogel RO, van den Heuvel L, Smeitink JA, Rodenburg RJ, Nijtmans LG, Farhoud MH: LC-MS/MS as an alternative for SDS-PAGE in blue native analysis of protein complexes. Proteomics. 2009, 9: 4221-4228. 10.1002/pmic.200900157.View ArticlePubMedGoogle Scholar
- Boulet L, Karpati G, Shoubridge EA: Distribution and threshold expression of the tRNA(Lys) mutation in skeletal muscle of patients with myoclonic epilepsy and ragged-red fibers (MERRF). Am J Hum Genet. 1992, 51: 1187-1200.PubMed CentralPubMedGoogle Scholar
- Rodenburg RJT: Biochemical diagnosis of mitochondrial disorders. J Inherit Metab Dis. 2011, 34: 283-292. 10.1007/s10545-010-9081-y.PubMed CentralView ArticlePubMedGoogle Scholar
- Emanuelsson O, Brunak S, von Heijne G, Nielsen H: Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc. 2007, 2: 953-971. 10.1038/nprot.2007.131.View ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMedGoogle Scholar
- Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ: Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009, 25: 1189-1191. 10.1093/bioinformatics/btp033.PubMed CentralView ArticlePubMedGoogle Scholar
- Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P, Jensen LJ, von Mering C: The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011, 39: D561-568. 10.1093/nar/gkq973.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.