Skip to main content

Table 1 A list of predicted selenoproteins encoded by UGA read-through

From: A computational method to predict genetically encoded rare amino acids in proteins

Accession ID Organism Computationally identified selenoproteins* annotated by their homologs
AE000657 Aquifex aeolicus 1. gi|12515210|gb|AAG56295.1|AE005358_3 formate dehydrogenase-N, nitrate-inducible, alpha subunit [Escherichia coli]
   2. gi|51589698|emb|CAH21328.1| selenide, water dikinase [Yersinia pseudotuberculosis IP 32953]
AE017125 Helicobacter hepaticus 1.gi|27362035|gb|AAO10941.1|AE016805_198 formate dehydrogenase, alpha subunit [Vibrio vulnificus CMCP6]
   2. gi|46914191|emb|CAG20971.1| putative selenophosphate synthase [Photobacterium profundum]
AE017143 Haemophilus ducreyi 35000HP 1. gi|26108424|gb|AAN80626.1|AE016761_201 selenide, water dikinase [Escherichia coli CFT073]
AE004439 Pasteurella multocida 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   2. gi|5103639|dbj|BAA79160.1| 194 amino acid long hypothetical protein [ Aeropyrum pernix K1]
AE005674 Shigella flexneri 2a 1. gi|12515215|gb|AAG56300.1|AE005358_8 orf; unknown function [ Escherichia coli O157:H7 EDL933]
   2. gi|1788928|gb|AAC75627.1| quinolinate synthetase, B protein; quinolinate synthetase, B protein, catalytic and NAD/flavoprotein subunit [ Escherichia coli >K12]
   3. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   4. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   5. gi|3868721|gb|AAD13462.1| selenopolypeptide subunit of formate dehydrogenase H; formate dehydrogenase H, selenopolypeptide subunit [Escherichia coli K12]
AE014073 Shigella flexneri 2a 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   2. gi|1788928|gb|AAC75627.1| quinolinate synthetase, B protein; quinolinate synthetase, B protein, catalytic and NAD/flavoprotein subunit [ Escherichia coli K12]
   3. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   4. gi|3868721|gb|AAD13462.1| selenopolypeptide subunit of formate dehydrogenase H; formate dehydrogenase H, selenopolypeptide subunit [Escherichia coli K12]
AE006469 Sinorhizobium meliloti 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE008691 Thermoanaerobacter tengcongensis 1. gi|41816370|gb|AAS11237.1| glycine reductase complex selenoprotein GrdA [Treponema denticola ATCC 35405]
   2. gi|51857693|dbj|BAD41851.1| glycine reductase complex selenoprotein B [Symbiobacterium thermophilum IAM 14863]
   3. gi|46914191|emb|CAG20971.1| putative selenophosphate synthase [Photobacterium profundum]
AE014075 Escherichia coli CFT073 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   2. gi|56130341|gb|AAV79847.1| formate dehydrogenase H [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150]
   3. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
BA000007 Escherichia coli O157H7 1. gi|56130341|gb|AAV79847.1| formate dehydrogenase H [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150]
   2. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   3. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
U00096 Escherichia coli K12 1. gi|5105267|dbj|BAA80580.1| 114 amino acid long hypothetical protein [ Aeropyrum pernix K1]
   2. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   3. gi|56130341|gb|AAV79847.1| formate dehydrogenase H [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150]
   4. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE014299 Shewanella oneidensis 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE015451 Pseudomonas putida KT2440 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE004091 Pseudomonas aeruginosa 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE016958 Mycobacterium avium paratuberculosis 1. gi|13880045|gb|AAK44759.1| hypothetical protein MT0536 [ Mycobacterium tuberculosis CDC1551]
   2. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE017042 Yersinia pestis biovar Mediaevalis 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE009952 Yersinia pestis KIM 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AL590842 Yersinia pestis CO92 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE017180 Geobacter sulfurreducens 1. gi|19918170|gb|AAM07420.1| 4-carboxymuconolactone decarboxylase [Methanosarcina acetivorans str. C2A]
   2. gi|21956737|gb|AAM83670.1|AE013608_5 glutaredoxin 3 [Yersinia pestis KIM]
   3. gi|37201109|dbj|BAC96933.1| thiol-disulfide isomerase and thioredoxins [Vibrio vulnificus YJ016]
   4. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   5. gi|34105000|gb|AAQ61356.1| conserved hypothetical protein [Chromobacterium violaceum ATCC 12472]; gi|53758707|gb|AAU92998.1| HesB/YadR/YfhF family protein [Methylococcus capsulatus str. Bath];
   6. gi|46914191|emb|CAG20971.1| Putative selenophosphate synthase [Photobacterium profundum]
   7. gi|32448022|emb|CAD77542.1| peroxiredoxin [Pirellula sp.]
   8. gi|29605647|dbj|BAC69712.1 hypothetical protein [Streptomyces avermitilis MA-4680] (SelW)
   9. gi|34482757|emb|CAE09757.1| sulfur transferase precursor [Wolinella succinogenes]
AE017226 Treponema denticola ATCC 35405 1. gi|51857694|dbj|BAD41852.1| glycine reductase complex selenoprotein A [Symbiobacterium thermophilum IAM 14863]
   2. gi|51857693|dbj|BAD41851.1| glycine reductase complex selenoprotein B [Symbiobacterium thermophilum IAM 14863]
   3. gi|56380162|dbj|BAD76070.1| glutathione peroxidase [Geobacillus kaustophilus HTA426]
   4. gi|51857693|dbj|BAD41851.1| glycine reductase complex selenoprotein B [Symbiobacterium thermophilum IAM 14863]
   5. gi|26108424|gb|AAN80626.1|AE016761_201 selenide, water dikinase [Escherichia coli CFT073]
   6. gi|52209545|emb|CAH35498.1| thioredoxin 1 [Burkholderia pseudomallei K96243]
AL111168 Campylobacter jejuni 1. gi|27362035|gb|AAO10941.1|AE016805_198 formate dehydrogenase, alpha subunit [Vibrio vulnificus CMCP6]
   2. gi|54018125|dbj|BAD59495.1| hypothetical protein [Nocardia farcinica IFM 10152]; (SelW)
AL513382 Salmonella typhi 1. gi|3868721|gb|AAD13462.1| selenopolypeptide subunit of formate dehydrogenase H; formate dehydrogenase H, selenopolypeptide subunit [Escherichia coli K12]
   2. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
AE006468 Salmonella typhimurium LT2 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   2. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   3. gi|3868721|gb|AAD13462.1| selenopolypeptide subunit of formate dehydrogenase H; formate dehydrogenase H, selenopolypeptide subunit [Escherichia coli K12]
BA000016 Clostridium perfringens 1. gi|28202985|gb|AAO35429.1| conserved protein [Clostridium tetani E88]; gi|20906561|gb|AAM31712.1| HesB protein [Methanosarcina mazei Goe1]
   2. gi|46914191|emb|CAG20971.1| putative selenophosphate synthase [Photobacterium profundum]
BX470251 Photorhabdus luminescens 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase alpha subunit [Aquifex aeolicus VF5]
BX571656 Wolinella succinogenes 1. gi|27362035|gb|AAO10941.1|AE016805_198 formate dehydrogenase, alpha subunit [Vibrio vulnificus CMCP6]
L42023 Haemophilus influenzae 1. gi|2983532|gb|AAC07107.1| formate dehydrogenase, alpha subunit [Aquifex aeolicus VF5]
   2. gi|26108424|gb|AAN80626.1|AE016761_201 selenide, water dikinase [Escherichia coli CFT073]
CR354531 Photobacterium profundum 1. gi|58428447|gb|AAW77484.1| conserved hypothetical protein [ Xanthomonas oryzae pv. oryzae KACC10331]
CR354532 Photobacterium profundum 1. gi|41816370|gb|AAS11237.1| glycine reductase complex selenoprotein GrdA [Treponema denticola ATCC 35405]
   2. gi|51589698|emb|CAH21328.1| selenide, water dikinase [Yersinia pseudotuberculosis IP 32953]
   3. gi|41816370|gb|AAS11237.1| glycine reductase complex selenoprotein GrdA [Treponema denticola ATCC 35405]
   4. gi|41818450|gb|AAS12639.1| glycine reductase complex selenoprotein GrdB2 [Treponema denticola ATCC 35405]
AE009439 Methanopyrus kandleri (archaea) 1. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]; gi|2622681|gb|AAB86033.1| tungsten formylmethanofuran dehydrogenase, subunit B [Methanothermobacter thermautotrophicus]
   2. gi|57160335|dbj|BAD86265.1| probable formate dehydrogenase, alpha subunit [Thermococcus kodakaraensis KOD1]
   3. gi|33566318|emb|CAE37231.1| putative iron-sulfur binding protein [Bordetella parapertussis]
   4. gi|44921146|emb|CAF30381.1| heterodisulfide reductase, subunit A [Methanococcus maripaludis]
   5. gi|44921142|emb|CAF30377.1| coenzyme F420-non-reducing hydrogenase, subunit delta [Methanococcus maripaludis]; gi|2622243|gb|AAB85627.1| methyl viologen-reducing hydrogenase, delta subunit homolog FlpD [Methanothermobacter thermautotrophicus]; gi|20904385|gb|AAM29752.1| heterodisulfate reductase, subunit A [Methanosarcina mazei Goe1]
   6. gi|45047811|emb|CAF30938.1| coenzyme F420-reducing hydrogenase subunit alpha [Methanococcus maripaludis]
   7. gi|39576202|emb|CAE80367.1| selenide, water dikinase [Bdellovibrio bacteriovorus HD100]
L77117 Methanococcus jannaschii (archaea) 1. gi|44921146|emb|CAF30381.1| heterodisulfide reductase subunit A [Methanococcus maripaludis]
   2. gi|45047811|emb|CAF30938.1| coenzyme F420-reducing hydrogenase subunit alpha [Methanococcus maripaludis]
   3. gi|50875900|emb|CAG35740.2| methyl-viologen-reducing hydrogenase, delta subunit [Desulfotalea psychrophila LSv54]
   4. gi|2622240|gb|AAB85625.1| methyl viologen-reducing hydrogenase, delta subunit [Methanothermobacter thermautotrophicus]; gi|44921142|emb|CAF30377.1| coenzyme F420-non-reducing hydrogenase subunit delta [Methanococcus maripaludis]
   5. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]; gi|45048129|emb|CAF31247.1| tungsten containing formylmethanofuran dehydrogenase, subunit B [Methanococcus maripaludis] (overlaps with #4)
   6. gi|26108424|gb|AAN80626.1|AE016761_201 selenide, water dikinase [Escherichia coli CFT073]
   7. gi|53758707|gb|AAU92998.1| HesB/YadR/YfhF family protein [Methylococcus capsulatus str. Bath]
   8. gi|45047727|emb|CAF30854.1| formate dehydrogenase, alpha subunit [Methanococcus maripaludis]
BX950229 Methanococcus maripaludis (archaea) 1. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]; gi|19886584|gb|AAM01476.1| Formylmethanofuran dehydrogenase subunit B [Methanopyrus kandleri AV19]
   2. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]
   3. gi|2622240|gb|AAB85625.1| methyl viologen-reducing hydrogenase, delta subunit [Methanothermobacter thermautotrophicus]; gi|39981962|gb|AAR33424.1| heterodisulfide reductase subunit [Geobacter sulfurreducens PCA]
   4. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]
   5. gi|2622673|gb|AAB86026.1| formate dehydrogenase, alpha subunit homolog [Methanothermobacter thermautotrophicus]; gi|19918286|gb|AAM07526.1| formylmethanofuran dehydrogenase, subunit B [Methanosarcina acetivorans str. C2A]
   6. gi|19886593|gb|AAM01482.1| Heterodisulfide reductase, subunit A, polyferredoxin [Methanopyrus kandleri AV19]
  1. Organism names, National Center for Biotechnology Information accession numbers for the genomes and the top PSI-BLAST hit(s) from our database are shown. Seven novel candidate selenoproteins are shown in bold type. *Each entry corresponds to a computationally identified read-through protein in the organism indicated to the left. FASTA files for these recoded protein sequences are provided in the Additional file 2. For each recoded protein, the GI number and the functional annotation for a homologous protein are given.