NEAT: a domain duplicated in genes near the components of a putative Fe3+siderophore transporter from Gram-positive pathogenic bacteria
© Andrade et al., licensee BioMed Central Ltd 2002
Received: 24 April 2002
Accepted: 5 June 2002
Published: 15 August 2002
Iron uptake from the host is essential for bacteria that infect animals. To find potential targets for drugs active against pathogenic bacteria, we have searched all completely sequenced genomes of pathogenic bacteria for genes relevant for iron transport.
We identified a protein domain that appears in variable copy number in bacterial genes that are usually in the vicinity of a putative Fe3+ siderophore transporter. Accordingly, we have denoted this domain NEAT for 'near transporter'. Most of the bacterial species containing this domain are pathogenic. Sequence features indicate that the domain is anchored to the extracellular side of the membrane. The domain seems to be under high selective pressure for rapid independent duplications that are typical of sequences involved in signaling and binding.
The NEAT domain might be functionally related to iron transport. The taxonomic specificity of this domain and its predicted extracellular position could make it an interesting target for designing new drugs against some highly pathogenic bacteria.
Iron transport into the cell is very important for the growth of an organism. Pathogenic bacteria, which have to survive within an animal, are able to sequester iron from the iron-containing proteins of the host by secreting siderophores that have a higher affinity for the iron (reviewed in ). Then, a specific transport system imports the iron-siderophore complex back into the bacterial cytoplasm. The disruption of this uptake function in bacteria is likely to be a good strategy in fighting infectivity. We searched the genomic neighborhood of putative Fe3+ siderophore transporters in pathogenic bacteria in order to identify genes that could be associated with this functionality and thus constitute targets for therapy against disease. As a result of our analysis, we characterized a highly duplicated domain that we propose as a receptor for an iron complex.
Results and discussion
Survey for putative Fe3+siderophore transporters in complete bacterial genomes
In order to find proteins related to iron transport in pathogenic bacteria, we first scanned complete genomes of pathogenic bacteria for sequences homologous to those encoding the three currently known Escherichia coli Fe3+ siderophore transporters: the Fe3+ dicitrate transport complex , the Fe3+ enterobactin transport complex , and the Fe3+ hydroxamate transport complex . These transporters import iron from the periplasm into the cytoplasm of E. coli, expending ATP. Several components of the putative transporter were found in contiguous genomic positions of four pathogenic Gram-positive bacteria, three of which are associated with food-borne diseases (Listeria monocytogenes, Clostridium perfringens, and Staphylococcus aureus). In humans, the fourth bacterium, Staphylococcus pyogenes, produces pharyngitis, impetigo, toxic shock syndrome, necrotizing fasciitis, rheumatic fever, and acute glomerulonephritis.
In order to find genes associated with the putative Fe3+ siderophore transporter that could be characteristic of the pathogenic species, we analyzed the genomic neighborhood of the transporter in complete genomes. The repeated presence of neighboring gene pairs across different species permits us to reach conclusions about the possible functional association of the paired genes [5,6,7].
Codes for genes containing the NEAT domain
Protein accession number*
S. aureus strain N315
S. aureus strain N315
S. aureus strain N315
S. aureus strain N315
B. anthracis, virulence plasmid PX01
The search for homologous sequences in the whole protein database added one more sequence from S. aureus, and a short protein corresponding to the middle part of the domain in the virulence plasmid pXO1 of Bacillus anthracis, which is essential for the manifestation of the disease anthrax. Both genes are apparently not physically close to transport-related genes.
Phylogenetic analysis of the new domain
Some protein domains have a highly variable copy number per protein, but they can exist as a single copy. This is in contrast to structural repeats (such as armadillo or leucine-rich repeats) that fold together and, by definition, never appear as a single copy . Whereas structural repeats are related to DNA or protein binding, occasionally repeated domains can bind either large or small substrates; for example, Ca2+ (bound by C2, cadherin repeats, epidermal growth factor repeats), nucleotides (bound by zinc finger domains, LIM domains, and homeobox domains), or proteins (kazal inhibits serine proteases, ubiquitin domains in polyubiquitin bind target proteins to be degraded, PDZ domains bind polypeptides, nebulin repeats bind actin, immunoglobulins bind antigens, fibronectin 1 repeats bind fibrin and so on). (See the SMART server for further examples and references [9,10].) Accordingly, occasionally repeated domains are often involved in signaling or transcription regulation. A large copy number is used as a way of increasing the effectiveness of the binding activity. This could be the case with the NEAT domain, which can be found as one single copy per sequence. In this respect, the NEAT domain appears to perform a binding function rather than a structural or an enzymatic one. Accordingly, the multiple alignment of the instances of the domain (Figure 2) indicates the lack of obvious conserved catalytic residues.
The NEAT domain appears to be associated with iron transport in several Gram-positive species (some of them pathogenic). Given its predicted extracellular location and its close association with the components of an iron transport system, one possible function of the NEAT domain is to be a receptor of the siderophore-iron complex. It would initiate a cascade upon detection of the substrate, ending in the expression of the components of the transporter in a system similar to that used in the induction of FecA . Further evidence in this direction is given by recent experimental results for two of the NEAT-domain proteins from S. aureus, FrpA and FrpB (denoted here as S_aur4 and S_aur2, respectively), which were identified as cell wall proteins expressed under iron-restricted conditions .
The multiple duplication of this domain could reflect competition with an inhibitor. It could also be used for increasing bacterial sensitivity to the presence of the iron complex at very low substrate concentrations, in order to trigger the production of the corresponding transporter. The extracellular location of the domain, its association with a key process for bacterial survival, and its specificity to the group of pathogenic bacteria described, all make it a good candidate for developing a strategy against these pathogens.
- Ratledge C, Dover LG: Iron metabolism in pathogenic bacteria. Annu Rev Microbiol. 2000, 54: 881-941. 10.1146/annurev.micro.54.1.881.PubMedView ArticleGoogle Scholar
- Staudenmaier H, van Hove B, Yaraghi Z, Braun V: Nucleotide sequences of the fecBCDE genes and locations of the proteins suggest a periplasmic-binding-protein-dependent transport mechanism for iron(III) dicitrate in Escherichia coli. J Bacteriol. 1989, 171: 2626-2633.PubMedPubMed CentralGoogle Scholar
- Ozenberger BA, Nahlik MS, McIntosh MA: Genetic organization of multiple fep genes encoding ferric enterobactin transport functions in Escherichia coli. J Bacteriol. 1987, 169: 3638-3646.PubMedPubMed CentralGoogle Scholar
- Burkhardt R, Braun V: Nucleotide sequence of the fhuC and fhuD genes involved in iron (III) hydroxamate transport: domains in FhuC homologous to ATP-binding proteins. Mol Gen Genet. 1987, 209: 49-55.PubMedView ArticleGoogle Scholar
- Dandekar T, Snel B, Huynen M, Bork P: Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem Sci. 1998, 23: 324-328. 10.1016/S0968-0004(98)01274-2.PubMedView ArticleGoogle Scholar
- Overbeek R, Fonstein M, D'Souza M, Pusch GD, Maltsev N: The use of gene clusters to infer functional coupling. Proc Natl Acad Sci USA. 1999, 96: 2896-2901. 10.1073/pnas.96.6.2896.PubMedPubMed CentralView ArticleGoogle Scholar
- Huynen M, Snel B, Lathe W, Bork P: Exploitation of gene context. Curr Opin Struct Biol. 2000, 10: 366-370. 10.1016/S0959-440X(00)00098-1.PubMedView ArticleGoogle Scholar
- Rost B, Sander C: Combining evolutionary information and neural networks to predict protein secondary structure. Proteins. 1994, 19: 55-72.PubMedView ArticleGoogle Scholar
- Letunic I, Goodstadt L, Dickens NJ, Doerks T, Schultz J, Mott R, Ciccarelli F, Copley RR, Ponting CP, Bork P: Recent improvements to the SMART domain-based sequence annotation resource. Nucleic Acids Res. 2002, 30: 242-244. 10.1093/nar/30.1.242.PubMedPubMed CentralView ArticleGoogle Scholar
- SMART - Simple Modular Architecture Research Tool. [http://smart.embl-heidelberg.de/]
- Perldoc documentation for Bioperl Modules. [http://doc.bioperl.org]
- von Heijne G: A new method for predicting signal sequences cleavage sites. Nucleic Acids Res. 1986, 14: 4683-4690.PubMedPubMed CentralView ArticleGoogle Scholar
- Krogh A, Larsson B, von Heijne G, Sonnhammer ELL: Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J Mol Biol. 2001, 305: 567-580. 10.1006/jmbi.2000.4315.PubMedView ArticleGoogle Scholar
- Fischetti VA, Pancholi V, Schneewind O: Conservation of a hexapeptide sequence in the anchor region of surface proteins from gram-positive cocci. Mol Microbiol. 1990, 4: 1603-1605.PubMedView ArticleGoogle Scholar
- Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer EL: The Pfam protein families database. Nucleic Acids Res. 2002, 30: 276-280. 10.1093/nar/30.1.276.PubMedPubMed CentralView ArticleGoogle Scholar
- Protein families database of alignments and HMMs. [http://www.sanger.ac.uk/Pfam/]
- Andrade MA, Perez-Iratxeta C, Ponting CP: Protein repeats: structures, functions and evolution. J Struct Biol. 2001, 134: 117-131. 10.1006/jsbi.2001.4392.PubMedView ArticleGoogle Scholar
- Enz S, Mahren S, Stroeher UH, Braun V: Surface signaling in ferric citrate transport gene induction: interaction of the FecA, FecR, and FecI regulatory proteins. J Bacteriol. 2000, 182: 637-646. 10.1128/JB.182.3.637-646.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Morrissey JA, Cockayne A, Hammacott J, Bishop K, Denman-Johnson A, Hill PJ, Williams P: Conservation, surface exposure, and in vivo expression of the Frp family of iron-regulated cell wall proteins in Staphylococcus aureus. Infect Immun. 2002, 70: 2399-2407. 10.1128/IAI.70.5.2399-2407.2002.PubMedPubMed CentralView ArticleGoogle Scholar
- Takami H, Nakasone K, Takaki Y, Maeno G, Sasaki R, Masui N, Fuji F, Hirama C, Nakamura Y, Ogasawara N, Kuhara S, Horikoshi K: Complete genome sequence of the alkaliphilic bacterium Bacillus halodurans and genomic sequence comparison with Bacillus subtilis. Nucleic Acids Res. 2000, 28: 4317-4331. 10.1093/nar/28.21.4317.PubMedPubMed CentralView ArticleGoogle Scholar
- Shimizu T, Ohtani K, Hirakawa H, Ohshima K, Yamashita A, Shiba T, Ogasawara N, Hattori M, Kuhara S, Hayashi H: Complete genome sequence of Clostridium perfringens, an anaerobic flesh-eater. Proc Natl Acad Sci USA. 2002, 99: 996-1001. 10.1073/pnas.022493799.PubMedPubMed CentralView ArticleGoogle Scholar
- Ferretti JJ, McShan WM, Ajdic D, Savic DJ, Savic G, Lyon K, Primeaux C, Sezate S, Suvorov AN, Kenton S, et al: Complete genome sequence of an M1 strain of Streptococcus pyogenes. Proc Natl Acad Sci USA. 2001, 98: 4658-4663. 10.1073/pnas.071559398.PubMedPubMed CentralView ArticleGoogle Scholar
- Glaser P, Frangeul L, Buchrieser C, Rusniok C, Amend A, Baquero F, Berche P, Bloecker H, Brandt P, Chakraborty T, et al: Comparative genomics of Listeria species. Science. 2001, 294: 849-852. 10.1126/science.1063447.PubMedGoogle Scholar
- Kuroda M, Ohta T, Uchiyama I, Baba T, Yuzawa H, Kobayashi I, Cui L, Oguchi A, Aoki K, Nagai Y, et al: Whole genome sequencing of meticillin-resistant Staphylococcus aureus. Lancet. 2001, 357: 1225-1240. 10.1016/S0140-6736(00)04403-2.PubMedView ArticleGoogle Scholar
- Okinaka RT, Cloud K, Hampton O, Hoffmaster AR, Hill KK, Keim P, Koehler TM, Lamke G, Kumano S, Mahillon J, et al: Sequence and organization of pXO1, the large Bacillus anthracis plasmid harboring the anthrax toxin genes. J Bacteriol. 1999, 181: 6509-6515.PubMedPubMed CentralGoogle Scholar
- Snel B, Lehmann G, Bork P, Huynen MA: STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res. 2000, 28: 3442-3444. 10.1093/nar/28.18.3442.PubMedPubMed CentralView ArticleGoogle Scholar
- STRING - Search Tool for Recurring Instances of Neighbouring Genes. [http://www.bork.embl-heidelberg.de/STRING/]
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.PubMedPubMed CentralView ArticleGoogle Scholar
- Mott R: Accurate formula for P-values of gapped local sequence and profile alignments. J Mol Biol. 2000, 300: 649-659. 10.1006/jmbi.2000.3875.PubMedView ArticleGoogle Scholar
- PROSPERO. [http://www.well.ox.ac.uk/rmott/ARIADNE/prospero.shtml]
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14: 755-763. 10.1093/bioinformatics/14.9.755.PubMedView ArticleGoogle Scholar
- HMMER 2.2. [http://hmmer.wustl.edu/]
- Andrade MA, Ponting CP, Gibson TJ, Bork P: Homology-based method for identification of protein repeats using statistical significance estimates. J Mol Biol. 2000, 298: 521-537. 10.1006/jmbi.2000.3684.PubMedView ArticleGoogle Scholar
- REP. [http://www.embl-heidelberg.de/~andrade/papers/rep/search.html]