Searching sequence space for ATP-binding proteins
- Rachel Brem
© BioMed Central Ltd 2001
Received: 25 April 2001
Published: 22 May 2001
A library of random protein sequences has yielded novel, ATP-binding proteins not found in any organism.
Significance and context
How difficult is it to create a functional protein? Evolutionary biologists ask this question when studying the origin of life; protein designers ask it in reference to theories of protein folding. Keefe and Szostak address the question using a brute-force protein-sequence sampling technique. They synthesized 1012 random proteins of 80 amino acids (80-mers), each covalently fused to its mRNA (to allow simpler sequencing), and then selected for fusions that bound an ATP affinity column. Keefe and Szostak identified at least one protein that bound free ATP tightly and specifically. Their results enable an upper limit to be estimated for the rate at which evolution would uncover a functional protein in a random sequence search.
Keefe and Szostak began with a library of 1012 80-mers. Following eight rounds of selection and amplification, 6.2% of the library bound the ATP column; none of the binders at this stage shared sequence similarity with other known proteins. After mutagenesis and further selection, 34% of the library bound the ATP column. From the final generation, eight clones were purified in milligram quantities and characterized as free proteins. All shared about 80% sequence identity among themselves but are not related to any other known sequence. The isolate with the highest affinity for ATP, called clone 18-19, bound the nucleotide with a Kd of 100 nM at 4°C. This clone bound other nucleotides at least three orders of magnitude more weakly than ATP; affinity dropped by a factor of between two and 600 when moieties were removed from the ATP structure. EDTA chelation and metal replacement experiments suggested that Zn2+ may be coordinated by a CXXC motif in clone 18-19.
From their first phase of selection - 6.2% functional proteins in a library of 1012 sequences - the authors estimate that about one in 1011 random 80-mers has any given biological activity. They also conclude that bound metals may be a useful structural linchpin for proteins with low melting temperatures. In general, Keefe and Szostak argue that new functional proteins may be accessible through simple selection schemes.
As a protein design study, this paper hits the mark expertly. As a study of sequence evolution rates, there are some gray areas. For example, proteins from Keefe and Szostak's first eight rounds of selection appear to use their tethered mRNA in binding ATP; thus, strictly speaking, 1011 80-mers alone may not yield one functional protein. The next step for Keefe and Szostak will be to estimate whether nature has exhaustively searched functional protein space. To do this, they will need to characterize their isolates structurally and compare them with known ATP-binding proteins.