Synthetic biology: advancing biological frontiers by building synthetic systems
© BioMed Central Ltd 2012
Published: 20 February 2012
Skip to main content
© BioMed Central Ltd 2012
Published: 20 February 2012
Advances in synthetic biology are contributing to diverse research areas, from basic biology to biomanufacturing and disease therapy. We discuss the theoretical foundation, applications, and potential of this emerging field.
Synthetic biology is an emerging field of interdisciplinary research that seeks to transform our ability to probe, manipulate, and interface with living systems by combining the knowledge and techniques of biology, chemistry, computer science, and engineering. Its main aim is to increase the ease and efficiency with which biological systems can be designed, constructed, and characterized. Core efforts in the field have focused on the development of tools to support this goal, including new approaches to biological design and fabrication. Although the first generation of synthetic systems demonstrated genetic circuits that encode dynamic behavior, cellular computational operations, and biological communication channels, more recent research has focused on implementing synthetic biological devices and systems in diverse applications, including disease therapy, environmental remediation, and biosynthesis of commodity chemicals. As the field matures, synthetic biology is advancing biological frontiers by expanding biomanufacturing capabilities, developing next-generation therapeutic approaches, and providing new insights into natural biological systems. Here, we review the theoretical foundations, diverse tool kits, and engineered systems that have emerged from synthetic biology and discuss current as well as potential future applications, which include in-depth studies of basic biology (such as understanding endogenous signaling pathways and feedback circuits) and new frontiers in health and medicine (such as identification of diseased cells and targeted therapeutics).
A central aim of synthetic biology is to increase the ease and efficiency with which biological systems can be designed, constructed, and characterized. Although the manipulation of biological organisms and molecular pathways long preceded the emergence of synthetic biology, the engineering of biological systems has been a largely ad hoc exercise. A main reason is that biology is inherently diverse, mutable, and context specific. Natural biological substrates, including genetic elements such as promoters and genes, do not always behave predictably when implemented in different combinations, and details such as how individual parts are physically connected can vary widely across different construction methods. As a result, the components designed and assembled for one biological system often cannot be predictably reused in another system. Synthetic biology seeks to address this challenge by implementing a more 'engineering-ready' conceptual framework that emphasizes the need to generate and report biological constructs in a manner that is conducive to their understanding and utilization by a broad community of researchers.
The application of engineering tools such as abstraction, decoupling, and standardization was proposed early in the emergence of synthetic biology to support the efficiency and scaling of the biological system design process . An abstraction hierarchy that dissects the engineering process into several design levels - DNA, parts, devices, and systems - provides synthetic biologists with a means to manage complexity and distribute tasks. The design process at each abstraction level can be performed relatively independently of the other levels, and detailed information critical to one abstraction level need only be considered by designers operating at that level. This division of labor reduces the amount of information that each designer must be expert in to successfully design a part, device, or system.
Decoupling refers to the strategy of partitioning a complicated problem into simpler tasks that can be tackled separately and assembled into a complete solution. The separation of design and fabrication processes is an important example of decoupling supported by advances in design tools and fabrication platforms. The increasing efficiency and decreasing cost of DNA synthesis allow synthetic biologists to design novel systems with the confidence that DNA components can be readily synthesized by commercial sources. Furthermore, advances in DNA sequencing and synthesis provide researchers with access to biological components encoding functions of interest using sequence information deposited in databases, eliminating the need for physical exchange of genetic materials.
Standardization takes several forms, including standardization of physical assembly, functional assembly, and characterization/measurements. Early physical assembly standards used biological parts flanked by standardized sequences, enabling the interchangeable combination and sequential assembly of parts conforming to the specified standard through a constant restriction-enzyme/ligation-mediated cloning strategy [2, 3]. Significantly less progress has been made in the field on functional assembly standards, which focus on identifying sequence interfaces between two types of parts (for example, ribosome binding site (RBS) and gene) that allow functional coupling and predictable activity, independent of the specific sequence of each part. Several early physical-assembly strategies encountered obstacles because the proposed standards impaired the functional assembly of parts by requiring the insertion of standard sequences between each part. In response, the field is shifting to assembly methods that do not require restriction-enzyme-mediated cloning [4, 5]. Finally, technical measurement and reporting standards have been proposed to eliminate discrepancies that result from disparate experimental methods and to provide more reliable and thorough characterization data . Standardized characterization data will support reliable sharing and reuse of parts, devices, and systems such that new designs can build on the foundation of previous work and move beyond the ad hoc model of system development.
As synthetic biological systems become increasingly sophisticated, fabrication methods with larger capacities, greater precision, higher speed, and lower cost have become increasingly important. Outpacing the development of novel parts and devices, a number of groundbreaking fabrication techniques have been demonstrated in recent years, allowing researchers to focus on system design while outsourcing or performing system fabrication with higher efficiencies than was previously possible. Advances in multiplex oligonucleotide synthesis and assembly with microfluidic arrays have allowed cheaper de novo synthesis of gene-length fragments [7–9]. Furthermore, several techniques have been developed for the assembly of large DNA fragments, moving the field beyond laborious and time-consuming molecular cloning.
For example, transformation-associated recombination (TAR) in the yeast Saccharomyces cerevisiae has been used to construct yeast artificial chromosomes encoding genes and pathways isolated from several different organisms [4, 10]. Yeast artificial chromosomes can be further modified with bacterial artificial chromosome sequences to transfer the constructs to bacteria and subsequently to mammalian cells . Enzymatic in vitro assembly methods, such as one-step isothermal DNA assembly, can allow DNA molecules of several hundred kilobases to be assembled without restriction-enzyme-mediated digestion [5, 12]. A combination of in vitro and TAR-based assembly methods was used to assemble and clone the first bacterial genome from chemically synthesized oligonucleotides . However, large sets of parts encoding similar functions with distinct sequences are needed to avoid undesired recombination events between components that share similar sequences when assembling large genetic systems with recombination-based strategies.
In addition to DNA synthesis and assembly, methods have been developed for high-throughput genome modification. Multiplex automated genome engineering (MAGE) uses the bacteriophage λ-Red single-stranded-DNA-binding protein β to achieve allelic replacement in Escherichia coli. This process can greatly accelerate the optimization of biological systems and metabolic pathways, provided that the target genes are known and that an efficient screening method is in place to identify the desired variants within the diverse libraries generated . An alternative method termed trackable multiplex recombineering (TRMR) has been developed to support applications in which a priori knowledge of which target gene to modify is lacking, enabling rapid mapping of genes and quantification of population dynamics . A complementary technology called hierarchical conjugative assembly genome engineering (CAGE), which has been used to combine portions of a genome that have been modified by MAGE, was also recently described . Although genome modification has been reported in yeast , most high-throughput methods have been limited to demonstrations in E. coli and the extension of these technologies to mammalian cells remains an important challenge.
The synthetic biology toolbox: common components used in synthetic biological systems
Provide continuously ON gene expression at pre-determined levels
Provide conditional and, in certain cases, titratable gene expression in response to inducer signal
Control protein production levels by regulating mRNA stability or translation initiation in response to molecular input
Alternative splicing modulators 
Control protein production levels or protein activity by regulating alternative splicing of mRNA in response to molecular input
RNase substrate libraries 
Control protein levels through tunable hairpin elements that direct transcript cleavage
Modulate protein levels by shortening protein half-lives
Provide biosensing and modulate protein activity by conditionally splicing inactive protein fragments together into functional wholes
Regulate signaling and metabolic pathway flux by controlling the localization and stoichiometry of pathway components and intermediate products
As the diversity of gene regulatory processes in natural biological systems comes to light, efforts have also been directed to developing control devices that act through posttranscriptional and posttranslational mechanisms. In addition to parts such as degradation tags [24, 25] and split inteins [26, 27], non-coding regulatory RNAs have been used to construct a number of control devices . In one example, microbial gene expression was regulated by engineered RNA-responsive regulators (termed 'riboregulators') that modulate translation initiation by either obstructing or releasing the RBS of a target gene in response to the presence of a separately transcribed RNA sequence . Researchers have demonstrated the utility of riboregulators in a variety of applications, including protein localization studies, perturbation of stress response networks, and programmable cell killing . RNA-based devices responsive to small-molecule and protein inputs have also been demonstrated, exerting control over both transgenic and endogenous protein expression in bacteria, yeast, and mammalian cells [31–33], leading to applications ranging from bacteria-mediated detection and breakdown of pesticides  to disease-marker detection and cell-fate regulation in mammalian cells . The unique properties of RNA-based control devices - including ease of design and construction, small genetic footprint, high energy efficiency, fast regulatory time scales, and the ability to tailor input responsiveness and regulatory stringency - have made RNA a versatile substrate for designing programmable control systems.
In addition to controlling protein levels, synthetic biologists have developed tools to modulate the spatial organization of protein molecules inside cells, resulting in new strategies for regulating or rewiring cellular activities encoded in metabolic and signaling pathways . In one example, researchers constructed synthetic feedback loops within the yeast mating mitogen-activated protein (MAP) kinase pathway by recruiting modulator proteins to the pathway scaffold protein Ste5 through fusing leucine zipper domains to each component, and demonstrated circuits with pulse generator, accelerator, delay, and ultrasensitive switch functions [37, 38]. In another example, synthetic protein scaffolds that spatially recruit metabolic enzymes were implemented in E. coli, enabling the stoichiometric optimization of three mevalonate biosynthetic enzymes and achieving a 77-fold increase in product titer while avoiding cellular toxicity caused by the accumulation of a pathway intermediate . As an alternative to protein-based scaffolds, rationally designed RNA strands have recently been shown to assemble into higher-order structures, including sheets and nanotubes, inside bacterial cells . An RNA scaffold was applied to a two-enzyme hydrogen biosynthesis pathway and shown to increase hydrogen production by up to 48-fold compared with an unscaffolded system . These examples highlight the utility of spatial engineering in enhancing and modifying biological pathways.
One of the hallmarks of synthetic biology has been the drive to engineer biological systems from the bottom up. Model-driven design of synthetic gene circuits has demonstrated the ability to build circuits of specified function [41–43]; differences between models and realized circuits have illuminated important and unique aspects of biological system behavior, such as the effects of degradation processes, cooperativity, and noise [44–46]. In addition to inspiring the design of more robustly operating systems, the insights gained through synthetic approaches have contributed to our understanding of natural biological systems .
Genetic circuits that perform computations and logical evaluations of cellular information provide the ability to assess intracellular states and environmental signals. They transmit this information into changes in cellular function, such as production of easily assayed readouts, activation of metabolic pathways, or initiation of cell-fate decisions. Towards this goal, genetic circuits and devices capable of performing logical evaluations have been built to detect small molecules (using tandem promoter systems  and RNA devices ), and small RNAs such as small interfering (si)RNAs (using tandem RNA interference (RNAi) target sites ) (Figure 1b). These various schemes have demonstrated the classic NOT, OR, NOR, and AND gates that are used to build larger logic evaluators and computations.
Methods for counting and maintaining memory of system states will enable a broader spectrum of intracellular computing. A genetic circuit that can count up to three exposure events to a small-molecule inducer was built in bacteria by nesting polymerase-promoter pairs controlled by riboregulators responsive to an inducible transactivator . Although this system captured brief induction pulses, system performance was highly dependent on pulse duration and frequency. The incorporation of genetic memory offers an alternative strategy to increase the robustness of counting events over longer time frames. A three-event counter circuit was demonstrated by using DNA recombinase-based cascades that record each event as a permanent change to the DNA, where the output of each recombinase event would 'prime' the next promoter-recombinase pair in the circuit . Synthetic networks of feedback loops have been built as memory circuits that lock a system in one state through sustained production of proteins following a transient signal that initiates the state. For example, toggle switches engineered to show bistability in bacteria  and mammalian cells  use architectures of mutually inhibitory feedback loops to achieve reversible memory of small-molecule pulses. As another example, a positive feedback loop built from a synthetic transcriptional activator cascade demonstrated heritable memory over many generations in yeast .
One recurrent limitation in adapting biological systems to perform computation through the rules of binary logic is the analog nature of the responses. In particular, gene expression leakage in the OFF state can contribute to improper input processing and high basal output, diminishing an evaluator's signal-to-noise ratio [48, 53, 55]. In addition, control of highly lethal proteins and proteins that mediate irreversible genetic changes requires stringent OFF states. To address this issue, researchers have layered transcriptional and posttranscriptional control elements within genetic circuits to provide strategies for achieving stringent regulation of transgenes in mammalian [56, 57] and bacterial cells . In one example, an inducible promoter was layered with repressible expression of a small hairpin (sh)RNA to achieve undetectable expression levels of the highly lethal diphtheria toxin in the OFF state, thus enabling induced cell death only in the ON state . Although tight OFF states are desirable for binary computing, biological computing necessarily exploits the analog and tunable nature of gene expression. Connecting logical circuit outputs to changes in cellular state requires the ability to both identify thresholds of expression at which cellular behavior diverges and tune the output to cross that threshold when triggered. Combining the computational ability of logical evaluators with improved strategies for leakage minimization and output tuning should enable more robust computing. These tools can expand our ability to detect and treat diseases by increasing diagnostic certainty and improving precision in gene expression, and can also be used to probe previously inaccessible information sets, such as the temporal and spatial profiles of particular developmental genes, which will inform our fundamental understanding of biology.
Communication systems are required to coordinate events and tasks between different cells in a population. Synthetic communication circuits have been engineered in bacteria using various bacterial quorum-sensing systems. In these systems, a lactone signal is broadcast with increasing strength as cell density increases. At a given threshold level, lactone binds and activates a transcriptional regulator, upregulating the expression of a target gene. Broadcasting and receiving can be incorporated within a single cell population or distributed between 'sender' and 'receiver' cells. Incorporating both functions in a single population programmed to regulate a killer gene resulted in population control and demonstrated how population heterogeneity can be exploited to achieve a robust population response  (Figure 1c). Segregating tasks and localizing the sender population established a radial gradient of signaling molecules. Coupling the quorum-sensing circuitry to a band-pass circuit, which detects a specified range of input concentrations, achieved formation of various radial patterns in the receiver cells . In addition, connecting bacterial quorum systems to synthetic circuits has demonstrated dual-population consensus response and symbiosis in biofilms [60, 61] and synchronized genetic clocks  (Figure 1d). Finally, coupling a light-responsive device  to logic-processing circuitry and a communication module resulted in a biological edge detector  (Figure 1e). These examples demonstrate how synthetic circuits can distribute and coordinate computational tasks across a population of cells to achieve complex responses similar to what is observed in natural pattern formation and development.
Beyond bacterial systems, mammalian receiver cells have been engineered to respond to volatile chemical signals  and metabolic conditions  using engineered synthetic promoters. These receivers can potentially be paired with various processing circuits and sender cells to generate synthetic hormone-signaling systems and synthetic ecosystems. The eventual coupling of metabolic functions and cell-fate circuitry to synthetic hormone-signaling systems will enable spatial patterning of cell differentiation and timing of coordinated cellular responses, a requisite for complex tissue formation and function.
Despite remarkable advances in the design and construction of increasingly sophisticated genetic circuits over the past decade, the transition of these systems to real-world applications has been constrained by the limited availability of devices that can connect synthetic circuitry with information in living systems. However, synthetic biologists are developing new ways to connect natural and engineered systems. For example, exploiting existing connections between synthetic circuitry and intracellular information, researchers have used the natural correlation between DNA damage and proteolysis of the ON state inhibitor λ cI in a genetic toggle switch to record transiently induced DNA damage through the formation of a biofilm . Taking another approach, researchers have constructed synthetic sensor devices from natural components, such as promoter-repressor pairs [67, 68], signaling pathway components , and small RNAs and their target sites [52, 57], to extract information from biological systems. As the range of sensor devices, processing circuitry, and output modules expands, synthetic biology is poised to address a broad scope of biological, medical, and biotechnological challenges.
Biomanufacturing is one of the more compelling and immediate applications of biotechnology that promises sustainable synthesis strategies for alternative energy sources, commodity chemicals, and high-value specialty chemicals such as therapeutic drugs. A major challenge of biosynthetic pathway engineering lies in balancing the levels and activities of the many heterologous pathway enzymes to achieve optimized productivity and yield of desired compounds in the microbial host. Synthetic biology is transforming biosynthesis capabilities by providing new tools that support pathway construction and optimization. For example, researchers have recently combined TAR-based assembly strategies with sets of biosynthetic pathway parts (including enzyme coding regions, promoters, and terminators) to demonstrate one-step, whole-pathway assembly for a variety of natural-product pathways [75–77]. In another example, combinatorial libraries of tunable intergenic regions (TIGRs) harboring a number of RNA regulatory elements, including terminators, RNase cleavage sites, and stabilizing hairpins, were assembled in the non-coding regions between three heterologous enzymes in the mevalonate biosynthetic pathway expressed from a polycistronic transcript in E. coli. Researchers screened library variants for the TIGR sequences that resulted in optimal relative expression levels of each enzyme to increase mevalonate production; the best mevalonate producers decreased accumulation of a toxic intermediate and increased growth rate . Libraries of modular control elements, including promoters and RNA regulatory elements, that have broad ranges of predictable activities have also been generated [19, 79]. Recently, a library of RNase cleavage elements was used in yeast to titrate a key enzyme and thus flux through the endogenous ergosterol pathway, which competes with synthetic terpenoid pathways for the common precursor farnesyl pyrophosphate . Finally, several new tools supporting colocalization of heterologous enzymes, such as protein- and RNA-based scaffolds, are being used to develop pathway optimization strategies based on spatial engineering [39, 40] (Figure 2b).
By developing new strategies to interface with and manipulate natural biological systems, synthetic biology holds exciting promise in developing new therapeutic approaches. For instance, synthetic biologists are developing genetic circuits that link therapeutic activities to the detection of molecular disease signals to develop targeted therapeutics with increased efficacy and safety. In one example, a layered microRNA (miRNA)- and transcription-factor-based logic circuit was used to distinguish a cervical cancer cell line (HeLa) from other cell lines based on the detection of a unique miRNA profile . Positive identification of HeLa cells through this logic circuit was subsequently linked to either expression of a reporter protein, as a model diagnostic device, or expression of a protein that led to cell death as a model therapeutic device. In another example, to restrict cell death to diseased cells showing hyperactive signaling, researchers developed protein-responsive RNA devices that could detect increased signaling through the NF-κB and Wnt pathways and transmit this information into changes in the expression of a clinically relevant suicide gene that sensitizes cells to an apoptosis-inducing prodrug  (Figure 2c). These types of autonomous sense-and-control circuits offer potential applications in the long-term surveillance and intervention of chronic diseases, such as gout and diabetes [68, 81]. Circuits currently under development that link genetic targets to clinician-modulated external inputs will provide an unprecedented level of temporal and spatial control over complex therapeutic activities. For example, systems have been described that support light-modulated glucose homeostasis  and drug-modulated control over in vivo gene expression  and T-cell proliferation .
The biological parts, genetic circuits, and fabrication techniques that have been developed and continue to be improved on offer exciting potential in diverse applications, from environmental engineering to regenerative medicine. Synthetic biological systems capable of detecting, reporting, and/or removing hazardous substances have been reported [85–88], and their implementation in robust host organisms suitable for environmental release will provide a new paradigm for environmental remediation. In the area of health and medicine, synthetic intercellular communication systems that regulate spatial patterning, timing of coordinated cellular responses, and tissue homeostasis have the potential to make significant contributions to tissue engineering. Furthermore, synthetic control circuitry may reduce the inherent tumorigenicity of stem cells  and improve the efficiency of induced pluripotent stem cell reprogramming . Novel genetic circuits capable of guiding the ex vivo construction of complex tissues may be built in the foreseeable future as researchers continue to unravel the systems biology behind cell-fate decisions [91, 92].
Efforts in synthetic biology so far have covered a wide range of topics spanning broad conceptual frameworks and specific circuit designs, and the future direction of synthetic biology is by no means limited to the few areas highlighted here. However, a unifying driving force in the field has been the desire to efficiently build biological systems, whether to improve our fundamental understanding of biology or to provide solutions for pressing global challenges. By developing conceptual frameworks and technical tools for the design, construction, and characterization of novel biological systems that can perform autonomous functions and interact with natural biological systems, synthetic biology is poised to transform our ability to probe, understand, and manipulate biology.
CDS is supported by funds from the National Institutes of Health, National Science Foundation, and Defense Advanced Research Projects Agency. YYC is supported by the Harvard University Society of Fellows.