High-throughput phenotyping of multicellular organisms: finding the link between genotype and phenotype
© BioMed Central Ltd 2011
Published: 28 March 2011
High-throughput phenotyping approaches (phenomics) are being combined with genome-wide genetic screens to identify alterations in phenotype that result from gene inactivation. Here we highlight promising technologies for 'phenome-scale' analyses in multicellular organisms.
The availability of complete genomic sequences of many model organisms has made it possible to perform highly informative genome-wide functional analyses. For multicellular organisms (including the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster, the plants Arabidopsis thaliana and rice, as well as mouse), phenotypic analysis of genetic mutations is still one of the most effective ways to explore the function of a gene. Collections of strains with mutations in nearly every gene are now available, making it possible to analyze the phenotypes of a large number of independent strains. However, conventional analytic approaches, such as high-magnification microscopy at the single-cell level, require manual manipulation of samples and screening by eye, thus limiting throughput and presenting bottlenecks to large-scale genetic studies in multicellular organisms. Therefore, development of high-throughput methods, including automation in phenotyping and screening, is a strategy that is now coming to fruition. Systematic large-scale phenotyping efforts have begun to generate information on a previously unattainable scale. For example, it was recently shown that even a highly dynamic process such as the division of human cells can be studied on a genome-wide scale by live imaging.
Cultured cells have also proved amenable to high-throughput phenotyping. Although more challenging, the study of living organisms can provide insights into biological pathways, regulatory networks and/or cellular activity and behavior not obtainable from cultured cells [3–6]. Large-scale acquisition of phenotypic data can then predict important biological outputs, such as the roles of individual genes in development. Thus, high-throughput phenotyping approaches (that is, phenomics) can encompass a broad range of model systems and techniques aimed at understanding the link between genotype and phenotype.
A good example of the evolution of high-throughput phenotyping is provided by RNA interference (RNAi) screens in the worm C. elegans, where recent advances in robotic sample preparation have facilitated high-throughput screens. However, C. elegans is only one of many systems in which innovative technologies for high-throughput studies are being developed. Indeed, the development and use of robotic platforms has also enabled high-throughput phenotypic analysis of plant growth and development at a larger physical scale. Here, we use C. elegans and Arabidopsis as the primary examples of the exciting new wave of approaches to functional genomics [7–10]. We focus on current advances in high-throughput phenotyping (HTP) for the analysis of C. elegans and Arabidopsis, as lessons learned from these organisms can be broadly applied to other animal and plant species.
Reverse genetic screening has proved a powerful method to identify gene function [11, 12]. RNAi is a well-conserved phenomenon observed in many different organisms [13–23]. It was originally discovered in plants, and became one of the first genome-wide techniques used to study loss-of-function phenotypes in several model systems and in mammalian cell culture [24–27]. RNAi screens have become invaluable tools in assessing genotype-phenotype relationships [28, 29], and several large-scale RNAi libraries have been generated to identify essential genes and those with novel functions [16, 30–32]. For example, an RNAi library of 750 ovary-enriched genes was generated to study the function of genes involved in embryogenesis. RNAi genome-wide screens in Drosophila have been performed using cell culture [12, 15]. The genome-wide collection of transgenic constructs that has been prepared for in vivo screening has underpinned a number of studies, including a screen that led to the identification of the sex-peptide receptor of Drosophila [33, 34]. Large-scale mutagenesis and phenotyping projects are also under way in mammalian cells, and are likely to yield similarly important results [23, 35].
New computer-aided visualization methods, which automatically distinguish images of worms grown on agar plates, are also available. In addition, automated phenotyping based on machine-learning analysis of images obtained from movie frames can be used to study embryo development. These systems overcome previous bottlenecks in image analysis by scoring image data in a fully automated manner and providing rapid quantitative output that would not be obtainable at high throughput by manual scoring. Because high-throughput phenotyping generates a large volume of data, which need to be standardized, normalized and analyzed, statistical and bioinformatics approaches are also becoming increasingly available.
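As an illustration of the normalization step mentioned above, the sketch below rescales raw phenotype scores within each plate to robust z-scores (median and median absolute deviation). This is one common choice in screening analysis, not a method prescribed by the studies cited here; the function names and the plate/strain layout of the input are assumptions made for the example.

```python
from statistics import median


def robust_z_scores(values):
    """Return robust z-scores: (x - median) / (1.4826 * MAD)."""
    med = median(values)
    # Median absolute deviation: a spread estimate that resists outliers.
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        raise ValueError("MAD is zero; values are too uniform to scale")
    # 1.4826 makes the MAD comparable to a standard deviation for normal data.
    scale = 1.4826 * mad
    return [(v - med) / scale for v in values]


def normalize_by_plate(measurements):
    """Normalize within each plate.

    measurements: dict mapping plate id -> dict mapping strain -> raw value
    (a hypothetical layout chosen for this sketch).
    """
    return {
        plate: dict(zip(wells, robust_z_scores(list(wells.values()))))
        for plate, wells in measurements.items()
    }
```

Normalizing per plate rather than across the whole screen is a design choice that removes plate-to-plate batch effects, at the cost of assuming that most strains on a plate have no phenotype.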
The resolution at which biological samples can be analyzed has greatly increased in recent years as fluorescence microscopy strategies have been developed to characterize gene expression at the single-cell level in C. elegans [1, 44]. Methods to quantitatively measure gene-expression dynamics with cellular resolution are anticipated, and will be advantageous to functional genomic studies. However, the challenge of capturing high-resolution images that represent the entire sample remains formidable. Extensive high-throughput time-lapse fluorescent microscopy will only become a reality with improvements to the automation of microscopy imaging and the processing of large datasets.
The identification of genes that underlie phenotypic variation for complex agronomic traits such as biomass and drought tolerance will be key to biotechnology-aided crop improvement. Because such traits are often controlled by many genes that are also heavily influenced by the environment, the discovery of their genetic basis often requires large-scale phenotyping strategies. Mutational methods such as chemical or fast neutron mutagenesis can be used in forward genetic screens, whereas insertional mutagenesis via T-DNA lines or transposons is used to generate libraries of loss-of-function mutants for reverse genetic screens. Arabidopsis has led the way in plant phenotypic profiling because insertional mutations of most genes are available [45–51]. Rice, as a leading experimental model for monocotyledonous crops, also has a panel of insertional mutant lines. Insertional mutagenesis has also been applied to other crops, including maize and Medicago truncatula [53, 54]. However, advances in phenomics will be essential to fully realize the potential of these powerful genetic resources.
The investigation of complex traits such as root morphology, leaf size, plant height, flower shape or seed weight requires analyzing hundreds to thousands of plants, which poses a major challenge. Furthermore, gene response as a function of the environment must be accounted for. For this reason, tools specific for digital phenotyping together with automation of this process in controlled environments are necessary for high-throughput screening of plant phenotypes. Digital phenotyping offers the major advantage that data can be reanalyzed when new traits of interest or new types of measurements emerge. As the demand for digital image-acquisition technologies increases, several efforts have been made to generate software tools capable of producing objective and quantitative analyses of large image sets. Automated platforms have been developed for Arabidopsis and for crop plants to allow different aspects of automated visualization and image quantification. For example, the PHENOPSIS platform was used to dissect plant responses to soil water deficit in a collection of natural accessions of Arabidopsis. The PHENODYN platform imposes drought scenarios and has been used to image maize and rice plants. In addition, several efforts to improve aspects of automated visualization and image quantification for high-throughput phenotype scoring (for example, seed germination, hypocotyl growth, leaf-area development and root growth dynamics) have been made for Arabidopsis. Specifically, the high-throughput seed-germination analysis platform GERMINATOR was used to screen for natural variation in a population of 165 recombinant inbred lines, which revealed several quantitative trait loci (QTLs) for salt tolerance. High-resolution measurements of hypocotyl growth and shape have been obtained by automated quantification of time series of electronic images using HYPOTrace.
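To make the idea of image quantification concrete, the following sketch estimates projected leaf area from a top-view RGB image by counting pixels that pass a greenness test. It is a toy illustration only and does not reproduce the algorithm of any platform named above; the excess-green index (2G - R - B) is used here as one simple segmentation criterion, and the threshold, the image representation and the function name are assumptions.

```python
def projected_leaf_area(image, mm2_per_pixel, min_excess_green=20):
    """Estimate projected leaf area in mm^2.

    image: rows of (r, g, b) tuples with 0-255 channels (assumed layout).
    mm2_per_pixel: calibration factor, assumed to come from a reference
    object of known size in the field of view.
    min_excess_green: hypothetical threshold; real images need tuning.
    """
    plant_pixels = 0
    for row in image:
        for r, g, b in row:
            # Excess-green index: high for green tissue, low for soil or tray.
            if 2 * g - r - b >= min_excess_green:
                plant_pixels += 1
    return plant_pixels * mm2_per_pixel
```

Because the raw image is kept, the same data can be rescored later with a different threshold or a different trait definition, which is the reanalysis advantage of digital phenotyping noted above.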
Other examples of fully or partially automated imaging platforms for non-destructive image-based phenotyping are LeafAnalyser, LAMINA and GROWSCREEN 3D [59–61]. These computer-based tools provide quantitative descriptors for leaf shape and size. A shortcoming of most of these tools is that they are designed to address very specific questions. Moreover, most traditional phenotype-scoring systems are based on endpoint analysis, and therefore do not easily capture the dynamic aspects of complex traits.
Recent approaches to capture these aspects have incorporated time-course data acquisition so that transient events and subtle temporal changes can be observed. However, the challenge of observing dynamic growth processes and responses to environmental stimuli, through the combination of automated time-lapse imaging with automated image analysis, remains. Many image-analysis-based software tools have focused on quantifying root growth rates and root structure. Advances in machine vision and computation of automatic trait evaluation have facilitated digital reconstruction of root systems and have potentially increased the levels of throughput for phenotyping in plants. Examples of software that allow higher-throughput phenotyping are RootTrace, KineRoot, SmartRoot, RootLM, Phytomorph [67, 68], RootFlow and WinRhizo.
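The difference between endpoint and time-course scoring can be sketched in a few lines. Assuming that root length has already been extracted from each frame of a time-lapse series, the example below computes an overall growth rate by least-squares regression and per-interval rates that can expose transient changes. It is a minimal sketch under those assumptions and is not the method of any of the tools listed above; the function names and units are illustrative.

```python
def growth_rate(times_h, lengths_mm):
    """Least-squares slope of length against time, in mm per hour."""
    n = len(times_h)
    if n != len(lengths_mm) or n < 2:
        raise ValueError("need two or more paired observations")
    mean_t = sum(times_h) / n
    mean_l = sum(lengths_mm) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (l - mean_l)
              for t, l in zip(times_h, lengths_mm))
    return sxy / sxx


def interval_rates(times_h, lengths_mm):
    """Growth rate between each pair of consecutive frames (mm per hour).

    Assumes the time points are strictly increasing.
    """
    pairs = list(zip(times_h, lengths_mm))
    return [
        (l1 - l0) / (t1 - t0)
        for (t0, l0), (t1, l1) in zip(pairs, pairs[1:])
    ]
```

An endpoint measurement would report only the final length, whereas the interval rates show when growth accelerated or stalled, for example after a change in the environment.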
Phenotypic databases for crop plants
Oryza Tag Line (OTL)
Rice Mutant Database (RMD)
SCRI Barley Mutants
Tomato Mutant Database
Soybean Mutation Database
Mutational analysis remains the gold standard for identifying and characterizing gene function and this is being facilitated by high-throughput phenotyping. Given the demand for high-throughput phenotypic analysis in many organisms, we can expect the further development of large-scale phenotyping to unravel complex genotype-phenotype relationships. As an example, automated microscopy provides the opportunity to collect vast amounts of data that need to be standardized, normalized and analyzed. This increases the need for community resources in which these large datasets can be stored and searched. It would be of great benefit if large-scale phenotypic data could be easily compared and shared between labs. However, current limitations to the reuse and sharing of such data include the lack of standardized vocabulary terms, experimental parameters and quantitative benchmarks. Therefore, there is a pressing need for clearly defined standards and terms agreed upon by a given community. To achieve this goal, databases that contain phenotypic information and, especially, integration of phenomic and other genome-wide data are required. Multi-organism phenotype-genotype databases that facilitate cross-species identification of genes associated with orthologous phenotypes are now becoming available (for example, PhenomicDB) [83, 84]. In the next few years, the full benefit of such large datasets will only be realized by combining genomic, epigenomic, transcriptomic, proteomic, metabolomic and phenomic data in shared databases. Such a resource will be invaluable for the investigation and eventual elucidation of molecular mechanisms regulating the biology of multicellular organisms, and will form a comprehensive description of the whole organism, opening new paths into systems biology.
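One way to picture what a shared standard implies in practice is a record type that rejects trait names and units outside an agreed vocabulary. The sketch below is purely illustrative: the trait terms, units and field names are hypothetical and are not drawn from PhenomicDB or from any existing community ontology.

```python
from dataclasses import dataclass

# Hypothetical controlled vocabulary: trait term -> required unit.
TRAIT_UNITS = {
    "primary_root_length": "mm",
    "projected_leaf_area": "mm2",
    "germination_time": "h",
}


@dataclass(frozen=True)
class PhenotypeRecord:
    """A single measurement expressed in the shared vocabulary."""
    species: str
    genotype: str
    trait: str
    value: float
    unit: str

    def __post_init__(self):
        # Reject terms that the (hypothetical) community list does not define.
        if self.trait not in TRAIT_UNITS:
            raise ValueError(f"unknown trait term: {self.trait!r}")
        expected = TRAIT_UNITS[self.trait]
        # Enforce one unit per trait so values are comparable across labs.
        if self.unit != expected:
            raise ValueError(
                f"{self.trait} must be reported in {expected}, "
                f"got {self.unit!r}"
            )
```

For example, `PhenotypeRecord("Arabidopsis thaliana", "Col-0", "primary_root_length", 42.0, "mm")` is accepted, whereas the same measurement reported in "cm" raises an error, so incompatible entries are caught before they reach a shared database.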
We are grateful to Tim Horn and Chris Topp for providing the 'Root Imaging Platform' design. We thank Ryan Baugh and Brad Martsberger, the members of the Benfey lab and the reviewers for their helpful comments. Our work in this area is funded by the NIH as well as the NSF Arabidopsis 2010 program and NSF Plant Genome Research Program.