Impressive expressions: developing a systematic database of gene-expression patterns in Drosophila embryogenesis
© BioMed Central Ltd 2003
Published: 28 January 2003
Skip to main content
© BioMed Central Ltd 2003
Published: 28 January 2003
The establishment of a database of gene-expression patterns derived from systematic high-throughput in situ hybridization studies on whole-mount Drosophila embryos, together with new information on the reannotated Drosophila genome and several recent microarray-based genomic analyses of Drosophila development, vastly increase the breadth and depth that can be reached by developmental genetics.
These are exciting days for developmental genetics; the rapid advance of functional genomic analyses of key model systems is creating possibilities that were only scientific fantasies a few years ago. One of these fantasies was to know not only all of the genes in your favorite model organism, but also the expression patterns, in time and space, of all the genes during development. Imagine, for example, what you could do if you were a developmental biologist interested in the formation of midline structures who had access to a database that revealed the identity of all of the fly genes expressed at the midline during embryogenesis. Instead of spending time and money establishing a subtractive library or doing differential display to look for midline-specific genes, you could simply go to the database and query it for all the genes that are expressed at the midline. Then, knowing all the potential genes of interest, you could proceed directly to the functional analysis of these genes and the genetic networks in which they are involved. A recent report in Genome Biology by Tomancak et al. , on the systematic determination of patterns of gene expression during Drosophila embryogenesis, shows that this fantasy is rapidly becoming a reality. The systematic establishment of a gene-expression database, together with a flurry of new information on the annotated Drosophila genome [2–10] and several other recent microarray-based, functional genomic analyses of Drosophila development [11–14], allow us to study development at a new, more detailed level of resolution.
Driven by advances in DNA-sequencing technology, early genomic projects consisted principally of the large-scale sequencing of whole genomes. Currently, approximately 100 genomes have been completely sequenced, including around 90 prokaryotic and 8 eukaryotic genomes, and the complete genome sequences of an increasing number of model organisms are now becoming available . A first annotated version of the Drosophila melanogaster genome was released in March 2000 [16–18], and this was the first metazoan genome to be successfully sequenced by the whole-genome shotgun method [19,20]. Since then, the Drosophila genome has been reannotated twice, and the most recent of these reannotations, called Release 3, has now been finished and is available online in FlyBase [3,21]. Established by human curators with the help of sophisticated new software and significantly increased amounts of experimentally derived data from cDNAs and expressed sequence tags (ESTs), Release 3 provides a euchromatic sequence that is virtually free of gaps and is highly accurate [3,4,6–8]. The number of genes has not changed much compared with the previous annotation, Release 2, but Release 3 contains more exons and more transcripts and, importantly, has changes in over 40% of the predicted protein sequences . It is believed that this new release of the Drosophila genome sequence is now a reliable resource for molecular and genetic experiments as well as for computational analysis.
With the rapid progress of genome sequencing projects, microarrays have become powerful and popular tools with which to investigate biological questions at a genome-wide level. The adoption of microarray technology for the study of the development of Drosophila was initially rather slow, but its use has accelerated markedly, especially in the past year . One of the first microarray-based analyses of Drosophila development focused on the process of metamorphosis, using microarrays containing cDNAs corresponding to several thousand gene sequences; this was carried out before sequence information on the entire genome became available . Similar microarrays were combined with automated embryo sorting by Furlong and colleagues  to identify the targets of the transcription factor twist, which plays a key role in mesoderm development. More recently, a systematic study of gene expression throughout Drosophila development using microarrays has been carried out, in which approximately one third of all genes were surveyed at different stages of development - embryos, larvae, pupae and adults . In two further recent investigations [13,14], whole-genome oligonucleotide microarrays representing the entire protein-coding capacity of the Drosophila genome (over 13,500 genes) have been used to study specific aspects of embryogenesis in the fly. Stathopoulos and colleagues  focused on dorsal-ventral patterning in the Drosophila embryo and used whole-genome microarrays to identify targets of the transcription factor dorsal; their work identified over 40 novel dorsal target genes as well as several new tissue-specific enhancers of dorsal targets. We and our colleagues  have studied gliogenesis in Drosophila embryos by using whole-genome microarrays to identify downstream targets of the glial cells missing gene, which controls the determination of glial versus neuronal cell fate.
Although these microarray experiments have each provided a quantitative overview of changes in gene-expression levels over developmental time or between different experimental conditions [11–14], they still suffer from several limitations that have been discussed in similar studies of other organisms. Transcripts of low abundance, which are often involved in regulatory processes and thus may be of great interest for understanding development, are typically under-represented in RNA probe pools and are therefore hard to detect in microarray experiments . Moreover, in multicellular organisms, cell division and differentiation leads to an increase in tissue complexity throughout development, but whole-animal microarray analysis cannot document this spatial information. One can try to isolate mRNA from every tissue at each developmental stage and then define gene-expression information in different tissues at different times, but this is a formidable task and requires the establishment of reliable methods for tissue-specific mRNA isolation and probe preparation. Furthermore, false-positive results can arise from technical problems such as cross-hybridization of target-probe pairs or incorrect annotation of genome sequences leading to false gene-model predictions [24,25]. For all these reasons, validation of microarray data with histological methods such as RNA in situ hybridization is both important and necessary. Indeed, all of the recent whole-genome microarray studies of Drosophila development incorporate selected in situ hybridization experiments to confirm and localize expression for a subset of the studied genes [11–14,26]. Given the massive quantitative expression dataset that is coming out of whole-genome microarray experiments, it now becomes important to have access to equally massive amounts of in situ hybridization data. Ideally, one would like to have access to the expression patterns of all genes in the genome in all major embryonic tissues at all embryonic stages. This is the goal of the online in situ gene-expression atlas that Tomancak and colleagues are assembling .
Currently, over 2,000 genes, or about one sixth of all Drosophila genes, have been examined by in situ hybridization in embryos, and over 25,000 digital photographs of gene-expression patterns have been taken, annotated and stored in the database . On the basis of current production rates, the authors estimate that a first pass through the existing cDNA collections, which represent about 70% of the Drosophila genes, should be finished within a year; probes for genes that lack a suitable cDNA clone but that show significant expression by microarray analysis will be generated by genomic PCR so that expression patterns for these genes can also be determined . This will represent a major step towards the overall goal of determining the expression patterns of all genes in the fly genome and creating an integrated public resource of image-oriented gene-expression data analogous to the repositories of DNA sequences. The project will not stop there, however, but will continue to be refined as more accurate information on gene sequences, coding regions, and cDNAs becomes available. Release 3 of the fly genome has already presented marked improvements in all of these areas, and regular updates of the fly genome and the in situ expression database are planned [1–10].
Systematic high-throughput in situ hybridization of whole-mount embryos as used by Tomancak and colleagues  provides a powerful method for the global survey of gene expression in embryos . Combined with data obtained by microarray analysis, this method makes it possible to investigate gene-expression profiles in both a quantitative and a qualitative manner. Analysis of this kind of gene-expression dataset will provide a rich source of developmental-genetic information and should also make it possible to identify genes involved in developmental processes that have been missed by traditional, mutagenesis-based genetic analysis. According to published estimates for flies and other animals, less than one-third of genes lead to obvious phenotypes when mutated [32–34], so a lot remains to be discovered. The exciting days of developmental genetics have only just begun.