The exome factor
© BioMed Central Ltd 2011
Published: 14 September 2011
Skip to main content
© BioMed Central Ltd 2011
Published: 14 September 2011
Exome sequencing is rapidly expanding both as a technique and in its biological applications.
The decision to launch a special issue of Genome Biology coincided with my first American Society of Human Genetics meeting in Washington, DC ; exciting work using exome capture sequencing had been presented on melanoma, somatic mutations in induced pluripotent stem cells and numerous single gene disorders. The technique strongly enriches a sequencing sample for exons by using DNA capture probes targeted only to the portion of the genome constituting the exome. In addition, at this time I met and discussed exome sequencing with one of the leaders in the field, Jay Shendure, and we are delighted that he agreed to be the Guest Editor for this issue. There was no doubt for me that exome sequencing was flourishing, and so it is in this September issue that we present articles with that special 'exome factor'.
Choosing to publish a special issue on a field that is, relatively speaking, in its infancy necessitates many discussions about how best to make use of the technology and analyze the data, and these discussions have led us to present some guidelines and new approaches to exome capture sequencing. In a 'bake-off' between different exome capture techniques, Janna Saarela and colleagues  perform a systematic comparison of the two solution-based capture kits commercially available from Agilent and Roche-Nimblegen. The results show that researchers should be confident with the data garnered by either approach, and we hope that the clarity of this study will make it a useful reference, particularly to those just starting out. To provide further coverage and discussion of best methodological practice, we commissioned Shamil Sunyaev and colleagues  to review the computational and statistical approaches necessary for prioritization of variants from exome sequencing data. With typically 15,000 to 20,000 variants discovered per exome, whittling these variants down to those that are likely to be causing disease is a significant challenge. An abundance of tools has been developed for this task, and the Review therefore sets out to consider the relative merits of a number of the leading examples of these. Progressing to new methods, Katherine Smith et al.  present a novel method to further simplify variant prioritization - demonstrating that genetic linkage mapping can be applied to single nucleotide polymorphism (SNP) genotypes extracted from exome data, removing the need for array-based genotyping.
It is the lack of ascertainment bias in SNP genotyping from exome sequencing data that motivates Josh Akey and colleagues  to promote its use in the fields of human evolution, population structure and demography. In their Review, he discusses how the power to sequence to high depth the exomes of multiple individuals from several populations will provide us with insights into the mode and timing of population expansions. Indeed, in an article from Gabor Marth and the 1000 Genomes Project Exon Sequencing Pilot Subgroup , data are presented from the exomes of 700 individuals from seven populations. A project of this scale allows a unique insight into rare variants and, as might be expected but has never previously been shown, rare exomic variants show increased population specificity and are enriched for functional polymorphisms. For researchers who have previously been unable to find rare variants in their data, Tao Wang and colleagues  have developed a new technique that displays an excellent combination of sensitivity and specificity in identifying such variants.
But why should human population geneticists have all the fun? In an article from Eduard Akhunov and colleagues , 3.5 Mb of exome regions are captured from two species of wheat, a cultivated variety and a wild variety, and SNPs and copy number variations are compared, to garner information about the evolutionary history of these two varieties. The challenges that had to be overcome by these researchers are many; wheat species have undergone whole genome duplications leading to extensive polyploidy and the wheat genome assembly is significantly underdeveloped and poorly annotated. The polyploidy in particular proves challenging, as each probe could potentially hybridize to three homeologous sequences. Given the success of the authors' approach here, such pipelines may, in the future, be applied to similar problems in polyploid cancer genomes in addition to other plant species. Another non-human exome capture article comes from Eric Vallender , who has captured the exomes of two non-human primates on human exome arrays in a proof of principle of this approach. Furthermore, he shows the need for better annotation of non-human primate genomes.
Mouse geneticists, too, profit from exome capture techniques in this issue: Laura Reinholdt and colleagues  have developed and validated a set of mouse exome capture tools that will be of significant value to research in this beloved of model organisms, valuable for so much of our disease research. In another study in this issue, Karen Steel et al.  have applied commercially available mouse exome tools to a model for otitis media, finding a mutation in the Islet1 gene that is likely to predispose carriers to middle ear infections.
The understanding of the genetics and biology of Mendelian disease is surely the area in which exome sequencing has seen the largest growth in the past couple of years. In particular, genes associated with nonsyndromic mental retardation were pinpointed using this approach . Some of the authors of this work, including Christian Gilissen, have reviewed the progress in our understanding of Mendelian disease in this issue . Because of the ease of variant prioritization, highly penetrant recessive diseases are the most widely studied by exome sequencing. However, that is not to say that these are the diseases exclusively studied by these techniques. Heterotaxy is a cardiovascular disorder characterized by incomplete penetrance and heterogeneity and thought to be related to defective left-right patterning. By performing exome sequencing on a complex heterotaxy phenotype proband from a consanguineous family, Stephanie Ware and colleagues  have identified a recessive mutation in SHROOM3, a gene not previously associated with left-right patterning. Furthermore, while the identification of novel disease genes is exciting, there is other knowledge to be garnered from exome sequencing data, such as an improved understanding of modes of inheritance and of phenotypic spectra associated with known disease genes. An article from Tadafumi Kato and colleagues  identifies variants associated with autosomal progressive external ophthalmoplegia (PEO), an inherited mitochondrial disease characterized by slowly progressive paralysis of the extraocular muscles. The authors identify a novel mutation in RRM2B, a gene whose heterozygous rare variant has also been shown to cause an autosomal dominant form of PEO. Thus, mutations in a single gene are the root cause of both forms of the disease.
Finally, in an elegant use of targeted capture, Karen Avraham and colleagues  identify multiple genes unique to different populations that are responsible for heritable hearing loss. The authors use an approach in which capture probes are designed to hybridize to genes associated with hearing loss in mouse, in addition to those responsible for human deafness. Such capture probes were applied to families from several Israeli and Palestinian genetic populations, identifying causative alleles in both Arab and Jewish probands. Again, some help from the mouse has significantly progressed our understanding of human disease .
After reading these articles, you may find yourself wondering what the future holds for exome sequencing. In a question and answer piece  answered by Jim Mullikin, Les Biesecker and Kevin Shianna, Mullikin and Biesecker argue that exome capture will endure as a useful tool in the clinic, while Shianna instead believes that it has an inevitable expiry date owing to the rapidly declining cost of whole genome sequencing. In his editorial, Jay Shendure  discusses the 'high yield' of exome sequencing and how it is this benefit that may ensure its longevity. It seems to me that the greatest legacy of exome sequencing may in fact be the application of lessons learned in a future in which whole genome sequencing is king.
I am proud to present to you the collection of articles contained within this issue and I would like to thank all of those involved, notably the Guest Editor Jay Shendure, who has contributed invaluably to this exciting project, and Genome Biology's Editor Clare Garvey, in addition to the rest of the Genome Biology Editorial and Editorial Production teams. I hope you find the issue as timely and as interesting as we do.