The Deep Genome Project

Lloyd, K. C. Kent; Adams, David J.; Baynam, Gareth; Beaudet, Arthur L.; Bosch, Fatima; Boycott, Kym M.; Braun, Robert E.; Caulfield, Mark; Cohn, Ronald; Dickinson, Mary E.; Dobbie, Michael S.; Flenniken, Ann M.; Flicek, Paul; Galande, Sanjeev; Gao, Xiang; Grobler, Anne; Heaney, Jason D.; Herault, Yann; de Angelis, Martin Hrabě; Lupski, James R.; Lyonnet, Stanislas; Mallon, Ann-Marie; Mammano, Fabio; MacRae, Calum A.; McInnes, Roderick; McKerlie, Colin; Meehan, Terrence F.; Murray, Stephen A.; Nutter, Lauryl M. J.; Obata, Yuichi; Parkinson, Helen; Pepper, Michael S.; Sedlacek, Radislav; Seong, Je Kyung; Shiroishi, Toshihiko; Smedley, Damian; Tocchini-Valentini, Glauco; Valle, David; Wang, Chi-Kuang Leo; Wells, Sara; White, Jacqueline; Wurst, Wolfgang; Xu, Ying; Brown, Steve D. M.

doi:10.1186/s13059-020-1931-9

Editorial
Open access
Published: 03 February 2020

The Deep Genome Project

K. C. Kent Lloyd¹,
David J. Adams²,
Gareth Baynam^3,4,5,
Arthur L. Beaudet⁶,
Fatima Bosch⁷,
Kym M. Boycott⁸,
Robert E. Braun⁹,
Mark Caulfield¹⁰,
Ronald Cohn¹¹,
Mary E. Dickinson^6,12,
Michael S. Dobbie¹³,
Ann M. Flenniken¹⁴,
Paul Flicek¹⁵,
Sanjeev Galande¹⁶,
Xiang Gao¹⁷,
Anne Grobler¹⁸,
Jason D. Heaney⁶,
Yann Herault¹⁹,
Martin Hrabě de Angelis^20,21,22,
James R. Lupski⁶,
Stanislas Lyonnet²³,
Ann-Marie Mallon²⁴,
Fabio Mammano²⁵,
Calum A. MacRae²⁶,
Roderick McInnes²⁷,
Colin McKerlie^11,28,
Terrence F. Meehan¹⁵,
Stephen A. Murray⁹,
Lauryl M. J. Nutter²⁸,
Yuichi Obata²⁹,
Helen Parkinson¹⁶,
Michael S. Pepper³⁰,
Radislav Sedlacek³¹,
Je Kyung Seong³²,
Toshihiko Shiroishi²⁹,
Damian Smedley³³,
Glauco Tocchini-Valentini²⁵,
David Valle³⁴,
Chi-Kuang Leo Wang³⁵,
Sara Wells²⁴,
Jacqueline White⁹,
Wolfgang Wurst^36,37,38,
Ying Xu³⁹ &
…
Steve D. M. Brown²⁴

Genome Biology volume 21, Article number: 18 (2020) Cite this article

7998 Accesses
27 Citations
36 Altmetric
Metrics details

Introduction

In vivo research is critical to the functional dissection of multi-organ systems and whole organism physiology, and the laboratory mouse remains a quintessential animal model for studying mammalian, especially human, pathobiology. Enabled by technological innovations in genome sequencing, mutagenesis and genome editing, phenotype analyses, and bioinformatics, in vivo analysis of gene function and dysfunction in the mouse has delivered new understanding of the mechanisms of disease and accelerated medical advances. However, many significant hurdles have limited the elucidation of mechanisms underlying both rare and complex, multifactorial diseases, leaving significant gaps in our scientific knowledge. Future progress in developing a functionally annotated genome map depends upon studies in model organisms, not least the mouse. Further, recent advances in genetic manipulation and in vivo, in vitro, and in silico phenotyping technologies in the mouse make annotation of the vast majority of functional elements within the mammalian genome feasible. The implementation of a Deep Genome Project—to deliver the functional biological annotation of all human orthologous genomic elements in mice—is an essential and executable strategy to transform our understanding of genetic and genomic variation in human health and disease that will catalyze delivery of the promised benefits of genomic medicine to children and adults around the world.

Rationale

A comprehensive understanding of genetics, at the single locus, gene, and genomic level, and the pathophysiological consequences of gene variation resulting in gene, RNA, or protein dysfunction are crucial to meeting societal expectations of precision medicine and critical to optimizing clinical practice. With over 80% conserved synteny and a high degree of gene orthology, the mouse and human genomes have provided a unique opportunity for comparative functional analysis and the use of genetically altered mice to interrogate the pathobiology of human disease [1]. For example, of the 6000–8000 rare genetic diseases cited by the rare disease community, the genetic basis is known for between 5000 and 6000 [2], many of which were revealed and/or confirmed by studying the causative genetic variants in mice. Nevertheless, when viewed from a genotype perspective, more than 75 to 80% of the computationally annotated ~ 20,000 genes in the human genome have not had variation in them tied to any specific phenotype [3].

CRISPR/Cas9 has enabled rapid and highly efficient targeted mutagenesis of the mouse genome. Concurrent development of in vivo analytical and imaging technologies has transformed high-throughput pipelines for precise and reproducible phenotyping of mouse mutants [4]. Deep phenotyping of virtually all body systems including cardiovascular, digestive, endocrine, immune, integumentary, lymphatic, muscular, neurological, sensory, reproductive, respiratory, skeletal, and urinary systems is possible. Further, online computational resources such as the MONARCH Initiative (www.monarchinitiative.org) that use controlled vocabularies to integrate numeric, text, and image biological information from heterogeneous datasets (e.g., MGI, OMIM, Orphanet) link genotype to phenotype and enable comparisons between mouse and human ontologies [5]. These and other advances have facilitated the coordination of industrial scale mutant mouse production and phenotyping at costs far less than previously imagined making functional annotation of all human orthologous genomic elements in mice an achievable scientific goal.

Realization of this goal is the foundation of the work of the International Mouse Phenotyping Consortium (IMPC). The IMPC is a coordinated program of 20 research laboratories in 12 countries on 5 continents dedicated to the design, production, and description of the function of human gene orthologs in the mouse genome (www.mousephenotype.org). The magnitude of this global effort reflects the spirit and scale of the Human Genome Sequencing Project. The IMPC uses homologous recombination in embryonic stem (ES) cells and CRISPR/Cas9 technology to create mutants for genes in the mouse genome followed by whole organism phenotyping of female and male cohorts of adult mice and embryos [6]. The focus thus far has been on the production and phenotyping of null protein-coding alleles in the mouse genome [7] recognizing that such resources serve as the fundamental baseline for mammalian gene function upon which the generation and study of allelic series of other mutations—hypomorphic, neomorphic, antimorphic, and hypermorphic—will prosper and will deliver further insights into gene-phenotype relationships.

Gene association with a broad diversity of human diseases, including hearing loss, ocular diseases, metabolic disorders, bone pathologies, developmental abnormalities, and others, differentiated by sex, has been revealed through IMPC-led discovery research and IMPC-fueled studies by the broader scientific community (https://www.mousephenotype.org/data/publications). This work continues on an industrial scale generating novel insights into gene-based disease phenotypes and other scientific domains such as conservation and ecology. This combined mouse production, phenotyping, and informatics approach has recently been applied to ~ 1/3 of known Mendelian disease genes and detected significant phenotypic similarities between human disease genes and mouse knockouts (i.e., null alleles) of the orthologs for approximately half the genes [8]. At least one clinical phenotype per disease was tested for the majority (95%) of the genes and matches detected across the whole range of body systems [9].

Currently, IMPC has generated null mutations for nearly 9000 genes of which over 6000 have been phenotyped. By July 2021, IMPC will complete comprehensive phenotypic annotation for over 9000 genes, representing about half of the ~ 18,000 human orthologs in the mouse (Fig. 1). In its 10-year strategic plan for 2021–2030, the IMPC calls for expanding mouse modeling studies to inform precise molecular diagnostics and targeted therapeutics for Mendelian and multifactorial disorders to maximize beneficial impacts on human health (https://www.mousephenotypetest.org/about-impc/).

If revealing the full biological role of every gene is not daunting enough, the pathobiological effects of individual human genetic and genomic variation further escalate the challenge. As exome sequencing (ES), clinical exome sequencing (cES), and whole genome sequencing become more commonly used in research and medical diagnostics to establish an etiologic molecular diagnosis, the number of variants of unknown clinical significance (i.e., VUS) is increasing exponentially and exceeding our current capabilities to interpret loss-of-function alleles [10]. Importantly, this growth has been driven not only by clinical caregivers, but also by the growing diagnostic and perceived personal utility of these advances by other stakeholders including patients and patient families. Genetically modified mice enable statistically powered, randomized, and blinded experiments using sex-balanced and age-matched cohorts of mutant mice alongside appropriate genetic controls with sufficient sensitivity and specificity to reliably assess gene function and dysfunction in relation to specific traits, development, genetic context, and/or other physical and environmental conditions. As a result, the scale and breadth of mouse genetics research is increasingly driving the use of mouse mutants and phenotyping data to inform human genomic diagnostic projects, including the US NIH Centers for Mendelian Genomics (www.mendelian.org), the Undiagnosed Diseases Network (https://undiagnosed.hms.harvard.edu/), Canada’s Care4Rare (http://care4rare.ca), The Gabriella Miller Kids First Pediatric Research Program (https://kidsfirstdrc.org/), the Genomics England Project (https://www.genomicsengland.co.uk/), and the “Fondation Maladies Rares” (https://fondation-maladiesrares.org/eng/). Animal model data, including mice, facilitate the interpretation of potential causal variants among variants of unknown significance in clinical sequencing.

Remaining gaps

Although substantial efforts to date have revealed a fuller understanding of the functional landscape of the entire mammalian genome, significant and important knowledge gaps remain that limit the ability to interpret the causal relationship of genes and genetic variations to human development and disease. For instance, the majority of published gene to phenotype studies continue to focus on genes that are well-annotated or for which knowledge of biological function and pathological consequences of mutations already exist [11]. As a result, much of the human genome remains unexplored and considered “dark” [12]. Strains of mouse mutants and associated phenotyping data are only available for approximately 60% of the mammalian genome, yet studies of human genes are significantly primed and enhanced by knowledge from model organisms [11]. Failure to fully illuminate the dark genome is a threat to realizing the full potential of the science of genomics, clinical genomics, the human genome project, and precision medicine. Overall, newly discovered disease genes significantly enhance the molecular diagnosis rates for clinical exome sequencing data by almost twofold [13]. Ambitious programs like the IMPC that intentionally focus on comprehensive phenotyping of genes with little to no functional annotation directly address the dark genome crisis and offer the potential for accelerated human health impact.

Only around 21% of the ~ 20,000 genes annotated on the reference human genome have variation in them tied to a human disease trait (Fig. 2). Further, although human and animal (mouse, rat, fly, fish, and worm) model phenotypes together have been linked to ~ 80% of human genes, with mouse model phenotypes associated with around 60% of human genes, the depth and extent of phenotypic coverage for each gene is generally limited. Critically, for both human and model organisms, our knowledge of pleiotropy and multi-morbidities is often incomplete, undermining our understanding of gene function and disease mechanisms. Moreover, our knowledge of phenotypic heterogeneity and its potentially underlying genetic bases (e.g., locus and allelic heterogeneity, multi-locus variation, modifier loci), as well as our understanding of multiple disease phenotypes converging on a single gene locus, age-dependent penetrance, and variable expressivity, are all limited. This lack of in vivo functional annotation in experimental models contributes to long diagnostic odysseys and is a significant impediment to the development of molecular entities targeted at specific gene products [14].

The significant progress that has been made in deciphering the genetic basis of rare monogenic diseases represents the low-hanging fruit of gene to phenotype relationships. Progress in elucidating complex multi-allelic and multi-locus relationships and the consequences of de novo mutations in complex disorders is also dependent on genome-wide functional descriptions and could be further explored by the development of multi-allelic [15] and multi-locus [16] models. Mouse models have already provided significant insight into complex diseases such as juvenile diabetes [17] and autism spectrum disorders [18]. Non-mouse models have and continue to contribute immensely to this effort, but definitive identification of causal relationships between mutant alleles and human diseases will often require a mammalian model. The molecular, cellular, and physiologic insights gained from mouse studies are critical to directly inform the early recognition of predictive biomarkers before clinical symptoms manifest and to drive the identification and validation of the new therapeutic targets essential for precision medicine.

Moving forward

We postulate that four steps undertaken by the collective endeavors of the global community will be needed to drive progress in genomic and precision medicine. These four steps, allied to genome-wide goals for in vitro systems and other model organisms, will deliver a deeper and more comprehensive understanding of individual gene function, make biological resources and data available to experimentally decipher disease mechanisms, interpret genomic variation, and reduce the diagnostic odysseys of patients with variants of unknown significance. The IMPC’s strategy 2021–2030 (https://www.mousephenotype.org/wp-content/uploads/2019/05/IMPC_Strategy_2021-30.pdf) is also formulated around these four steps.

1)
Complete functional annotation of the protein-coding genome. Complete loss-of-function mutations are essential for identifying the phenotypic impact of protein-coding genes and a necessary first step to interpreting clinically relevant human genetic variation causing disease. By 2021, at the conclusion of the IMPC’s current mandate, ~ 9000 human orthologous genes in mice will remain to be analyzed by the consortium. Stopping at this point, halfway through the genome, would be equivalent to the Human Genome Project halting its sequencing effort after assembling euchromatic sequence of just 11 chromosomes. It will be vital to continue efforts toward the completion of the functional analysis of the remaining unannotated protein-coding genes using mouse models.
2)
Establish functional evaluation of the noncoding genome. The entire coding region is only 3–5% of the mammalian genome. The remaining 95% of the genome plays many roles across a variety of biological processes, including DNA replication, transcriptional regulation, and genomic structure. Of particular note, variation in enhancers, silencers, promoters, and insulators can have significant impact on both normal and abnormal gene expression [19] and gene dosage phenomena [20]. Strategies must be implemented for the prioritization and modeling of mutations of conserved noncoding elements in order to fully explore the in vivo function of the darkest part of the genome.
3)
Translate functional biological knowledge to clinical knowledge. The emerging field of genomic and precision medicine relies on the ability to interpret the potential pathophysiological consequences of genetic variation in patients. Beyond academic and research considerations, the long-term financial investments in translating human genetic variation to functional phenotypes and disease mechanisms is enormous and growing. Global investments from government agencies and corporate investors are predicted to nearly triple from US$79billion to over US$200billion in 10 years (https://bisresearch.com/industry-report/precision-medicine-market.html). For example, programs from the US All of Us Project, UK Biobank, the UK 100K Genome Project, the Chinese Precision Medicine Initiative, and many others are enrolling volunteers in efforts to gather personal history, clinical information, genome sequences, and environmental metadata to understand the role of genes in health and disease in order to identify new targets for molecular therapies. For these investments to deliver on their promise to improve health outcomes from molecular diagnosis to management and therapeutic intervention, it will be necessary to integrate gene function data generated from the study of mouse and other animal models into clinical databases, such as ClinVar, ClinGen, and others. Continued development of model organism databases, along with improvements in data integration and analysis, is needed to enable mechanistic insight into genetic variation and disease and support future developments in genomic and precision medicine.
4)
Enable rapid functional assessment of genomic variation and integrate functional testing into the clinical decision-making process. While statistical inference of human patient data is currently used to discriminate disease causing from benign associated variants, a definitive molecular diagnosis is often not attainable. Even in those cases where this approach is sufficient, the delay in diagnosis is costly—psychologically, socially, and economically. It will be necessary to undertake programs for the rapid creation and analysis of mouse models of human coding variants along with more efficient approaches to phenotyping. These models, along with other mutational variants, will inform diagnostic decisions and targeted treatments. With these programs in place, clinicians and their research colleagues could rely on mouse models as diagnostic and therapeutic testing platforms, examine the pathological significance of a genetic variant in an orthologous mammalian system, interrogate gene/phenotype relationships for different types of alleles (e.g., SNV, etc.), explore potential gene/environment effects, and access comprehensive datasets to help guide clinical decision-making. In turn, mouse genetic experts will need to respond quickly with targeted phenotyping of mouse models in order for them to achieve clinical utility. It will be necessary to optimize funding and bring scientific insights from mouse functional data to inform the application of mouse models as human patient avatars, make the knowledge gained from mouse data available via the electronic medical record, and enhance education and training of clinicians in genetics and genomic medicine.

Conclusion

Despite the incredible scientific advances in genetics at the single gene and genomic level, the collective biomedical community has only begun to scratch the surface of knowledge about the diverse and varied in vivo pathobiological roles of functional elements throughout the human genome. The global mouse genetics community is primed to address the grand challenges that we face to fully comprehend the role of genes and genetics in development, biological homeostasis, systems biology, disease, and medicine, and is ready to launch a new era for the systematic study of the function of the mammalian genome. The time is right to embark on a Deep Genome Project, on the scale of the Human Genome Project, that will fundamentally enhance the knowledgebase across the biomedical sciences. Driven by the enormous potential of mouse genetics and allied to developments in other model organisms and in vitro approaches, this project will be transformative for biology, medicine, and global health.

References

Waterston RH, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–62.
Article CAS PubMed Google Scholar
Hartley T, et al. The unsolved rare genetic disease atlas? An analysis of the unexplained phenotypic descriptions in OMIM. Am J Med Genet. 2018;178C:458–62.
Article Google Scholar
Posey JE, et al. Insights into genetics, human biology and disease gleaned from family based genomic studies. Genetics Med. 2019;21:798–812.
Article Google Scholar
Brommage R, Powell DR, Vogel P. Predicting human disease mutations and identifying drug targets from mouse gene knockout phenotyping campaigns. Dis Model Mech. 2019;12:dmm038224. https://doi.org/10.1242/dmm.038224.
Article CAS Google Scholar
Mungall CJ, et al. The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res. 2017;45:D712–22.
Article CAS PubMed Google Scholar
Brown SD, Moore MW. The International Mouse Phenotyping Consortium: past and future perspectives on mouse phenotyping. Mamm Genome. 2012;23:632–40.
Article CAS PubMed PubMed Central Google Scholar
Breschi A, Gingeras TR, Guigo R. Comparative transcriptomics in human and mouse. Nat Rev Genet. 2017;18:425–40.
Article CAS PubMed PubMed Central Google Scholar
Cacheiro P, et al. New models for human disease from the International Mouse Phenotyping Consortium. Mamm Genome. 2019;30:143–50.
Article CAS PubMed PubMed Central Google Scholar
Meehan TF, et al. Disease model discovery from 3,328 gene knockouts by The International Mouse Phenotyping Consortium. Nat Genet. 2017;49:1231–8.
Article CAS PubMed PubMed Central Google Scholar
Hoffman-Andrews L. The known unknown: the challenges of genetic variants of uncertain significance in clinical practice. J Law Biosci. 2018;4:648–57.
Article PubMed PubMed Central Google Scholar
Stoeger T, et al. Large-scale investigation of the reasons why potentially important genes are ignored. PLoS Biol. 2018;16:e2006643.
Article PubMed PubMed Central Google Scholar
Oprea TI, et al. Unexplored therapeutic opportunities in the human genome. Nat Rev Drug Discov. 2018;17:377.
Article CAS PubMed Google Scholar
Liu P, et al. Reanalysis of clinical exome sequencing data. N Engl J Med. 2019;380:25.
Article Google Scholar
Waring MJ, et al. An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat Rev Drug Discov. 2015;14:475–86.
Article CAS PubMed Google Scholar
Yang N, et al. TBX6 compound inheritance leads to congenital vertebral malformations in humans and mice. Hum Mol Genet. 2019;28:539–47.
Article CAS PubMed Google Scholar
Posey JE, et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N Engl J Med. 2017;376:21–31.
Article CAS PubMed Google Scholar
Paun A, Yau C, Danska JS. The influence of the microbiome on type 1 diabetes. J Immunol. 2017;198:590–5.
Article CAS PubMed Google Scholar
Stoodley CJ, et al. Altered cerebellar connectivity in autism and cerebellar-mediated rescue of autism-related behaviors in mice. Nat Neurosci. 2017;20:1744–51.
Article CAS PubMed PubMed Central Google Scholar
Yue F, et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature. 2014;515:355–64.
Article CAS PubMed PubMed Central Google Scholar
Wu N, et al. Tbx6 null variants and a common hypomorphic allele in congenital scoliosis. N Engl J Med. 2015;372:341–50.
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Surgery, School of Medicine, and Mouse Biology Program, University of California, Davis, CA, 95618, USA
K. C. Kent Lloyd
Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
David J. Adams
Western Australian Register of Developmental Anomalies and Genetic Services of Western Australia, Department of Health, Government of Western Australia, Perth, Australia
Gareth Baynam
Division of Paediatrics and Telethon Kids Institute, Faculty of Health and Medical Sciences, University of Western Australia, Perth, Australia
Gareth Baynam
Faculty of Science and Engineering, School of Spatial Sciences, Curtin University, Perth, Australia
Gareth Baynam
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Arthur L. Beaudet, Mary E. Dickinson, Jason D. Heaney & James R. Lupski
Center of Animal Biotechnology and Gene Therapy, Universitat Autònoma Barcelona, Barcelona, Spain
Fatima Bosch
Children’s Hospital of Eastern Ontario Research Institute, University of Ottawa, Ottawa, ON, K1H 8L1, Canada
Kym M. Boycott
The Jackson Laboratory, Bar Harbor, ME, 04609, USA
Robert E. Braun, Stephen A. Murray & Jacqueline White
Genomics England, William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ, UK
Mark Caulfield
The Hospital for Sick Children, Toronto, ON, M5G 1X8, Canada
Ronald Cohn & Colin McKerlie
Departments of Molecular Physiology and Biophysics, Baylor College of Medicine, Houston, TX, 77030, USA
Mary E. Dickinson
Phenomics Australia, The Australian National University, 131 Garran Road, Acton, ACT, 2601, Australia
Michael S. Dobbie
The Centre for Phenogenomics, Lunenfeld-Tanenbaum Research Institute, Toronto, ON, M5T 3H7, Canada
Ann M. Flenniken
European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
Paul Flicek & Terrence F. Meehan
National Facility for Gene Function in Health and Disease, Department of Biology, Indian Institute of Science, Education and Research (IISER) Pune, Pune, Maharashtra, 411008, India
Sanjeev Galande & Helen Parkinson
SKL of Pharmaceutical Biotechnology and Model Animal Research Center, Collaborative Innovation Center for Genetics and Development, Nanjing Biomedical Research Institute, Nanjing University, Nanjing, 210061, China
Xiang Gao
DST/NWU Preclinical Drug Development Platform, North-West University, Potchefstroom, 2520, South Africa
Anne Grobler
Université de Strasbourg, CNRS, INSERM, Institut de Génétique, Biologie Moléculaire et Cellulaire, Institut Clinique de la Souris, IGBMC, PHENOMIN-ICS, 67404, Illkirch, France
Yann Herault
German Mouse Clinic, Institute of Experimental Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764, Neuherberg, Germany
Martin Hrabě de Angelis
Chair of Experimental Genetics, Center of Life and Food Sciences Weihenstephan, Technische Universität München, 85354, Freising-Weihenstephan, Germany
Martin Hrabě de Angelis
German Center for Diabetes Research (DZD), 85764, Neuherberg, Germany
Martin Hrabě de Angelis
Institut Imagine, UMR-1163 INSERM et Université de Paris, Hôpital Universitaire Necker-Enfants Malades, 24, Boulevard du Montparnasse, 75015, Paris, France
Stanislas Lyonnet
Medical Research Council Harwell Institute (Mammalian Genetics Unit and Mary Lyon Centre), Harwell, Oxfordshire, OX11 0RD, UK
Ann-Marie Mallon, Sara Wells & Steve D. M. Brown
Monterotondo Mouse Clinic, Italian National Research Council (CNR), Institute of Biochemistry and Cell Biology (IBBC), Monterotondo Scalo, I-00015, Rome, Italy
Fabio Mammano & Glauco Tocchini-Valentini
Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, USA
Calum A. MacRae
Lady Davis Research Institute, Jewish General Hospital, McGill University, 3999 Côte Ste- Catherine Road, Montreal, Quebec, H3T 1E2, Canada
Roderick McInnes
The Centre for Phenogenomics, The Hospital for Sick Children, Toronto, ON, M5T 3H7, Canada
Colin McKerlie & Lauryl M. J. Nutter
RIKEN BioResource Research Center, Tsukuba, Ibaraki, 305-0074, Japan
Yuichi Obata & Toshihiko Shiroishi
Institute for Cellular and Molecular Medicine, Department Immunology, and SAMRC Extramural Unit for Stem Cell Research and Therapy, Faculty of Health Sciences, University of Pretoria, Pretoria, South Africa
Michael S. Pepper
Czech Centre for Phenogenomics, Institute of Molecular Genetics of the Czech Academy of Sciences, 252 50, Vestec, Czech Republic
Radislav Sedlacek
Korea Mouse Phenotyping Consortium (KMPC) and BK21 Program for Veterinary Science, Research Institute for Veterinary Science, College of Veterinary Medicine, Seoul National University, 599 Gwanangno, Gwanak-gu, Seoul, 08826, South Korea
Je Kyung Seong
Clinical Pharmacology, William Harvey Research Institute, School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ, UK
Damian Smedley
McKusick-Nathans Department of Genetic Medicine, The Johns Hopkins University School of Medicine, 519 BRB, 733 N Broadway, Baltimore, MD, 21205, USA
David Valle
National Laboratory Animal Center, National Applied Research Laboratories, Taipei, Taiwan
Chi-Kuang Leo Wang
Institute of Developmental Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, 85764, Neuherberg, Germany
Wolfgang Wurst
Chair of Developmental Genetics, Center of Life and Food Sciences Weihenstephan, Technische Universität München, 85354, Freising-Weihenstephan, Germany
Wolfgang Wurst
Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE), Munich Cluster for Systems Neurology (SyNergy), Adolf-Butenandt-Institut, Ludwig Maximillian’s Universitat Munchen, 81377, Munich, Germany
Wolfgang Wurst
Cambridge-Suda Genomic Resource Center, Jiangsu Key Laboratory of Neuropsychiatric Diseases, Medical College of Soochow University, Suzhou, 215123, Jiangsu, China
Ying Xu

Authors

K. C. Kent Lloyd
View author publications
You can also search for this author in PubMed Google Scholar
David J. Adams
View author publications
You can also search for this author in PubMed Google Scholar
Gareth Baynam
View author publications
You can also search for this author in PubMed Google Scholar
Arthur L. Beaudet
View author publications
You can also search for this author in PubMed Google Scholar
Fatima Bosch
View author publications
You can also search for this author in PubMed Google Scholar
Kym M. Boycott
View author publications
You can also search for this author in PubMed Google Scholar
Robert E. Braun
View author publications
You can also search for this author in PubMed Google Scholar
Mark Caulfield
View author publications
You can also search for this author in PubMed Google Scholar
Ronald Cohn
View author publications
You can also search for this author in PubMed Google Scholar
Mary E. Dickinson
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Dobbie
View author publications
You can also search for this author in PubMed Google Scholar
Ann M. Flenniken
View author publications
You can also search for this author in PubMed Google Scholar
Paul Flicek
View author publications
You can also search for this author in PubMed Google Scholar
Sanjeev Galande
View author publications
You can also search for this author in PubMed Google Scholar
Xiang Gao
View author publications
You can also search for this author in PubMed Google Scholar
Anne Grobler
View author publications
You can also search for this author in PubMed Google Scholar
Jason D. Heaney
View author publications
You can also search for this author in PubMed Google Scholar
Yann Herault
View author publications
You can also search for this author in PubMed Google Scholar
Martin Hrabě de Angelis
View author publications
You can also search for this author in PubMed Google Scholar
James R. Lupski
View author publications
You can also search for this author in PubMed Google Scholar
Stanislas Lyonnet
View author publications
You can also search for this author in PubMed Google Scholar
Ann-Marie Mallon
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Mammano
View author publications
You can also search for this author in PubMed Google Scholar
Calum A. MacRae
View author publications
You can also search for this author in PubMed Google Scholar
Roderick McInnes
View author publications
You can also search for this author in PubMed Google Scholar
Colin McKerlie
View author publications
You can also search for this author in PubMed Google Scholar
Terrence F. Meehan
View author publications
You can also search for this author in PubMed Google Scholar
Stephen A. Murray
View author publications
You can also search for this author in PubMed Google Scholar
Lauryl M. J. Nutter
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Obata
View author publications
You can also search for this author in PubMed Google Scholar
Helen Parkinson
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Pepper
View author publications
You can also search for this author in PubMed Google Scholar
Radislav Sedlacek
View author publications
You can also search for this author in PubMed Google Scholar
Je Kyung Seong
View author publications
You can also search for this author in PubMed Google Scholar
Toshihiko Shiroishi
View author publications
You can also search for this author in PubMed Google Scholar
Damian Smedley
View author publications
You can also search for this author in PubMed Google Scholar
Glauco Tocchini-Valentini
View author publications
You can also search for this author in PubMed Google Scholar
David Valle
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Kuang Leo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Sara Wells
View author publications
You can also search for this author in PubMed Google Scholar
Jacqueline White
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Wurst
View author publications
You can also search for this author in PubMed Google Scholar
Ying Xu
View author publications
You can also search for this author in PubMed Google Scholar
Steve D. M. Brown
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

KCKL and SDMB co-drafted the editorial; KCKL, SDMB, DJA, GB, ALB, FB, KMB, REB, MC, RC, MED, MSD, AMF, PF, SG, XG, AG, JDH, YH, MHA, JRL, SL, AMM, FM, CAM, RM, CM, TFM, SAM, LMJN, YO, HP, MSP, RS, JKS, TS, DS, GTV, DV, CKLW, SW, JW, WW, and YX read, re-drafted, and approved the final version.

Corresponding authors

Correspondence to K. C. Kent Lloyd or Steve D. M. Brown.

Ethics declarations

Competing interests

PF is a member of the Scientific Advisory Boards of Fabric Genomics, Inc., and Eagle Genomics, Ltd. The other authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Lloyd, K.C.K., Adams, D.J., Baynam, G. et al. The Deep Genome Project. Genome Biol 21, 18 (2020). https://doi.org/10.1186/s13059-020-1931-9

Download citation

Received: 08 January 2020
Accepted: 08 January 2020
Published: 03 February 2020
DOI: https://doi.org/10.1186/s13059-020-1931-9

The Deep Genome Project

Introduction

Rationale

Remaining gaps

Moving forward

Conclusion

References

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Genome Biology

Contact us