
An encyclopaedia of mouse genes

  • Kenneth Lee
Genome Biology 2001, 2:spotlight-20010208-02

DOI: 10.1186/gb-spotlight-20010208-02

Published: 08 February 2001

An international consortium of scientists aiming to sequence every transcript encoded by the mouse genome has analysed 21,076 so far.

Estimates of the number of genes in the mammalian genome range from 30,000 to 200,000. The problem is one of identifying which of the sequences in the billions of base pairs that make up the genome actually code for protein.

Instead of sequencing all 10^9 bp in the mouse genome, an international consortium of scientists has been sequencing a large bank of cDNAs prepared from various mouse tissues and developmental stages. The scientists, co-ordinated by Yoshihide Hayashizaki of the RIKEN Genomic Sciences Centre in Japan, report the characterization of the first 21,076 of these cDNA clones in the 8 February Nature (Nature 2001, 409:685-690).

The consortium found, for example, more than 100 new genes that represent metabolic enzymes. Ten novel orthologues of genes implicated in human disease were also identified. Many of the cDNAs represented members of large multigene families associated with cellular differentiation and signal transduction, but few immune-related transcripts were identified. To enrich for these transcripts, Hayashizaki and his colleagues plan to prepare libraries from stimulated immune cells. They hope eventually to identify and sequence every transcript encoded by the mouse genome.


  1. RIKEN Genomic Sciences Centre
  2. The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium: Functional analysis of a full-length mouse cDNA collection. Nature 2001, 409:685-690.


© BioMed Central Ltd 2001