Rapid identification of PAX2/5/8 direct downstream targets in the otic vesicle by combinatorial use of bioinformatics tools
© Ramialison et al.; licensee BioMed Central Ltd. 2008
Received: 11 September 2008
Accepted: 1 October 2008
Published: 01 October 2008
The pax2/5/8 genes belonging to the PAX family of transcription factors are key developmental regulators that are involved in the patterning of various embryonic tissues. More particularly, their function in inner ear specification has been widely described. However, little is known about the direct downstream targets and, so far, no global approaches have been performed to identify these target genes in this particular tissue.
Here we present an original bioinformatics pipeline composed of comparative genomics, database querying and text mining tools, which is designed to rapidly and specifically discover PAX2/5/8 direct downstream targets involved in inner ear development. We provide evidence supported by experimental validation in medaka fish that brain 2 (POU domain, class 3, transcription factor 2), claudin-7, secretory pathway component sec31-like and meteorin-like precursor are novel direct downstream targets of PAX2/5/8.
This study illustrates the power of extensive mining of public data repositories using bioinformatics methods to provide answers for a specific biological question. It furthermore demonstrates how the usage of such a combinatorial approach is advantageous for the biologist in terms of experimentation time and costs.
The pax genes encode a family of transcription factors that have been conserved through evolution and play different roles in early development. This family is defined by the presence of a highly conserved motif of 128 amino acids, the paired-domain, which does not have any obvious sequence homology with other known protein domains. Nine members of the pax gene family have been isolated in vertebrates, which are grouped into four distinct subfamilies, based on sequence similarity and structural domains [1–3]. The subfamily consisting of PAX2, PAX5 and PAX8 (PAX2/5/8) encodes transcription regulators that bind DNA via the amino-terminal paired-domain, whereas the carboxy-terminal region is required for trans-activation or repression of target genes. Detailed DNA binding studies led to the definition of a consensus recognition sequence that is bound by all members of this subfamily [4, 5]. The pax2/5/8 genes are expressed in a spatially and temporally overlapping manner in the brain, eye, kidney and inner ear in several model organisms [6–9]. Particularly, the members of this subfamily are the earliest known genes that are involved in inner ear development. In teleosts, pax8 is expressed in preotic cells by the early somitogenesis stages, followed by pax2 expression in the otic placode and vesicle, whereas pax5 is restricted to the utricular macula [10–12]. Although the roles of pax2/5/8 genes during ear development are partly illustrated by loss-of-function, mutant analysis and gain-of-function in fish [11, 13–17], little is known about the direct downstream target genes of this PAX subfamily. More particularly, although gene expression profiling comparing wild type and PAX2 mutants has already been performed in mouse embryos , this analysis was restricted to the identification of PAX2 targets in the midbrain-hindbrain boundary. A systematic discovery of specific PAX2/5/8 direct targets in the otic vesicle has not yet been performed.
We therefore aimed to identify PAX2/5/8 direct downstream targets, especially those involved in inner ear development. For this purpose, we opted for a novel approach that takes advantage of the vast amount of biological resources generated by large-scale experiments and available to the scientific community through public databases. Indeed, on one hand, numerous high-throughput gene expression pattern screens (for example, in vertebrates [19–23]) combined with massive whole-genome sequencing (for example, pioneer efforts with mammals [24–26]) have generated an invaluable resource of information concerning any given gene. On the other hand, a myriad of bioinformatics tools from functional to comparative genomics [27, 28] have emerged to extract and mine this information in a systematic way. Therefore, we took advantage of these bioinformatics tools to develop a strategy that combines a comparative genomics algorithm, gene expression pattern databases queries and text mining.
Firstly, we ran an improved version of the previously described evolutionary double filtering algorithm (EDF)  on PAX2/5/8 position weight matrices (PWMs) [4, 5] to predict PAX2/5/8 downstream targets in silico. This algorithm has been successfully applied for the discovery of ATH5 target genes  and its power lies in the requirement of a unique single input, the PWM representing the binding site of the transcription factor of interest.
Secondly, from this primary list of in silico predicted PAX2/5/8 target genes, we extracted the subset of candidate genes that would be specifically involved in otic vesicle development by selecting the genes that were either known to be expressed in the otic vesicle or cited in the context of otic vesicle development. Queries against mouse and zebrafish expression pattern databases and text mining of MEDLINE abstracts were respectively applied to perform this selection.
Thirdly, to validate the putative PAX2/5/8 downstream targets in the otic vesicle predicted by this combination of in silico analysis, we carried out in vitro electrophoretic mobility shift assays and in vivo misexpression experiments in medaka to provide experimental evidence that four predicted candidates genes (brn2 (pou3f2), claudin-7, sec31-like, ccdc102a and meteorin-like precursor) are new PAX2/5/8 direct target genes for otic development.
Results and discussion
Evolutionary double filtering
The primary list of PAX2/5/8 putative downstream targets was obtained in silico, by applying an improved version of the EDF algorithm as described in Del Bene et al.  (see also Materials and methods). This algorithm requires as input the PWM representing the binding site of the transcription factor of interest. It delivers as output a list of human genes that contain the transcription factor binding site in their promoter region, in a position that is evolutionarily conserved at least within the mammalian orthologues and, in some cases, up to other distant vertebrate orthologues (namely fish). Using binding site conservation as a filter is particularly suitable for the discovery of PAX2/5/8 downstream targets since the role of this transcription factor family in the development of the otic vesicle is conserved throughout vertebrate species .
Evolutionarily conserved non-coding DNA sequences, which are at least 85% similar to these matrices, were retrieved, resulting in a final list of 672 candidate PAX2/5/8 downstream targets genes (Figure 1b; 486 and 195 candidates using matrices A and B, respectively (Additional data file 1)). The high number of candidate genes not only reflects the different functions of PAX transcription factors in the different tissues, but is also a consequence of the high variability of the PAX2/5/8 consensus binding site (Figure 1b). Hence, previously known PAX2 direct targets such as pax2 itself or foxi1 [13, 14] were recovered by the EDF pipeline when the threshold for matrix similarity was lowered to 80% (data not shown).
Furthermore, the overlap between the outputs from the two matrices is fairly small (nine genes), which can be explained by the differences between the two matrices (compared in Figure 1a). Indeed, it has been shown that PAX proteins do not recognize a single consensus DNA-binding sequence ; therefore, it is expected that the PWMs derived from two different methods are not alike. This ambiguity is not the result of the differences in nucleotide binding specificities between different PAX proteins, but rather the inherent flexibility in the target preference exhibited by PAX proteins. The paired-domain is highly conserved between all nine PAX orthologues; however, small differences in the paired-domain are responsible for subfamily specific differences in nucleotide recognition. For instance, PAX6 and PAX5 proteins, which are members of two distinct subfamilies, differ in the DNA-binding specificities of their paired domains by three amino acid residues . Therefore, running the EDF pipeline using these two different matrices in parallel provides an advantage to optimize the discovery of PAX2 target genes.
Identification of candidate genes expressed in the otic vesicle
Occurrences of PAX2/5/8 candidate target genes predicted from matrices A and B in mouse (GXD) and zebrafish (ZFIN) databases
Genes described in GXD
Genes described in ZFIN
Genes described in GXD and ZFIN
Genes expressed in ear in GXD
Genes expressed in ear in ZFIN
Genes expressed in ear in GXD and ZFIN
In parallel, we screened the literature by text mining to select amongst the 672 putative target genes those that were already cited in a MEDLINE abstract along with the keywords 'ear', 'otic placode' or 'otic vesicle' (Figure 1b). The search was performed by searching the co-occurrences of the UNIPROT terms of the candidate genes and one or more of the keywords. Out of the 438 candidate genes that were described by a UNIPROT term, 68 co-occurred with the given keywords (Figure 1b; Additional data file 2). This occurrence is highly statistically significant when compared to the total number of genes cited in MEDLINE with these keywords (p-value = 2e-11).
In total, 133 non-redundant PAX2/5/8 downstream targets involved in inner ear development were predicted using both methods and both matrices (Figure 1b). Strikingly, only three genes were common to both the database and literature search, while only four genes were annotated in both GXD and ZFIN databases (Additional data file 2). This number of overlapping genes demonstrates the complementarity of the resources and illustrates the benefit we gain from running the analysis on different bioinformatics datasets.
In order to experimentally validate the predicted targets, we chose the vertebrate model system medaka to perform these experiments, as the comparison of data from three different vertebrate model organisms (mouse, zebrafish and medaka) would confer a further line of evidence to the validation of the in silico process. Indeed, sequence conservation of PAX2/5/8 binding sites amongst different vertebrate species is used as a positive read-out of functionality in the EDF pipeline. Thus, if the candidate genes that bear a conserved PAX2/5/8 binding site in their promoters also display conserved expression patterns (that is, expressed in the otic vesicle) throughout different species, they are more likely to be under evolutionary pressure to maintain functional regulation by PAX2/5/8.
Candidate genes selected for experimental validation
Human EnsEMBL ID
Clones specifically expressed in otic vesicle and Pax2 responsive
Clones specifically expressed in otic vesicle
Clones ubiquitously expressed
Clones not expressed in otic vesicle
In total, 9 out of 14 candidates showed a specific expression in the otic vesicle, demonstrating the power of the in silico approach to predict otic vesicle markers. The remaining five candidates either exhibited an expression pattern in the otic vesicle scarcely distinguishable from a strong ubiquitous staining all over the embryo (lysosome-associated membrane glycoprotein 1 precursor (lamp1), major vault protein (mvp); data not shown), or they were not expressed at all in the otic vesicle (basic helix-loop-helix domain containing, class B, 5 (bhlhb5), skeletal muscle LIM-protein 1 (slim1), epidermal growth factor receptor substrate 15-like 1 (eps15rl1)).
To gain further insight into the interaction between PAX2/5/8 and these nine downstream candidates specifically expressed in the otic vesicle, pax2 overexpression experiments were undertaken. To overcome potential unspecific effects caused by morphological alterations through misexpression of pax2 at earlier stages, we decided to use the artificial heat shock promoter HSE and the meganuclease injection technique [41, 42], which results in broad misexpression in the majority of medaka embryos.
Pax2/8 over-expression effect on predicted downstream targets
Expression in the otic vesicle
Effect upon pax2 misexpression (%)*
To exclude unspecific false positive results, we selected three genes annotated to be expressed in the otic vesicle in the expression pattern databases, but that are not present in the PAX2/5/8 candidate target genes list. We analyzed the expression of these three clones in misexpressing pax2 embryos. All three genes showed no specific ectopic expression in the otic vesicle region (Additional data file 4). Furthermore, it has already been shown that four other genes (six1, eya1, otx1 and gbx2) co-expressed with Pax2/8 but absent from the candidate target genes list do not show altered expression upon Pax2 over-expression , supporting the specificity of our assay.
Therefore, these experimental results demonstrate that brn2, mtrnl, cldn7 and sec31l are direct targets of PAX2/5/8 transcription factors.
The large costs of genome sequencing efforts and high-throughput approaches are justified by the wealth of biological information they eventually provide to the scientific community. It is worthwhile, therefore, to exploit these resources, which are mostly freely available, in order to quickly provide data to the biologist for further experimental studies.
In this study, we have demonstrated that taking advantage of available bioinformatics techniques, from comparative genomics to database and text mining, efficiently and rapidly identifies new direct downstream targets of the PAX2/5/8 transcription factor family. This pipeline also allowed the discovery of previously undescribed novel otic vesicle specific genes, which are amenable to further functional analysis.
Although the number of targets discovered with this approach is not comparable to that of 'wet' experimental techniques, such as large-scale chromatin-immunoprecipitation , this pipeline offers the major advantages of delivering results after a run time of just a day, being applicable to other transcription factors and identifying key downstream targets using sequence and expression conservation.
We have demonstrated that an alternative approach combining simple straightforward bioinformatics pipelines can help the biologist to target their questions and obtain encouraging results, in a fast and cost-efficient way.
Materials and methods
Evolutionary double filtering
We improved the EDF pipeline described in  to take advantage of the increased number of vertebrate genomes sequenced since it was first published. The 4 kilobase upstream sequences of orthologues in the following species were retrieved from EnsEMBL compara v29: Homo sapiens, Pan troglodytes, Mus musculus, Rattus norvegicus, Canis familiaris, Gallus gallus, Xenopus tropicalis, Tetraodon nigroviridis, Fugu rubripes and Danio rerio.
The PWM, which was given as input, was matched to all sequences, in an alignment independent manner. In order to increase the weight between matches on distantly related sequences, for all sequences that matched the PWM at least once, the distance measures to all the other matching sequences were calculated using a previously published metric  (tuple size = 5). The matching sequence was then weighted with the distance to the nearest sequence (for example, if two sequences are identical, then one sequence will be weighted 0) and all the weights were added in order to get a final score. All genes with a final score above 1 were considered.
As input to the EDF pipeline, two different PAX2/5/8 PWMs were processed in parallel. the first PWM, matrix A (Figure 1a), was derived from the TRANSFAC database  based on in vitro selection data obtained from Epstein et al. . The second PWM, matrix B (Figure 1b), was derived from the work of Czerny at al.  based on a compilation of known PAX binding sites. Matches to the PAX2/5/8 PWM with more than 85% identity were selected. The graphical output of these matrices (Figure 1a) was generated using the WebLogo program .
Gene Ontology over-representation
Over-representation of GO terms was calculated using Fisher's exact test of statistical significance, implemented in the G:profiler software , providing as input the primary list of 672 in silico predicted target genes.
The total list of genes described in GXD was obtained from database dumps (April 2006, updated on March 2008) from the GXD Data and Statistical Reports in the 'gene expression' section. The list of genes expressed in the otic vesicle and substructures was directly downloaded from the Mouse Genome Informatics website  using the Gene Expression Data Query Form. Default parameters were used except for 'Expression = detected', 'Anatomical Structure(s) = ear' and 'Sorting and output format = no limit'.
The total list of genes described in ZFIN was obtained from querying the web interface for genes expressed in 'ear', 'otic placode', 'otic vesicle' and substructures (April 2006). Data update was performed on March 2008, using the database dumps for 'gene expression'. Perl scripts were written to select only the genes described by in situ hybridization in a wild-type background. The subset of genes expressed in the otic vesicle were obtained by selecting from the total list of described genes those which have been annotated to be expressed in the otic vesicle and its substructures. The list of otic vesicle substructures was obtained from the Ontology Lookup Service , using 'otic placode' as input.
We applied a Chi-square test to calculate the over-representation of genes annotated in the otic vesicle in the predicted list of PAX2/5/8 candidates, compared to the total occurrences of genes annotated to be expressed in the otic vesicle in GXD and ZFIN. Values used for the contingency table are highlighted in Table 1.
Computer-assisted text mining  was employed to select MEDLINE abstracts containing at least one name or synonym listed in the UNIPROT entries of the candidate genes and at least one of the following keywords: 'ear', 'otic vesicle' and 'otic placode' (according to MEDLINE contents on August 2005 and March 2008). To assess the significance of the co-occurrence of candidate genes with these terms, we used computer cluster farms to calculate the total number of genes cited in MEDLINE (using UNIPROT terms), and the fraction of genes cited with the given keywords. In total, 14,453 UNIPROT terms where found in more than 15 million MEDLINE abstracts, amongst which 1,047 were cited with the keywords. These values were used to populate the contingency table for the Chi-square test in order to calculate the corresponding p-value.
Fish strain and maintenance
Microinjection and heat shock treatment
For experimental validation, the heat-inducible pSGHPax2 and pSGHPax8 constructs were used as described in [14, 41]. Medaka embryos were microinjected at 20-40 ng/μl into single blastomeres at the one- to two-cell stage. DNA was co-injected with the I-SceI meganuclease enzyme as described . After injection the embryos were incubated at 28°C. Embryos lacking background green fluorescent protein (GFP) activity were selected prior to heat shock treatment. For all experiments, the heat treatment was performed for 2 h at 39°C. Four hours after heat shock embryos were fixed for in situ hybridization analysis.
Whole mount in situhybridization
The medaka orthologues corresponding to the candidate genes were obtained using EnsEMBL Biomart . The corresponding clones to these genes were blasted (blastn, percentage of identity >95%) against the Medaka Expression Pattern Database  and against an in-house library of medaka sequenced cDNA clones at the EMBL. In situ hybridization analysis of the 14 downstream targets selected was primarily performed using Intavis Robot as previously described in . For the functional validation, in situ hybridization was performed three to four hours after heat shock, GFP positive embryos were fixed over-night in 4% paraformaldehyde/2 × PTW (phosphate-buffered saline/Tween). Whole-mount in situ hybridization was performed at 65°C as described previously using DIG-labeled probes .
Electrophoretic mobility shift assay
Double stranded oligonucleotides for each candidate gene were designed to contain the predicted PAX2/5/8 binding site and flanking regions up to 40 bp and an additional 5'-GGG overhang for labeling: cldn7 human promoter, 5'gggTAGGGAGGACGGAACAGTGAGGCGTGACAGAGTGCACAGCAATTG; sec31l human promoter, 5'gggAGCTGGTAGAAGGTCACTGAAGCTTAAATACAGGTTTTCCAATTC; brn2 Xenopus tropicalis promoter, 5'gggAACTGCCATGTGCGCAGTGAAGGGTTAATCAGATCAATAGACTGA; metrnl zebrafish promoter, 5'gggGAACAGAAATAACACACTGAAGCTTGTCCCAGATGACCCAAATTC; ccdc102a human promoter, 5'gggATTCACCTCGGGGGCCTTCCAGGGTACAGGATGTAGTGGGGAGTC; Δcldn7 human promoter, 5'gggTAGGGAGGACGGAAAAGTGAGTCGAGACAGAGTGCACAGCAATTG.
Complementary oligonucleotides were annealed and end-labeled with Klenow DNA polymerase and [α-32P]dCTP. Corresponding cold probes where processed in parallel using non-labeled dCTP for competition assays. Zebrafish pax2 was in vitro translated using the Promega TnT sp6/T7 coupled reticulocyte lysate system according to manufacturer's instructions. One fmole of labeled DNA probe was incubated with 5 μl of Pax2 translation reaction for 30 minutes at room temperature in the following binding buffer: 100 mM KCl, 10 mM Hepes (pH 8), 1 mM DTT, 5% glycerol, 1 mM EDTA and 1 μg poly(dI:dC) in a total volume of 20 μl in water. Competition was performed with 100- or 500-fold molar excess of cold competitor. The DNA-protein complex was resolved on a native 6% polyacrylamide gel (in 0.5 × TBE) at 160 V at 4°C for 2 h. The gel was dried and visualized by autoradiography.
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 is a table listing the candidate genes from the EDF pipeline (matrices A and B). Additional data file 2 is a table listing Pax2/5/8 putative downstream targets predicted to be expressed in the otic vesicle by systematic queries in mouse (MGI), zebrafish (ZFIN) databases and MEDLINE abstracts. Additional data file 3 is a figure representing the genes specifically expressed in the otic vesicle but not affected by Pax2/8 over-expression. Additional data file 4 is a table listing the negative candidate genes for the validation of the pax2 over-expression system.
evolutionary double filtering
electrophoretic mobility shift assay
green fluorescent protein
position weight matrix.
We would like to thank Stuart Archer for critical reading of the manuscript and all the members of the Wittbrodt lab for fruitful discussions. We thank especially Lazaro Centanin, Virginie Marchand and Jürg Müller for providing protocols and assistance with the EMSA. This work has been supported by the DFG Collaborative Research Network SFB488 (JW), FWF P19571-B11 (BB, NA and TC), 'E-STAR' fellowship funded by the EC's FP6 Marie Curie Host fellowship for Early Stage Research Training (MEST-CT-2004-504640) (SG).
- Stuart ET, Kioussi C, Gruss P: Mammalian Pax genes. Annu Rev Genet. 1994, 28: 219-236. 10.1146/annurev.ge.28.120194.001251.PubMedView ArticleGoogle Scholar
- Mansouri A, Goudreau G, Gruss P: Pax genes and their role in organogenesis. Cancer Res. 1999, 59: 1707s-1709s. discussion 1709s-1710s.PubMedGoogle Scholar
- Noll M: Evolution and role of Pax genes. Curr Opin Genet Dev. 1993, 3: 595-605. 10.1016/0959-437X(93)90095-7.PubMedView ArticleGoogle Scholar
- Czerny T, Schaffner G, Busslinger M: DNA sequence recognition by Pax proteins: bipartite structure of the paired domain and its binding site. Genes Dev. 1993, 7: 2048-2061. 10.1101/gad.7.10.2048.PubMedView ArticleGoogle Scholar
- Epstein J, Cai J, Glaser T, Jepeal L, Maas R: Identification of a Pax paired domain recognition sequence and evidence for DNA-dependent conformational changes. J Biol Chem. 1994, 269: 8355-8361.PubMedGoogle Scholar
- Nornes S, Clarkson M, Mikkola I, Pedersen M, Bardsley A, Martinez JP, Krauss S, Johansen T: Zebrafish contains two pax6 genes involved in eye development. Mech Dev. 1998, 77: 185-196. 10.1016/S0925-4773(98)00156-7.PubMedView ArticleGoogle Scholar
- Plachov D, Chowdhury K, Walther C, Simon D, Guenet JL, Gruss P: Pax8, a murine paired box gene expressed in the developing excretory system and thyroid gland. Development. 1990, 110: 643-651.PubMedGoogle Scholar
- Adams B, Dorfler P, Aguzzi A, Kozmik Z, Urbanek P, Maurer-Fogy I, Busslinger M: Pax-5 encodes the transcription factor BSAP and is expressed in B lymphocytes, the developing CNS, and adult testis. Genes Dev. 1992, 6: 1589-1607. 10.1101/gad.6.9.1589.PubMedView ArticleGoogle Scholar
- Pfeffer PL, Gerster T, Lun K, Brand M, Busslinger M: Characterization of three novel members of the zebrafish Pax2/5/8 family: dependency of Pax5 and Pax8 expression on the Pax2.1 (noi) function. Development. 1998, 125: 3063-3074.PubMedGoogle Scholar
- Whitfield TT, Riley BB, Chiang MY, Phillips B: Development of the zebrafish inner ear. Dev Dyn. 2002, 223: 427-458. 10.1002/dvdy.10073.PubMedView ArticleGoogle Scholar
- Riley BB, Phillips BT: Ringing in the new ear: resolution of cell interactions in otic development. Dev Biol. 2003, 261: 289-312. 10.1016/S0012-1606(03)00245-8.PubMedView ArticleGoogle Scholar
- Hochmann S, Aghaallaei N, Bajoghli B, Soroldoni D, Carl M, Czerny T: Expression of marker genes during early ear development in medaka. Gene Expr Patterns. 2007, 7: 355-362. 10.1016/j.modgep.2006.07.008.PubMedView ArticleGoogle Scholar
- Hans S, Liu D, Westerfield M: Pax8 and Pax2a function synergistically in otic specification, downstream of the Foxi1 and Dlx3b transcription factors. Development. 2004, 131: 5091-5102. 10.1242/dev.01346.PubMedView ArticleGoogle Scholar
- Aghaallaei N, Bajoghli B, Czerny T: Distinct roles of Fgf8, Foxi1, Dlx3b and Pax8/2 during otic vesicle induction and maintenance in medaka. Dev Biol. 2007, 307: 408-420. 10.1016/j.ydbio.2007.04.022.PubMedView ArticleGoogle Scholar
- Kwak SJ, Vemaraju S, Moorman SJ, Zeddies D, Popper AN, Riley BB: Zebrafish pax5 regulates development of the utricular macula and vestibular function. Dev Dyn. 2006, 235: 3026-3038. 10.1002/dvdy.20961.PubMedView ArticleGoogle Scholar
- Mackereth MD, Kwak SJ, Fritz A, Riley BB: Zebrafish pax8 is required for otic placode induction and plays a redundant role with Pax2 genes in the maintenance of the otic placode. Development. 2005, 132: 371-382. 10.1242/dev.01587.PubMedView ArticleGoogle Scholar
- Torres M, Gomez-Pardo E, Dressler GR, Gruss P: Pax-2 controls multiple steps of urogenital development. Development. 1995, 121: 4057-4065.PubMedGoogle Scholar
- Bouchard M, Grote D, Craven SE, Sun Q, Steinlein P, Busslinger M: Identification of Pax2-regulated genes by expression profiling of the mid-hindbrain organizer region. Development. 2005, 132: 2633-2643. 10.1242/dev.01833.PubMedView ArticleGoogle Scholar
- Quiring R, Wittbrodt B, Henrich T, Ramialison M, Burgtorf C, Lehrach H, Wittbrodt J: Large-scale expression screening by automated whole-mount in situ hybridization. Mech Dev. 2004, 121: 971-976. 10.1016/j.mod.2004.03.031.PubMedView ArticleGoogle Scholar
- Gawantka V, Pollet N, Delius H, Vingron M, Pfister R, Nitsch R, Blumenstock C, Niehrs C: Gene expression screening in Xenopus identifies molecular pathways, predicts gene function and provides a global view of embryonic patterning. Mech Dev. 1998, 77: 95-141. 10.1016/S0925-4773(98)00115-4.PubMedView ArticleGoogle Scholar
- Kudoh T, Tsang M, Hukriede NA, Chen X, Dedekian M, Clarke CJ, Kiang A, Schultz S, Epstein JA, Toyama R, Dawid IB: A gene expression screen in zebrafish embryogenesis. Genome Res. 2001, 11: 1979-1987. 10.1101/gr.209601.PubMedView ArticleGoogle Scholar
- Neidhardt L, Gasca S, Wertz K, Obermayr F, Worpenberg S, Lehrach H, Herrmann BG: Large-scale screen for genes controlling mammalian embryogenesis, using high-throughput gene expression analysis in mouse embryos. Mech Dev. 2000, 98: 77-94. 10.1016/S0925-4773(00)00453-6.PubMedView ArticleGoogle Scholar
- Lein ES, Hawrylycz MJ, Ao N, Ayres M, Bensinger A, Bernard A, Boe AF, Boguski MS, Brockway KS, Byrnes EJ, Chen L, Chen L, Chen TM, Chin MC, Chong J, Crook BE, Czaplinska A, Dang CN, Datta S, Dee NR, Desaki AL, Desta T, Diep E, Dolbeare TA, Donelan MJ, Dong HW, Dougherty JG, Duncan BJ, Ebbert AJ, Eichele G, et al: Genome-wide atlas of gene expression in the adult mouse brain. Nature. 2007, 445: 168-176. 10.1038/nature05453.PubMedView ArticleGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.PubMedView ArticleGoogle Scholar
- Mouse Genome Sequencing Consortium, Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, et al: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420: 520-562. 10.1038/nature01262.View ArticleGoogle Scholar
- Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Gabor Miklos GL, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, et al: The sequence of the human genome. Science. 2001, 291: 1304-1351. 10.1126/science.1058040.PubMedView ArticleGoogle Scholar
- Chiang DY, Brown PO, Eisen MB: Visualizing associations between genome sequences and gene expression data using genome-mean expression profiles. Bioinformatics. 2001, 17 (Suppl 1): S49-55.PubMedView ArticleGoogle Scholar
- Ureta-Vidal A, Ettwiller L, Birney E: Comparative genomics: genome-wide analysis in metazoan eukaryotes. Nat Rev Genet. 2003, 4: 251-262. 10.1038/nrg1043.PubMedView ArticleGoogle Scholar
- Del Bene F, Ettwiller L, Skowronska-Krawczyk D, Baier H, Matter JM, Birney E, Wittbrodt J: In vivo validation of a computationally predicted conserved Ath5 target gene set. PLoS Genet. 2007, 3: 1661-1671. 10.1371/journal.pgen.0030159.PubMedView ArticleGoogle Scholar
- Torres M, Giraldez F: The development of the vertebrate inner ear. Mech Dev. 1998, 71: 5-21. 10.1016/S0925-4773(97)00155-X.PubMedView ArticleGoogle Scholar
- Wingender E, Chen X, Fricke E, Geffers R, Hehl R, Liebich I, Krull M, Matys V, Michael H, Ohnhäuser R, Prüss M, Schacherer F, Thiele S, Urbach S: The TRANSFAC system on gene expression regulation. Nucleic Acids Res. 2001, 29: 281-283. 10.1093/nar/29.1.281.PubMedPubMed CentralView ArticleGoogle Scholar
- Czerny T, Busslinger M: DNA-binding and transactivation properties of Pax-6: three amino acids in the paired domain are responsible for the different sequence recognition of Pax-6 and BSAP (Pax-5). Mol Cell Biol. 1995, 15: 2858-2871.PubMedPubMed CentralView ArticleGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038/75556.PubMedPubMed CentralView ArticleGoogle Scholar
- Eccles MR, He S, Legge M, Kumar R, Fox J, Zhou C, French M, Tsai RW: PAX genes in development and disease: the role of PAX2 in urogenital tract development. Int J Dev Biol. 2002, 46: 535-544.PubMedGoogle Scholar
- Hagman J, Lukin K: "Hands-on" regulation of B cell development by the transcription factor Pax5. Immunity. 2007, 27: 8-10. 10.1016/j.immuni.2007.07.001.PubMedView ArticleGoogle Scholar
- Trueba SS, Auge J, Mattei G, Etchevers H, Martinovic J, Czernichow P, Vekemans M, Polak M, Attie-Bitach T: PAX8, TITF1, and FOXE1 gene expression patterns during human development: new insights into human thyroid development and thyroid dysgenesis-associated malformations. J Clin Endocrinol Metab. 2005, 90: 455-462. 10.1210/jc.2004-1358.PubMedView ArticleGoogle Scholar
- Smith CM, Finger JH, Hayamizu TF, McCright IJ, Eppig JT, Kadin JA, Richardson JE, Ringwald M: The mouse Gene Expression Database (GXD): 2007 update. Nucleic Acids Res. 2007, 35: D618-623. 10.1093/nar/gkl1003.PubMedPubMed CentralView ArticleGoogle Scholar
- Sprague J, Bayraktaroglu L, Bradford Y, Conlin T, Dunn N, Fashena D, Frazer K, Haendel M, Howe DG, Knight J, Mani P, Moxon SA, Pich C, Ramachandran S, Schaper K, Segerdell E, Shao X, Singer A, Song P, Sprunger B, Van Slyke CE, Westerfield M: The Zebrafish Information Network: the zebrafish model organism database provides expanded support for genotypes and phenotypes. Nucleic Acids Res. 2008, 36: D768-772. 10.1093/nar/gkm956.PubMedPubMed CentralView ArticleGoogle Scholar
- Henrich T, Ramialison M, Wittbrodt B, Assouline B, Bourrat F, Berger A, Himmelbauer H, Sasaki T, Shimizu N, Westerfield M, Kondoh H, Wittbrodt J: MEPD: a resource for medaka gene expression patterns. Bioinformatics. 2005, 21: 3195-3197. 10.1093/bioinformatics/bti478.PubMedView ArticleGoogle Scholar
- Kitahara T, Li-Korotky HS, Balaban CD: Regulation of mitochondrial uncoupling proteins in mouse inner ear ganglion cells in response to systemic kanamycin challenge. Neuroscience. 2005, 135: 639-653. 10.1016/j.neuroscience.2005.06.056.PubMedView ArticleGoogle Scholar
- Bajoghli B, Aghaallaei N, Heimbucher T, Czerny T: An artificial promoter construct for heat-inducible misexpression during fish embryogenesis. Dev Biol. 2004, 271: 416-430. 10.1016/j.ydbio.2004.04.006.PubMedView ArticleGoogle Scholar
- Thermes V, Grabher C, Ristoratore F, Bourrat F, Choulika A, Wittbrodt J, Joly JS: I-SceI meganuclease mediates highly efficient transgenesis in fish. Mech Dev. 2002, 118: 91-98. 10.1016/S0925-4773(02)00218-6.PubMedView ArticleGoogle Scholar
- Kim TH, Ren B: Genome-wide analysis of protein-DNA interactions. Annu Rev Genomics Hum Genet. 2006, 7: 81-102. 10.1146/annurev.genom.7.080505.115634.PubMedView ArticleGoogle Scholar
- Berry MW, Drmac Z, Jessup ER: Matrices, vector spaces, and informtion retrieval. SIAM rev. 1999, 41: 335-362. 10.1137/S0036144598347035.View ArticleGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.PubMedPubMed CentralView ArticleGoogle Scholar
- Reimand J, Kull M, Peterson H, Hansen J, Vilo J: g:Profiler - a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 2007, 35: W193-200. 10.1093/nar/gkm226.PubMedPubMed CentralView ArticleGoogle Scholar
- Cote RG, Jones P, Martens L, Apweiler R, Hermjakob H: The Ontology Lookup Service: more data and better tools for controlled vocabulary queries. Nucleic Acids Res. 2008, W372-376. 10.1093/nar/gkn252. 36 Web Server
- Kirsch H, Gaudan S, Rebholz-Schuhmann D: Distributed modules for text annotation and IE applied to the biomedical domain. Int J Med Inform. 2006, 75: 496-500. 10.1016/j.ijmedinf.2005.06.011.PubMedView ArticleGoogle Scholar
- Wittbrodt J, Shima A, Schartl M: Medaka - a model organism from the far East. Nat Rev Genet. 2002, 3: 53-64. 10.1038/nrg704.PubMedView ArticleGoogle Scholar
- Iwamatsu T: Stages of normal development in the medaka Oryzias latipes. Mech Dev. 2004, 121: 605-618. 10.1016/j.mod.2004.03.012.PubMedView ArticleGoogle Scholar
- Flicek P, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Eyre T, Fitzgerald S, Fernandez-Banet J, Gräf S, Haider S, Hammond M, Holland R, Howe KL, Howe K, Johnson N, Jenkinson A, Kähäri A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, et al: Ensembl 2008. Nucleic Acids Res. 2008, 36: D707-714. 10.1093/nar/gkm988.PubMedPubMed CentralView ArticleGoogle Scholar
- Aghaallaei N, Bajoghli B, Walter I, Czerny T: Duplicated members of the Groucho/Tle gene family in fish. Dev Dyn. 2005, 234: 143-150. 10.1002/dvdy.20510.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.