
Identification of novel regulatory factor X (RFX) target genes by comparative genomics in Drosophila species



Regulatory factor X (RFX) transcription factors play a key role in ciliary assembly in nematode, Drosophila and mouse. Using the tremendous advantages of comparative genomics in closely related species, we identified novel genes regulated by dRFX in Drosophila.


We first demonstrate that a subset of known ciliary genes in Caenorhabditis elegans and Drosophila are regulated by dRFX and have a conserved RFX binding site (X-box) in their promoters in two highly divergent Drosophila species. We then designed an X-box consensus sequence and carried out a genome wide computer screen to identify novel genes under RFX control. We found 412 genes that share a conserved X-box upstream of the ATG in both species, with 83 genes presenting a more restricted consensus. We analyzed 25 of these 83 genes, 16 of which are indeed RFX target genes. Two of them have never been described as involved in ciliogenesis. In addition, reporter construct expression analysis revealed that three of the identified genes encode proteins specifically localized in ciliated endings of Drosophila sensory neurons.


Our X-box search strategy led to the identification of novel RFX target genes in Drosophila that are involved in sensory ciliogenesis. We also established a highly valuable Drosophila cilia and basal body dataset. These results demonstrate the accuracy of the X-box screen and will be useful for the identification of candidate genes for human ciliopathies, as several human homologs of RFX target genes are known to be involved in diseases, such as Bardet-Biedl syndrome.


Eukaryotic cilia and flagella are present in many types of tissues and organisms and are important for sensory functions, cell motility, molecular transport, and several developmental processes, such as the establishment of left-right asymmetry in vertebrates [1-5]. Several human diseases are known to result from defects in ciliary assembly or function and have recently been designated as ciliopathies [5]. Cilia are well-defined structures consisting of a microtubular axoneme composed of specific proteins that are assembled dynamically in a strict stereotypical pattern (for reviews, see [6, 7]). Ciliary assembly depends on intraflagellar transport (IFT), a dynamic process highly conserved in organisms ranging from the green alga Chlamydomonas to mammals (reviewed in [1, 8, 9]). Several studies in various organisms have been instrumental in the identification of genes involved in the assembly and function of the cilium. The proteomic analysis of detergent-extracted ciliary axonemes from cultured human epithelial cells identified 214 proteins [10]. More recently, a biochemical fractionation of Chlamydomonas reinhardtii flagella led to the identification of about 700 proteins, of which 360 had high confidence of truly being involved in flagellar composition [11]. A proteomic analysis of Trypanosoma brucei flagella allowed the identification of 522 proteins [12]. Two remarkable approaches took advantage of the availability of complete genome sequences to identify genes encoding ciliary and flagellar proteins. By comparing the genomes of ciliated versus non-ciliated organisms, Avidor-Reiss et al. [13] and Li et al. [14] selected 187 and 688 genes, respectively, that are specific to ciliated organisms. Stolc et al. [15] used microarray hybridization to analyze induction levels of all C. reinhardtii genes after deflagellation. They identified 220 genes that are induced at least two-fold and, therefore, are likely to be involved in the assembly or function of cilia and flagella.

Much less is known about the regulatory pathways that control the expression of ciliary components or direct the differentiation of ciliated cells. The transcription factor FoxJ1 appears to govern the differentiation of ciliated cells in vertebrates, but so far, only one gene has been shown to be directly regulated by FoxJ1 [16]. The transcription factor HNF1-β has also been shown to regulate several genes involved in ciliogenesis in the kidney [17]. Most importantly, regulatory factor X (RFX) transcription factors play a key role in regulating genes involved in ciliogenesis. RFX transcription factors are conserved in a wide range of species, including Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster and mammals. They share a characteristic DNA-binding domain of the winged-helix DNA binding family and bind to an X-box motif, an imperfect inverted repeat with variable spacing between the repeats [18, 19]. Whereas only one Rfx gene is described in yeast and C. elegans, two Rfx genes are present in the Drosophila genome and five in mammals [20]. Major clues on RFX functions in metazoans have been obtained from work on invertebrates. daf-19, the sole Rfx gene in C. elegans, is a key regulator of ciliogenesis [21]. dRfx in Drosophila is expressed in ciliated cells and is necessary for ciliated sensory neuron differentiation: all sensory neurons are present but cilia are missing at the dendritic tips [22, 23]. In mouse, we have shown that RFX function in ciliogenesis is conserved. Indeed, Rfx3 controls the growth of mouse embryonic node cilia [24] and Rfx3 loss-of-function leads to hydrocephalus with differentiation defects of ciliated ependymal cells of the choroid plexus and subcommissural organ [25]. Moreover, Rfx3 mutant mice show insulin secretion failure and impaired glucose tolerance correlated with primary ciliary growth defects on islet cells [26]. In zebrafish, Rfx2 is expressed specifically in multiciliated cells of the pronephros and loss of Rfx2 leads to cyst formation and loss of multicilia [27]. The function of the other RFX proteins has yet to be linked to ciliogenesis. Rfx5, the most divergent mammalian member, regulates major histocompatibility class II gene expression and mutations in it are responsible for the bare lymphocyte syndrome [28]. Rfx4 has been implicated in dorsal patterning of brain development in mice and may participate in circadian rhythm regulation in humans [29-32].

Because RFX function in ciliogenesis appears conserved from C. elegans to mammals, X-box promoter motif sequences can guide the search for ciliary genes. Indeed, genome-wide searches for genes controlled by DAF-19 in C. elegans have identified many genes involved in ciliogenesis [14, 21, 33-38]. Genomic X-box searches thus constitute a key method to identify genes involved in ciliary development. We show here that ciliogenic RFX regulatory cascades are well conserved between D. melanogaster and C. elegans and identify a first set of 14 RFX target genes. In particular, we show that all known Drosophila homologs of genes defective in human Bardet-Biedl syndrome (BBS), a human ciliopathy with complex phenotypes, are controlled by dRFX. Moreover, by using comparative genomic screens we show that genes under dRFX control in D. melanogaster share conserved X-boxes with another divergent Drosophila species, D. pseudoobscura. Applied to the whole genome of both species, our comparative approach led to the identification of at least 11 novel RFX target genes. In vivo reporter assay studies for three of them confirmed their involvement in ciliary structure or function in Drosophila, thus illustrating the accuracy of our screen. In addition, we have established a high-confidence Drosophila cilia and basal body (DCBB) gene list and highlight several genes as novel candidates for ciliogenesis. Our data are of particular importance for further genetic and genomic studies in the field of ciliogenesis and, consequently, for identifying genes involved in human ciliopathies.


Homologs of C. elegans DAF-19 target genes are regulated by dRFX in Drosophila

Our previous work has shown that RFX transcription factors share a common function in ciliogenesis in worm and fly [21, 23]. We thus inferred that a similar set of genes would be regulated by DAF-19 in C. elegans and dRFX in D. melanogaster. Indeed, among more than 20 previously identified DAF-19 targets expressed in all ciliated sensory neurons of C. elegans [21, 36-38], we show that a majority of the homologous genes in fly are down-regulated in dRfx mutants (Table 1). Regulation of gene expression was tested by real-time PCR based on RNA extracted from the thoraxes and legs of 40-hour-old pupae. At this stage, dendrites and cilia have just differentiated. Moreover, the levels of expression of the ciliary genes osm-6 and nompB, relative to the housekeeping gene TBP (TATA Binding Protein) or the pan-neural gene elav during pupal development, are at a maximum starting at 40 hours after puparium formation (data not shown). As shown in Table 1, 14 of 19 DAF-19 regulated genes for which a homologous gene can be found in Drosophila are also regulated by dRFX. Only one gene (CG5359/D1009.5/xbx-2/dylt-2) regulated by DAF-19 in all ciliated sensory neurons in C. elegans does not seem to be under dRFX regulation in Drosophila. Among all the C. elegans genes expressed and regulated by DAF-19 in a subset of ciliated sensory neurons, only CG9398/tulp appears to be under dRFX control in Drosophila. All the others, such as oseg3, NudC or amo, do not appear to be regulated by dRFX in our assay conditions. However, we cannot exclude that these genes are under dRFX regulation in a small subset of ciliated sensory neurons and, thus, that variations of their expression cannot be detected by real-time RT-PCR of RNA preparations of pupal thoraxes and legs. Remarkably, genes that are involved in BBS and conserved in both organisms are regulated by RFX proteins. We quantified the expression of CG13232/BBS4 in Drosophila, the only BBS gene that is not found in the C. elegans genome, and show that it is also down-regulated 17-fold in a dRfx deficient background. Most of the other genes regulated by dRFX are involved in IFT. This transport is led by two types of molecular motors, anterograde kinesins and retrograde dyneins, that carry particles that can be biochemically fractionated as A and B complexes [1]. dRFX regulates genes encoding B complex components, but not A complex components.

Table 1 RFX target genes in C. elegans and D. melanogaster and in compartmentalized ciliogenesis

Genes specific to compartmentalized ciliogenesis are regulated by dRFX in Drosophila

Interestingly, most of the genes regulated by dRFX also fall in the list of genes for compartmentalized ciliogenesis (Cp ciliary type, Table 1) defined by the work of Avidor-Reiss et al. (Table 1) [13]. This group of genes is found only in genomes of species showing compartmentalized cilia biogenesis, but neither in the genomes of non-ciliated organisms nor in Plasmodium falciparum, which uses cytosolic cilia biogenesis. We thus tested the expression of almost all the genes described in the Cp category in control and dRfx deficient Drosophila. Among the 34 Cp ciliary genes tested by real-time PCR, 18 were down regulated more than 2-fold in a dRfx mutant background, 4 were significantly reduced between 1.5- and 2-fold and one was significantly over expressed. Eleven genes did not show significant expression variations between control and mutant background (Table 1).
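The binning of real-time PCR results described above can be sketched as follows. This is a minimal illustration, not the procedure used in this study: it assumes the widely used 2^-ΔΔCt calculation with roughly 100% amplification efficiency (the text only states that expression was measured relative to reference genes such as TBP), and the function names, the Ct values and the `significant` flag (standing in for whatever statistical test is applied to replicates) are hypothetical.

```python
def relative_expression(ct_target, ct_reference):
    """Target expression relative to a reference gene.

    Assumes ~100% PCR efficiency (one doubling per cycle); this is an
    assumption of the sketch, not a detail given in the text.
    """
    return 2.0 ** (ct_reference - ct_target)


def fold_down(ct_target_ctrl, ct_ref_ctrl, ct_target_mut, ct_ref_mut):
    """Control/mutant expression ratio; values above 1 mean lower expression in the mutant."""
    control = relative_expression(ct_target_ctrl, ct_ref_ctrl)
    mutant = relative_expression(ct_target_mut, ct_ref_mut)
    return control / mutant


def classify(fold, significant):
    """Bin a gene using the thresholds quoted in the text (2-fold and 1.5-fold)."""
    if not significant:
        return "unchanged"
    if fold > 2.0:
        return "down >2-fold"
    if fold >= 1.5:
        return "down 1.5- to 2-fold"
    if fold < 1.0:
        return "up"
    # Significant but below the 1.5-fold cutoff: treated as unchanged here.
    return "unchanged"


# Hypothetical Ct values: the target appears 4 cycles later in the mutant.
example_fold = fold_down(24.0, 20.0, 28.0, 20.0)
print(example_fold, classify(example_fold, significant=True))
```

With these bins, the 34 Cp genes reported above would fall into 18, 4, 1 and 11 genes for the four categories.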

In order to demonstrate the accuracy of our quantification procedure, we performed in vivo observations of reporter constructs of some of the genes in wild-type and dRfx deficient backgrounds (Figure 1). As previously published, sensory neuron ciliary endings are missing in a dRfx deficient background [23]. As observed in the cell body or remaining dendrite, the expression of osm-1 is totally shut down in the dRfx deficient background, whereas the expression of oseg1 is not affected (Figure 1), in agreement with real-time RT-PCR results. Interestingly, CG3259 and CG9227 cDNAs were hardly detectable by real-time PCR and, thus, difficult to quantify. However, in vivo observations of reporter constructs in wild-type and dRfx mutant backgrounds show a complete absence of expression of these two genes in the mutant background (Figure 1).

Figure 1

In vivo observations of reporter constructs in control or dRfx-deficient Drosophila. (a) Schematic of two typical chordotonal organs of the Drosophila leg or antenna. The different segments of the dendrite and of the ciliated ending are shown. Sensory neurons have a single cilium (arrow) extending from their dendrite (arrowhead). (b) Live confocal image of GFP driven expression of osm-1 transgene in a control femur. (c) GFP expression is totally shut down in a dRfx mutant background. (d-i) Confocal imaging of chordotonal neurons labeled with anti-ELAV (red) and anti-GFP (green). oseg1-GFP expression in (d) control flies and (e) a dRfx mutant background. Note that oseg1-GFP expression is not affected in the mutant background. CG3259-GFP expression in (f) control flies and (g) dRfx mutant flies. Reporter construct expression is totally shut down in the mutant background. Johnston's organs from antennae of adult flies carrying CG9227-GFP transgenes in (h) control and (i) dRfx mutant pupae. Note the absence of expression in the mutant background. Scale bar = 10 μm.

In summary, we show that RFX target genes are mainly conserved between C. elegans and D. melanogaster. Our functional comparative approach between both organisms combined with the work of Avidor-Reiss et al. in Drosophila allowed us to identify 27 genes that are regulated by dRFX in Drosophila. A majority of them are shown to be involved in ciliogenesis.

X-box conservation between D. melanogaster and D. pseudoobscura

As previously described [13, 14, 21, 36-39], the X-box promoter motif has been used successfully to screen for genes involved in ciliogenesis. As shown above, this first set of dRFX target genes in Drosophila is thus key to better understanding the link between X-box sequences and dRFX transcriptional control in Drosophila. We looked for X-boxes in the promoters of dRFX target genes. We searched for X-boxes up to 3 kb upstream of the ATG for each of them, with the most degenerate X-box consensus deduced to date from known RFX protein binding sites (RYYNYY N1-3 RRNRAC). We could identify several X-boxes for each gene (Table 2, columns 2 and 3). However, known negative control genes also presented X-boxes at the same frequency and no particular constraint on the consensus seemed to correlate with one set of genes. Therefore, the presence of an X-box upstream of the ATG of a single gene is not predictive of dRFX-dependent expression. We thus turned to the D. pseudoobscura genome. The two Drosophila species diverged from their most recent common ancestor 40-60 million years ago. The average identity of coding sequence between D. melanogaster and D. pseudoobscura at the nucleotide level is 70% for the first and second bases of codons, and 49% for the wobble base. Intron sequences are 40% identical, untranslated regions 45-50%, and the identity of DNA protein binding sites extracted from the literature has been estimated at an average of 63% [40]. Moreover, detailed comparison of both Drosophila genomes showed that 50-70% of known DNA binding sites reside in conserved sequence blocks in the genomes, called conserved regulatory elements (CREs), whereas the overall conservation of the cis-regulatory regions is low [41-43].

Table 2 X-box comparisons in promoters of dRFX regulated genes, between Drosophila melanogaster and Drosophila pseudoobscura

We thus looked for D. pseudoobscura homologs of either dRFX positively regulated or invariant genes and for X-boxes up to 3 kb upstream of the ATG. Interestingly, 70% of conserved dRFX target genes present a conserved X-box in both species (Table 2), whereas only 23% of negative control genes present the same characteristic. Even more precisely, while the sequence and the location of X-boxes for dRFX target genes are conserved, this is not the case for negative control genes. Interestingly, palindromic X-boxes are significantly over-represented compared to non-palindromic X-box sequences in dRFX regulated genes in the two species.

We also looked for overall sequence conservation around the selected X-boxes by Vista promoter sequence comparison between the two Drosophila species. The percentage of identities was quantified either on 100 bp or 25 bp windows surrounding the X-boxes (Figure 2, Table 2) and block conservation was considered positive if identities were over 50%. As shown in Table 2, sequences around the X-boxes are generally not well conserved. Two representative examples are depicted in Figure 2. For the CG9595/osm-6 gene, one of the two conserved X-boxes falls into an overall conserved 100 bp block, whereas the other one does not. For CG8853/che-13, the X-box falls into a poorly conserved region. These results are in agreement with previously published data showing that sequence block conservation alone cannot discriminate regulatory regions, but that binding site clusters present in multiple species more likely discriminate active and inactive clusters [43].
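The window-based conservation measure described above can be sketched as follows. This is only an illustration under stated assumptions: the two promoter sequences are taken to be already aligned (gaps written as "-"), identity is computed over all alignment columns in the window, and the function names and toy sequences are hypothetical. The VISTA tools compute their own scores, which this sketch does not reproduce.

```python
def window_identity(aln_a, aln_b, center, window):
    """Percent identity of two aligned sequences in a window around `center`.

    `aln_a` and `aln_b` are rows of a pairwise alignment (same length,
    '-' for gaps); `center` is an alignment column, e.g. the middle of
    an X-box. Gap columns count as mismatches (an assumption).
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have the same length")
    start = max(0, center - window // 2)
    end = min(len(aln_a), start + window)
    columns = list(zip(aln_a[start:end], aln_b[start:end]))
    if not columns:
        return 0.0
    matches = sum(a == b and a != "-" for a, b in columns)
    return 100.0 * matches / len(columns)


def is_conserved_block(identity, threshold=50.0):
    """A block is called conserved when identity exceeds 50%, as in the text."""
    return identity > threshold


# Toy alignment: the first half is identical, the second half diverged.
seq_mel = "ACGT" * 10
seq_pse = "ACGT" * 5 + "TTTT" * 5
print(window_identity(seq_mel, seq_pse, center=10, window=20))
print(window_identity(seq_mel, seq_pse, center=30, window=20))
```

Running the same function with `window=100` and `window=25` around each X-box gives the two scales mentioned in the text.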

Figure 2

Promoter comparisons between Drosophila species. Sequence identities (from 50-100%) between different Drosophila species ranging from D. melanogaster to the most distant D. virilis as calculated and presented in the VISTA interface [91] for two dRfx target genes, CG9595 (osm-6/NDG5) and CG8853 (IFT55/che-13/Hippi). Coding sequences are depicted in dark blue, untranslated regions are in light blue and other conserved regions in pink. Gene orientation is shown by a horizontal arrow. The location of conserved X-boxes for each gene is indicated by numbered vertical arrows. Note that one conserved X-box for osm-6 is in a conserved block of sequence, while others (osm-6 and che-13) are not.

Screening Drosophila species' genomes for dRFX regulated genes

The presence of a conserved X-box upstream of genes in both D. melanogaster and D. pseudoobscura is thus a good prognostic factor to predict novel dRFX target genes. We thus screened the genome of both Drosophila species for the presence of X-boxes. We searched for all possible matches to a defined motif sequence using a Perl based algorithm [36]. The most degenerated consensus RYYNYY N1-3 RRNRAC found 50,000 hits throughout the entire genome of D. melanogaster and, therefore, could not be used within our experimental framework. We selected five different more restricted consensus motifs that cover X-boxes of the entire set of known target genes at the time (see Materials and methods). Four (RYYVYY N1-3 RRHRAC, GYTNYY N1-3 RRNRAC, GYTDYY N1-3 RRNRAC, GYTRYY N1-3 RRHRAC) were searched in a 1 kb window upstream of the ATG, and the less degenerated one, RTNRCC N1-3 RGYAAC, in a 3 kb window.
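A consensus search of this kind can be sketched in a few lines. The original screen used a Perl-based algorithm [36]; the Python version below is not that program, only an illustration of the idea. It expands the IUPAC codes of each half-site into a regular expression, allows a 1-3 nucleotide spacer, and scans a window upstream of the ATG. Scanning both strands, the coordinate convention and all function names are assumptions of this sketch.

```python
import re

# IUPAC nucleotide codes expressed as regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

# Consensus half-sites and search windows (bp upstream of the ATG) from the text.
MOTIFS = [
    ("RYYVYY", "RRHRAC", 1000),
    ("GYTNYY", "RRNRAC", 1000),
    ("GYTDYY", "RRNRAC", 1000),
    ("GYTRYY", "RRHRAC", 1000),
    ("RTNRCC", "RGYAAC", 3000),
]


def xbox_regex(left, right, min_spacer=1, max_spacer=3):
    """Compile 'left N(min-max) right'; the lookahead reports overlapping sites."""
    left_re = "".join(IUPAC[c] for c in left)
    right_re = "".join(IUPAC[c] for c in right)
    spacer = "[ACGT]{%d,%d}" % (min_spacer, max_spacer)
    return re.compile("(?=(" + left_re + spacer + right_re + "))")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_xboxes(upstream, left, right, window):
    """Return (strand, distance to ATG, site) for each match.

    `upstream` is the sequence 5'->3' ending just before the ATG; the
    distance is measured from the leftmost base of the site on the
    given strand to the end of the sequence.
    """
    region = upstream.upper()[-window:]
    pattern = xbox_regex(left, right)
    hits = []
    for strand, seq in (("+", region), ("-", revcomp(region))):
        for match in pattern.finditer(seq):
            site = match.group(1)
            if strand == "+":
                start = match.start()
            else:
                start = len(region) - match.start() - len(site)
            hits.append((strand, len(region) - start, site))
    return hits


# Toy promoter containing one site that fits GYTRYY N1-3 RRHRAC.
toy = "TTTT" + "GTTGCC" + "AT" + "AGCAAC" + "TTTT"
print(find_xboxes(toy, "GYTRYY", "RRHRAC", 1000))
```

Because the X-box is an imperfect inverted repeat, a near-palindromic site such as the one in this toy sequence is reported on both strands at the same position.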

Under these conditions, 4,726 non-redundant genes in D. melanogaster and 3,848 in D. pseudoobscura with an X-box upstream of the start codon were selected. Based on a best hit reciprocal search between the two coding sequence (CDS) lists, we identified 1,462 homologous genes having an X-box in their 5' region in both species. This first set of 1,462 genes was further restricted by selecting only genes that share an X-box with no more than 4 bases different (out of the 12 nucleotides recognized by the protein on either side of the spacer) between each species and in a conserved position upstream of the ATG (500 bp difference at most). The list was thus restricted to a subset of 412 genes (Additional data file 1). An even more restricted subset of genes was selected using the X-box motif GYTRYY N1-3 RRHRAC, which was found upstream of most known target RFX genes at the beginning of this work, leading to a list of 83 genes (Table 3). Indeed, among the identified dRFX target genes for which a conserved X box was found in both Drosophila species (Table 2), the highest percentage of target genes (50%, 8 out of 16) was found in this list of 83 genes. The remaining 50% of known RFX target genes (Table 2) were not selected by the X-box screen and thus represent false negatives (see Discussion for a comprehensive analysis).
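The conservation filter that reduces the 1,462 genes to 412 can be sketched as follows. This is a minimal illustration under assumptions: each X-box is represented as a (distance upstream of the ATG, sequence) pair, each half-site is taken to be the 6 outermost nucleotides so that spacers of different lengths are ignored, and the function names and example sites are hypothetical.

```python
def half_site_mismatches(xbox_a, xbox_b, half=6):
    """Count differences over the 12 half-site nucleotides, ignoring the spacer."""
    sites_a = xbox_a[:half] + xbox_a[-half:]
    sites_b = xbox_b[:half] + xbox_b[-half:]
    return sum(x != y for x, y in zip(sites_a, sites_b))


def is_conserved_xbox(hit_mel, hit_pse, max_mismatches=4, max_shift=500):
    """Apply the two criteria from the text.

    A pair of X-boxes passes when no more than 4 of the 12 half-site
    bases differ and the positions upstream of the ATG differ by at
    most 500 bp.
    """
    (pos_mel, seq_mel), (pos_pse, seq_pse) = hit_mel, hit_pse
    similar = half_site_mismatches(seq_mel, seq_pse) <= max_mismatches
    close = abs(pos_mel - pos_pse) <= max_shift
    return similar and close


# Hypothetical sites: one half-site substitution and a shorter spacer.
mel_site = (120, "GTTGCCATAGCAAC")
pse_site = (480, "GTTACCTAGCAAC")
print(half_site_mismatches(mel_site[1], pse_site[1]))
print(is_conserved_xbox(mel_site, pse_site))
```

In a full screen this test would be applied to every pair of X-boxes found upstream of two genes paired by the reciprocal best hit search, keeping the gene pair if at least one X-box pair passes.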

Table 3 Eighty-three genes selected for a conserved X-box between D. melanogaster and D. pseudoobscura

X-box genes and ciliogenesis

In order to check for enrichment of genes involved in ciliogenesis, we compared our three X-box gene lists to previously published lists of genes potentially involved in cilium or centrosome composition. We first identified the Drosophila homologs for the full set of previously published genes from various organisms from several studies. These include comparative genomic studies of species that have cilia versus species that do not and proteomic analyses of human cilia and centrosome, Chlamydomonas flagellar or basal body and Trypanosoma brucei proteomes [10-14, 44, 45]. This set also includes recent genome-wide transcriptional analysis of gene expression during flagellar regeneration in Chlamydomonas or identified by SAGE analysis of ciliated neurons combined with X-box searches in C. elegans [15, 36, 37]. The full set of Drosophila homologs that we found for all studies combined is listed as the DCBB gene set (Additional data file 2).

Interestingly, comparing our set of 1,462 Drosophila X-box candidate genes with the DCBB dataset shows that our list is slightly enriched in DCBB genes. Whereas 5% of the D. melanogaster genome is in the DCBB dataset, our 412 and the 83 X-box gene candidate datasets appear to be highly enriched in DCBB genes (11% and 22%, respectively), suggesting that the X-box conservation is a good marker for genes potentially involved in ciliogenesis (Table 4).
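The enrichment figures quoted above can be expressed as fold enrichment over the genome-wide fraction. The sketch below computes that ratio from the percentages in the text and adds a hypergeometric upper-tail probability as one possible way to judge such an overlap. The text does not state that this test was applied, so it is an assumption of this sketch, and the function names and the toy counts in the usage lines are hypothetical rather than taken from Table 4.

```python
from math import comb


def fold_enrichment(list_fraction, genome_fraction):
    """Ratio of the DCBB fraction in a gene list to the genome-wide fraction."""
    return list_fraction / genome_fraction


def hypergeom_upper_tail(population, successes, draws, observed):
    """P(X >= observed) when drawing `draws` genes without replacement.

    `population` is the number of genes considered, `successes` the
    number of them in the reference set, `observed` the overlap seen.
    """
    total = comb(population, draws)
    upper = min(successes, draws)
    favourable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(observed, upper + 1)
    )
    return favourable / total


# Percentages from the text: 5% genome-wide, 11% and 22% in the two lists.
print(round(fold_enrichment(0.11, 0.05), 2))
print(round(fold_enrichment(0.22, 0.05), 2))

# Toy numbers only, to show the call: 10 genes, 5 in the reference set,
# 2 drawn, both of them in the reference set.
print(hypergeom_upper_tail(10, 5, 2, 2))
```

On the percentages alone, the 412-gene and 83-gene lists are roughly 2.2-fold and 4.4-fold enriched in DCBB genes relative to the genome.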

Table 4 Comparisons of Drosophila X-box candidate genes with the Drosophila cilia and basal body genes

The full set of genes with a putative function in ciliogenesis has also been summarized in parallel in two independent databases called the Ciliary proteome and Ciliome databases [46-49]. Surprisingly, when we compared the two published databases with the DCBB dataset that we established for Drosophila using similar comparative methods (see Materials and methods and Additional data file 2), we observed large discrepancies between all three datasets (illustrated in Figure 3 and Additional data file 3). There are some differences between the three studies with regard to the initial published sets of genes that were included in the database. The major difference resides in which data are included from the work of Blacque et al. [37]. The Ciliome database [47] includes the complete SAGE dataset from Table S1 in [37], whereas our DCBB dataset includes only data from Table 1 of Blacque et al. [37], which contains part of the SAGE data combined with an X-box search. The Ciliary proteome database [46] includes data from Table S4 of the Blacque et al. study [37], which reports the list of putative X-box genes in the nematode. These differences could account for the high number of genes exclusively represented in the Ciliome database [47] but cannot account for all the discrepancies between our DCBB dataset and the Ciliary proteome database [46] (Additional data file 3). Very likely, the differences observed between all three studies illustrate the problems inherent in automatically processing published tables and gene lists that are then used to compile homologous genes from several different organisms. Another major explanation for the observed discrepancies resides in the order in which BLAST searches were performed to create each database. For example, the Ciliary proteome database [46] was obtained by looking first for human homologs for each study, and then for the Drosophila ones (unless Drosophila was the starting study). In our DCBB dataset, we looked directly for Drosophila homologs, which were then compared to other datasets. Hence, genes that do not have an ortholog in Drosophila or in human are lost in the respective studies.

Figure 3

Comparison of the DCBB set of genes with the Ciliary proteome and Ciliome databases. Venn diagram presenting the overlaps between the three datasets: the cilia proteome [46,48]; the ciliome [47,49], and the DCBB (Additional data file 2). Asterisks indicate this study. Note that only 412 common genes are found in the three datasets. The number of genes also found in the 1,462, 412 or 83 X-box gene lists (Table 4), respectively, are noted in parentheses. The numbers of genes selected in the different studies to construct each dataset are given in Additional data file 3.

However, we show that our lists of 412 and 83 X-box genes are enriched in genes involved in ciliogenesis, whatever database is considered (Table 3, Additional data file 1). Thus, our genome wide X-box consensus motif search allowed the establishment of promising sets of candidate genes for ciliogenesis studies.

Functional analysis of identified X-box genes

We performed functional expression studies to determine whether or not some of the 83 X-box genes (Table 3) are indeed under dRFX control and if they are involved in ciliogenesis. Twenty-five genes were tested by real time RT-PCR to compare their levels of expression in wild-type versus dRfx deficient fly samples. Interestingly, 16 are under dRFX control (Table 3, fold variation indicated in column 2). Among them, 11 have not yet been described as RFX targets in any biological system and two of them have no assigned function as of yet. Nine genes were not found to be under dRFX control (Table 3, noted as 'Neg' in column 2). Among 19 genes also represented in the DCBB dataset (Table 3, Additional data file 2), 17 were tested by real time PCR. Fourteen are indeed regulated by dRFX and only three do not appear to be regulated by it. The two remaining genes were not amplified by real time RT-PCR and, thus, could not be analyzed by this approach. Interestingly, among six genes that were not found in any ciliary database and whose expression was quantified by real-time PCR, two (CG13415/Cby, CG31036) were down-regulated in dRfx mutants. Thus, a high proportion of the genes on the list of 83 X-box genes are indeed dRFX target genes. The 58 remaining genes from this list that have not yet been analyzed are thus promising candidates. Our whole genome screen led to the identification of novel dRFX target genes.

Among the 11 novel dRFX target genes that we identified in this screen and that have never been described as RFX target genes in any organism, 9 do have a described or highly predictive function in ciliogenesis in other organisms. For example, CG15161 encodes the homolog of the IFT46 subunit in Chlamydomonas [50] and the dyf-6 ciliary gene in C. elegans [51]. CG15148/btv, CG3723 and CG17150 encode different dynein subunits. beethoven (btv) mutants show defects in sensory cilia in Drosophila [52], whereas no functional studies are available for CG3723, CG17150 or their orthologs in any biological system. CG6129 is the only Drosophila member of the rootletin family of proteins. In mammals, rootletin is necessary for retinal cilia stability and for centrosome cohesion [53-56]. CG4536/osm-9 encodes a vanilloid receptor of the transient receptor potential (TRP) family of ion channels. osm-9 is involved in sensory cilia function in Drosophila and C. elegans, and in mammals, TRPV4 plays a crucial role in ciliary activity [57]. CG9227/Tectonic has been described as being involved in Shh signaling in mouse [58]. It has been isolated by comparative genomics as a candidate for ciliogenesis and shown to be specific to ciliated cells in Drosophila [13]. CG13125 has recently been shown to be specific to species with motile cilia and its homolog, TbCMF46, is necessary for flagellar motility in T. brucei [59]. CG3259 encodes the MIP-T3 protein that associates with the tumor necrosis factor receptor in human cells. It is also an inhibitor of the IL13 signaling pathway that is known to repress ciliary differentiation of human epithelial cells in vitro [60-62]. It is expressed in ciliated sensory cells in Drosophila [13]. Thus, the gene CG3259 may have a direct function in ciliogenesis, which functional studies in Drosophila should help to decipher.

Interestingly, two novel dRFX target genes have not been described as being involved in ciliogenesis in any organism. CG13415/Chibby encodes a protein that interacts with the β-catenin protein and has been shown in Drosophila and in mammalian cells to antagonize the Wg/Wnt signaling pathway [63-65]. The second gene, CG31036, has an unknown function and no obvious ortholog in vertebrates. Protein structure prediction algorithms detect a central transmembrane domain and a signal peptide at the amino-terminus of the protein encoded by CG31036.

Expression profile of three novel dRFX target genes

In order to further validate our screen, we chose three genes (CG6129/rootletin, CG13125/TbCMF46 and CG31036) for in vivo study. CG6129 was selected to address the question of conservation in Drosophila of the dual role described in mammals for the rootletin protein in centrosome and ciliary biology. CG13125 is of particular interest to evaluate the possible involvement of a 'motility gene' in Drosophila sensory cilia. Last, since nothing was known about CG31036, we wanted to address whether this gene is involved in ciliogenesis and, thus, validate the overall X-box screening strategy.

Reporter constructs were made by cloning large promoter fragments including the conserved X-box, plus coding sequences in frame with green fluorescent protein (GFP). Transgenic flies were established and analyzed for GFP expression. Two types of ciliated cells have been described in Drosophila: spermatozoa and type I sensory neurons that innervate the proprioceptive chordotonal organs and external sensory organs that are mechano- or chemosensory. Remarkably, the expression of all three reporter constructs was observed only in type I sensory neurons. As a control, reporter GFP expression was compared to mRNA expression by in situ hybridization. CG6129/rootletin protein expression reproduces the expression of the transcript in only type I sensory neurons of the embryo (data not shown). CG31036 RNA expression is also available from the BDGP database [66]. CG31036 mRNA is restricted to type I sensory neurons of the head, thoraxes and abdomen of the embryo and reflects the protein expression of our transgene. However, we did not observe a strong protein expression in the gut as observed for the transcript. This could either reflect a non-specific hybridization signal or the presence of other transcript isoforms driven by a different promoter. We could not detect CG13125 transcripts by in situ hybridization, likely illustrating the faint expression of this gene in Drosophila.

Chimeric CG6129::GFP protein was present in the rootlet processes of the chordotonal dendrites, in agreement with the predicted function of rootletin in ciliary rootlet organization (Figure 4). It was also detected faintly at the cilium tip (Figure 4d) and clearly in axons (Figure 4). Since our construct does not include all the coding sequences of the rootletin protein, it is possible that the GFP expression does not reflect the exact location of the endogenous protein. Rootletin has been shown in mammalian cell culture to be localized to the ciliary rootlet and to be involved in centrosome cohesion [56]. We show that CG6129/Rootletin expression is restricted to ciliated chordotonal neurons in Drosophila, thus suggesting an involvement only in ciliogenesis. Despite strong GFP expression in the chordotonal organs, no expression was observed in the ciliated sensory neurons that innervate external sensory organs. Either the expression in those cells is too weak, or ciliary rootlets in Drosophila, as represented by CG6129/rootletin GFP expression, are restricted only to chordotonal organs, as observed previously by electron microscopy [67, 68].

Figure 4

Reporter GFP expression studies for three X-box containing genes. (a) Stereotypical arrangement of type I sensory neurons in a Drosophila embryo, anterior to the left, stained with the 21A6 antibody with a magnification of the dorsal and lateral neurons of one abdominal segment as visualized in (f,k). The arrowhead indicates the five lateral chordotonal neurons (ch) and the arrows point to the neurons of the external sensory (es) organs. (b) Schematic of two typical chordotonal organs of Drosophila. (c-l) Confocal imaging of GFP expression of transgenic lines carrying the promoter region and coding sequences fused to the GFP for CG6129/rootletin (c-e), CG31036 (f-j) and CG13125/TbCMF46 (k,l). GFP expression is only observed in ciliated sensory neurons of D. melanogaster where the chimeric GFP proteins are localized to the ciliary apparatus. (c) CG6129::GFP reporter expression (green) is observed in embryonic chordotonal organs, mainly along the dendrite from the base of the cilium to the cell body. The 21A6 antibody (red, see Materials and methods) labels the ciliary dilation of the cilium. (d,e) Live GFP imaging of the lateral pentascolopidial chordotonal organs in male third instar larvae of dRfx deficient (e) and control (d) sibs. The elav-RFP expression (red) labels all neurons. CG6129/rootletin is regulated by dRfx as no GFP expression is observed in dRfx deficient larvae. (f-h) CG31036::GFP reporter expression (green) is observed both in the external sensory neurons (arrows in (f)) and the chordotonal neurons in the embryo (arrowhead in (f-h)). The 21A6 antibody (red) labels the ciliary dilation at the tip of the dendrite. CG31036::GFP protein localization appears to be slightly different depending on the fixative used (paraformaldehyde in (g), methanol in (h)). (i,j) Immunodetection of CG31036::GFP expression in leg chordotonal organs of 72-hour pupae in dRfx deficient (j) or control (i) sibs. The anti-ELAV antibody (red) labels all neurons. No CG31036::GFP expression is observed in dRfx deficient pupae (j). (k,l) CG13125::GFP expression is observed by immunodetection in the embryonic chordotonal organs (arrowheads in (k,l)) but also in the external sensory neurons (arrows in (k)). A higher magnification of the lateral chordotonal organs (l) shows that GFP is apposed to the 21A6 immunostaining (red). Scale bar = 10 μm.

CG31036::GFP specifically marks the ciliated endings of chordotonal neurons and confirms that this novel protein is a component of ciliated endings (Figure 4). The GFP signal is apposed to the 21A6 antibody staining, directed against the eyes shut protein, which has been described to locate at the ciliary dilation around the tip of the ciliated segment [69]. This implies that CG31036::GFP most likely locates to the tip of the tubular bundle that extends after the ciliary dilation (schematic in Figure 1a). However, only ultrastructural observations of immunogold labelings will allow precise subcellular localization of both CG6129/rootletin and CG31036. Interestingly, CG31036::GFP expression is also detectable in external sensory neurons as a dot apposed to the 21A6 antibody staining (Figure 4f). Finally, we confirmed that both reporter constructs are under dRfx control as the GFP signal was completely shut down in a dRfx mutant background (compare Figure 4d and 4e or 4i and 4j).

For the third construct, CG13125::GFP localization was consistently observed in the chordotonal neurons at the base of the cilium, presumably the basal body region, and also at the tip of what is likely the cilium. GFP expression was also often observed in the external sensory neurons as a dot but without consistent reproducibility, probably illustrating a threshold level of expression for these cells and the faint level of expression of the CG13125/TbCMF46 transgene (Figure 4k,l).

In conclusion, the three novel dRFX target genes that we identified in our X-box motif searches are indeed under dRFX control in vivo and specifically expressed in ciliated sensory neurons in Drosophila. In addition, they encode proteins that are localized to the base or the tip of the cilium, thus suggesting a role in ciliary structure or function.


Ciliogenic RFX regulatory networks are conserved between C. elegans and D. melanogaster. Based on these first observations, the genomic screens we conducted combined with functional and in vivo gene analyses led to the identification of at least 11 novel genes that had never been described as RFX targets in any biological model. In addition, our screen allowed us to identify at least two novel genes specifically expressed in ciliated sensory neurons in Drosophila that are potentially involved in sensory ciliogenesis. These results validate the accuracy of our screens. Our work thus provides a new set of candidate genes for further functional studies in ciliogenesis.

Molecular nature of RFX target gene products

Our Drosophila genome wide X-box screen led to the identification of 83 X-box genes among which we report 11 novel RFX targets. Combined with the genes identified by comparisons to C. elegans or to other genomic studies in Drosophila (Table 1) [13], we report 35 genes regulated by dRFX in Drosophila. Most of these genes can be classified based on their described function. Many of the RFX target genes are involved in IFT, which is necessary for cilium assembly and function [1]. Remarkably, a second class of genes regulated by dRFX includes all the Drosophila homologs of BBS genes. Similarly, most C. elegans BBS genes are regulated by DAF-19 [14, 36, 37]. This strong dependence of BBS genes on RFX control may thus be conserved in mammals. Hence, RFX proteins may be involved in BBS in humans. Interestingly, two of the three Drosophila genes coding for proteins with B9 domains are also controlled by dRFX (tectonic, CG14870). One human B9 domain protein, MKS1, is known to be involved in the human Meckel-Gruber syndrome [70]. The molecular function of this domain is unknown, and work in Drosophila suggested that these two B9 domain containing proteins are likely involved in ciliogenesis [13]. Several of the novel dRFX target genes that we identified in this study encode known components of the ciliary axoneme and associated structures, such as axonemal dyneins or rootletin. Other genes encode different types of proteins likely involved in sensory transduction (CG4536/osm-9/TRPV4 or MIP-T3). A last class includes genes for which the function is either not described or poorly understood, such as CG31036 and CG13125. However, our functional studies suggest that they are also involved in sensory ciliogenesis in Drosophila. Thus, RFX target genes play various roles in ciliary structure and function, and our X-box search strategy has proven to be useful to identify novel ciliogenic genes.

Database mining using the X-box promoter motif

This full set of dRFX target genes in Drosophila is of crucial importance, as we can now more precisely define X-box sequences and the promoter context required for dRFX control. This will be particularly useful for further database mining of dRFX target genes in Drosophila. In fact, several genes that are under dRFX control (Table 1, for example CG4525, CG17599) and for which an X-box can be identified did not come out in the whole genome X-box screen. Several reasons can explain this result. First, homologs were not all annotated in the CDS listings that were available at the time of the search (for example, CG18631, CG9595, nompB in D. pseudoobscura). Second, annotation of both Drosophila databases is incomplete, as the start codon is sometimes not properly defined. Our X-box search algorithm keeps only genes for which the X-box match is upstream of the ATG. For example, for CG15666/GA13881, we clearly predict that the correct ATG should be considered 75 bp downstream of the currently defined ATG, based on evolutionarily conserved sequences. This definition clearly excludes the homologous genes CG15666 and GA13881 from the dataset. However, as illustrated in Table 2, in a few cases, our X-box consensus cannot define a clearly conserved X-box match in the two Drosophila species for genes that appear to be down-regulated in a dRfx mutant, while several individual X-boxes are found separately in each organism. This could either reflect that these genes are not direct dRFX targets but are shut down by a feedback control loop that is not dependent on an X-box motif, or that the X-box is only loosely conserved in some promoter contexts. Notably, homologs of these genes in C. elegans are under RFX (DAF-19) control and have a well defined X-box (for example, CG9333/che-2, CG13691/bbs-8), which argues in favor of the second possibility. 
Interestingly, we also quantified the expression levels in control and dRfx deficient Drosophila of several genes of the DCBB dataset that did not come out of the X-box genome-wide motif search. This allowed us to identify several novel genes that are indeed down-regulated in dRfx mutants, but for which no conserved X-box can be recognized based on our initial consensus motif (AL, unpublished). Altogether, our observations clearly highlight the difficulties encountered in motif definition in promoters. Similar conclusions were drawn from a parallel approach performed in C. elegans, which has led to the identification of several novel DAF-19 target genes [38]. Interestingly, in that study the in silico search was associated with microarray analysis of transcripts in wild-type and daf-19 mutant worms. The in silico search allowed the identification of 93 X-box genes. Yet, among the 466 genes that were shown to be down-regulated at least two-fold in microarray hybridization experiments, only 25 were also represented in the list of 93 in silico X-box genes. Thus, in silico searches on isolated motifs are likely hampered by a high level of false negatives. To improve the screening efficiency, combinatorial motif searches would probably greatly enhance the accuracy of the screen, as proposed by other studies [71, 72]. Even though the conserved X-boxes that we identified are rarely associated with highly conserved surrounding sequences (Table 2), it is reasonable to assume that other conserved nearby motifs, still to be identified, could help to discriminate between false positives and false negatives.
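The overlap reported for the C. elegans study can be restated as simple proportions. The sketch below is only a back-of-the-envelope illustration using the three counts quoted above; the function name and the returned keys are hypothetical. Because the down-regulated set probably includes indirect targets, the fraction lacking an X-box should be read as an upper bound on the false-negative rate rather than a measurement of it.

```python
def overlap_summary(n_motif, n_expression, n_both):
    """Summarize the overlap between a motif-based and an expression-based gene list.

    n_motif      -- genes found by the in silico X-box search
    n_expression -- genes down-regulated at least two-fold in the mutant
    n_both       -- genes present in both lists
    """
    return {
        # Share of the in silico candidates supported by the expression data
        "motif_hits_confirmed": n_both / n_motif,
        # Share of the down-regulated genes that the motif search recovered
        "expression_hits_recovered": n_both / n_expression,
        # Down-regulated genes with no X-box match (direct or indirect targets)
        "expression_hits_missed": n_expression - n_both,
    }


# Counts quoted in the text for the C. elegans study [38]
summary = overlap_summary(n_motif=93, n_expression=466, n_both=25)
for key, value in summary.items():
    print(key, round(value, 3))
```

With these counts, roughly a quarter of the in silico candidates are supported by the microarray data, and only about one in twenty of the down-regulated genes carries a recognized X-box.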

Regulatory network of ciliary genes

We have identified 35 genes that are transcriptionally down-regulated in dRfx mutants. We show that RFX regulatory networks are conserved between C. elegans and Drosophila, as most of the genes controlled by DAF-19 in C. elegans are also under dRFX control in D. melanogaster. Interestingly, our results show that only certain subsets of ciliogenic genes are regulated by RFX proteins. For example, in our assay conditions none of the genes known to be involved in IFT-A complexes is regulated by dRFX, whereas all IFT-B homologous proteins are regulated by dRFX. In addition, retrograde motors are also regulated by dRFX (CG15148/btv and CG3769), whereas anterograde motors seem not to be. Indeed, in addition to CG10642/KIF3A, the main described anterograde motor in several organisms, we have shown that two other kinesin subunits, CG17461/Kif3C/osm-3 and CG7293/Klp68D, are invariantly expressed in wild-type and dRfx-deficient Drosophila (AL, data not shown). It is also interesting to note that all the BBS gene homologs in D. melanogaster are under dRFX control (Table 1).

The biological significance of these observations is unclear. It could reflect the fact that IFT-B proteins, BBS proteins and the dyneins involved in IFT are dedicated to ciliogenesis and, therefore, need to be turned on concomitantly only when the cilium is formed, whereas IFT-A complexes or anterograde transport kinesin II share more complex regulatory controls as they might be necessary also for other cellular functions. This is the case for kinesin II motors [73], but does not seem to be true for IFT-A complexes as these proteins are proposed to be specific for ciliated organisms [13]. In C. elegans, the ciliary IFT machinery works in modular fashion [74], and it is tempting to speculate that RFX-dependent proteins could be involved in specialized ciliogenic transport modules.

Genes necessary for centriole biogenesis or replication, such as the recently described DSas-6, DSas-4 or sak genes [75-78], are not present in our screen and no conserved X-box can be found upstream of these genes. Thus, dRFX does not seem to regulate centriole biogenesis and appears to be restricted to cilia assembly only.

Identifying the transcription factors that govern other sets of ciliary proteins is an obvious next step. Based on our data, it would be of particular interest to compare the promoter sequences of genes that are regulated by dRFX with those of genes that are not. This may allow us to discover novel regulatory motifs and protein modules that are necessary to coordinate ciliogenesis control. So far, only a few transcription factors have been shown to be involved in the control of ciliogenesis: the RFX proteins [21, 23, 24], Foxj1 [16], and HNF1-beta [17]. However, the last two have no obvious homologs in Drosophila. Thus, our work strongly suggests that novel transcription factors necessary for ciliogenesis still need to be discovered.

Novel RFX target genes

Some of the novel RFX target genes found in Drosophila were unexpected. For example, we identified several proteins that are proposed to be involved in flagella or cilia motility, such as dynein heavy chains (CG17150/Dhc93AB). Recently, a CG13125 homolog has also been shown to function as a motility factor in T. brucei (TbCMF46) [59]. Sensory cilia are thought not to be motile in general. However, it has been shown that Drosophila chordotonal neurons of the antenna generate motion that depends on the integrity of proteins encoded by genes such as CG15148/btv (cytoplasmic dynein heavy chain) or CG14620/tilB (LRRC6 homolog), described to affect the axonemal structure [52, 79] (D Eberl, personal communication). In addition, cilia of the chordotonal neurons of the grasshopper bend upon vibration stimulation [80]. Thus, proteins involved in axonemal motility might be important for motion generation of the cilium in response to mechanical stimulation. It will be of high interest to determine whether flies defective in these 'motility' genes are affected in hearing and, more specifically, in the motility of the mechanosensory cilium that amplifies hearing vibrations. Interestingly, CG13125/TbCMF46 does not seem to be expressed in fly testis (AL, unpublished), where the spermatozoa are the only cell type with a motile flagellum in flies. This suggests that like CG15148/btv, CG13125/TbCMF46 function could be restricted to the sensory cilium and, more specifically, in allowing these cilia to mechanically respond to auditory vibrations [52]. Thus, our data suggest that in the fly, possible axonemal motility could be regulated by different subsets of proteins in sperm flagella and in mechanosensory cilia. This is of particular interest with regard to hearing in mammals, which is dependent on hair cell motility. It will be very interesting to determine whether the CG13125/TbCMF46 homolog in mammals does have a specific function in those cell types.

We also identified in our screen three genes (CG6054/Su(fu), CG13415/Cby, CG33038/Ext(2)) known to be involved in the hedgehog or wingless signaling pathways in Drosophila. Su(fu) and Ext(2) are involved in the Hedgehog pathway and Su(fu) is localized to cilia in mammalian cells [81]. However, Su(fu) and Ext(2) do not appear to be under dRfx control according to real-time PCR quantification results (Table 3) and may be false positives in our screen. This result argues in favor of the generally accepted observation that the Hedgehog signaling pathway does not seem to depend on ciliogenic proteins in Drosophila [82]. Only Chibby (Cby) is significantly down-regulated (two-fold) in a dRfx deficient background. Cby was isolated in a two-hybrid screen for armadillo/beta-catenin interactors. RNAi knock-down of Cby in Drosophila embryos leads to ectopic activation of the wingless pathway [63]. Cby is also described to antagonize the Wnt/beta-catenin pathway in mammalian cells [64, 65]. However, the expression pattern of Cby in Drosophila is not documented, so we do not know whether the variations of expression observed in the dRfx deficient background are connected to dRfx expression and, thus, whether they are biologically significant.

Among the 83 genes with conserved X-boxes between D. melanogaster and D. pseudoobscura (Table 3), several genes were hardly detectable by quantitative RT-PCR. Hence we were unable to determine by this approach if they are under dRFX control. This could reflect that these genes are expressed only in a subset of sensory neurons and, thus, difficult to detect by quantitative RT-PCR. Nevertheless, several genes are interesting as potential ciliogenic or RFX target genes. For example, CG14079 is homologous to a mouse protein that appears to be specific to testis. CG11356 is homologous to mammalian arl13, which has just been isolated in an ethyl-nitroso-urea screen for neural tube defects in mouse. Indeed, mutation of arl13 affects ciliary architecture and Sonic-Hedgehog signaling in mouse [83]. This gene, CG11356, was not found in any previous ciliogenesis study, again illustrating the accuracy of our screen. Functional studies in Drosophila will be of particular importance to demonstrate the role of this gene in sensory ciliogenesis.


We have identified more than 30 dRFX target genes in Drosophila by applying an X-box promoter motif search to two divergent Drosophila species in a comparative approach. These full sets of RFX dependent or independent ciliary genes are of particular importance for studies of X-box promoter motifs and associated promoter contexts in Drosophila. More remarkably, our screen allowed the identification of at least two novel genes specific to sensory ciliary architecture in D. melanogaster and provides several new RFX target gene candidates potentially involved in ciliogenesis. This is of particular importance with regard to the growing number of human diseases that are being associated with ciliary defects (for reviews, see [4, 5, 7]).

Materials and methods

Quantitative RT-PCR

Total RNA was extracted from 40-hour-old pupae using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) or RNeasy (Qiagen, Venlo, The Netherlands). The heads and abdomens of the pupae were removed, as well as internal organs and muscles, in order to enrich the extract as much as possible for sensory organs from thoraxes, legs and wings. DNA was digested with DNA-free reagent (Ambion, Austin, TX, USA). Reverse transcription (RT) was performed on 2 μg of RNA derived from pools of 5 thoraxes with random hexamers (Promega, Madison, WI, USA) and RevertAid™ H Minus M-MuLV reverse transcriptase (Fermentas, Burlington, Canada). Real-time PCR analysis was performed with SYBR Green fluorescent PCR (Qiagen) in a LightCycler (Roche, Basel, Switzerland) or a MX3000 (Stratagene, Cedar Creek, TX, USA) fluorescent temperature cycler. Primer sequences specific for each gene are available upon request. Primers were used at 0.5 μM. PCR conditions were as follows: 95°C, 15 minutes; 35 × (95°C, 15 s; 60°C, 20 s; 72°C, 20 s). According to melting point analysis, only one PCR product was amplified under these conditions. RNA extracted from wild-type samples was used to generate a standard quantification curve for each gene, allowing the calculation of relative amounts of transcripts in mutant samples compared to wild type. All reactions were performed with four biological replicates and two technical replicates. Results were normalized with respect to CG9874/TBP expression and standard errors of the mean were calculated. Results are expressed as relative mutant to wild-type expression ratios. Significance levels were tested with an unpaired t-test.
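The relative quantification described above can be sketched as follows. This is a minimal illustration, not the analysis pipeline actually used: it assumes a linear standard curve of threshold cycle (Ct) against log10 of relative template input, and all function names and numeric values are hypothetical.

```python
import math
import statistics


def fit_standard_curve(log10_inputs, cts):
    """Least-squares fit of Ct = slope * log10(input) + intercept."""
    n = len(cts)
    mean_x = sum(log10_inputs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in log10_inputs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(log10_inputs, cts))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def relative_quantity(ct, slope, intercept):
    """Invert the standard curve to obtain a relative amount of transcript."""
    return 10 ** ((ct - intercept) / slope)


def normalized_expression(target_ct, reference_ct, target_curve, reference_curve):
    """Target amount divided by the amount of the reference gene (here TBP)."""
    target = relative_quantity(target_ct, *target_curve)
    reference = relative_quantity(reference_ct, *reference_curve)
    return target / reference


def mutant_to_wild_type_ratio(mutant_values, wild_type_values):
    """Ratio of mean normalized expression, mutant over wild type."""
    return statistics.mean(mutant_values) / statistics.mean(wild_type_values)


def standard_error(values):
    """Standard error of the mean across biological replicates."""
    return statistics.stdev(values) / math.sqrt(len(values))


# Hypothetical ten-fold dilution series of wild-type cDNA
curve = fit_standard_curve([0.0, -1.0, -2.0], [20.0, 23.3, 26.6])
print(relative_quantity(23.3, *curve))  # close to 0.1
```

An unpaired t-test on the normalized replicate values would complete the analysis; it is left out here to keep the sketch free of third-party dependencies.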


Individual X-boxes (consensus RYYNYYN{1-3}RRNRAC) were searched for in the 5' upstream regions of ATGs on the same strand (+) and the antiparallel strand (-) in both D. melanogaster and D. pseudoobscura homologs [84]. Genome wide searches for X-box promoter motifs were primarily performed using a Perl-based algorithm that identifies all possible matches in a given DNA sequence. First, the algorithm finds all sequences that match a defined consensus; then the main module implements a cross-match file that compares a 3 kb window downstream of each match to a file containing the DNA sequences for all predicted genes [36]. Genome sequence information, gene prediction and CDS files for X-box searches were obtained from the following sources: the D. melanogaster complete genome sequence used was BDGP release 4; the complete CDS list was built from release 3.2.1 [85]. For D. pseudoobscura, the 28 August 2003 genome assembly and release 2.1 of CDS sequences from BCM-HGSC were used [40]. Reverse BLASTP analysis was performed between the two CDS files in order to establish a list of orthologous genes between the two fly species, with a BLAST E-value cut-off of < 1e-10. Comparisons of all listed gene information were performed on a Unix platform. BDGP and Flybase databases were mined for expression patterns and gene information. Genome conservation between the two fly species was evaluated using the VISTA interface [86].
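The consensus search itself can be illustrated with a regular expression. The sketch below is written in Python rather than the Perl used for the original algorithm, and it is not that algorithm: it only shows how the RYYNYYN{1-3}RRNRAC consensus translates into a pattern scanned on both strands of an upstream sequence. The function names and the example sequence are hypothetical, and the cross-match against predicted genes is not reproduced.

```python
import re

# IUPAC codes used in the consensus: R = purine, Y = pyrimidine, N = any base
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}


def consensus_to_regex(consensus):
    """Translate a consensus such as RYYNYYN{1-3}RRNRAC into a regex string."""
    # Rewrite the spacer notation {1-3} as the regex quantifier {1,3}
    pattern = re.sub(r"\{(\d+)-(\d+)\}", r"{\1,\2}", consensus)
    # Expand IUPAC letters; quantifier characters pass through unchanged
    return "".join(IUPAC.get(ch, ch) for ch in pattern)


# A lookahead lets overlapping matches be reported (one per start position)
XBOX = re.compile("(?=(" + consensus_to_regex("RYYNYYN{1-3}RRNRAC") + "))")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def find_xboxes(upstream):
    """Return (strand, start, match) tuples; start is 0-based on the + strand."""
    seq = upstream.upper()
    hits = [("+", m.start(), m.group(1)) for m in XBOX.finditer(seq)]
    for m in XBOX.finditer(reverse_complement(seq)):
        # Convert the coordinate on the reverse complement back to the + strand
        start = len(seq) - (m.start() + len(m.group(1)))
        hits.append(("-", start, m.group(1)))
    return hits


# Hypothetical upstream fragment containing a near-palindromic X-box
print(find_xboxes("AAAAGTTGCCATAGCAACTTTT"))
```

Because X-boxes are often close to palindromic, the same site can match on both strands, as in this example. A match with several valid spacer lengths from the same start position is reported once, using the longest spacer that fits.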

DCBB dataset

The ciliary and basal body genes in Additional data file 2 were identified using a reverse BLASTP strategy to define the best homologous proteins or genes described in the following studies: 210 proteins published in Table 2 from the human ciliary proteome [10] as modified by Marshall [87], 159 putative target genes of DAF-19 [36], 219 over expressed genes after deflagellation in C. reinhardtii described in Table 9 of Stolc et al. [15], 54 genes (Table 1) expressed in ciliated sensory neurons in C. elegans [37], 654 proteins identified in C. reinhardtii flagella [11], 380 proteins identified in the T. brucei flagella proteome [12] and 114 proteins listed in Table S1 for the human cell centrosome [44]. The following Drosophila homologs were extracted from published work: 260 genes described as homologous to the FABB proteins from C. reinhardtii in Table S1 of Li et al. [14], 51 genes described as homologous to 195 proteins described in Table S2 for the basal body proteome of C. reinhardtii [45] and 187 genes from Table S1 of compartmentalized cilia predicted genes, which has been modified to 188 genes according to Flybase annotation [13].
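The reverse BLASTP strategy can be sketched as a reciprocal best-hit filter. This is an assumption about what "reverse BLASTP" means here, not a description of the scripts actually used: the input is taken to be (query, subject, E-value) tuples parsed from two BLASTP runs, the function names are hypothetical, and the default threshold mirrors the E-value cut-off given for the ortholog list in the X-box searches. The E-values and all gene identifiers other than CG15666 and GA13881 are made up for illustration.

```python
def best_hits(hits, max_evalue=1e-10):
    """Keep, for each query, the subject with the lowest E-value below the cut-off."""
    best = {}
    for query, subject, evalue in hits:
        if evalue >= max_evalue:
            continue
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a, max_evalue=1e-10):
    """Pairs (a, b) where b is the best hit of a and a is the best hit of b."""
    forward = best_hits(a_vs_b, max_evalue)
    reverse = best_hits(b_vs_a, max_evalue)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Hypothetical BLASTP results between the two CDS sets
mel_vs_pse = [("CG15666", "GA13881", 1e-50),
              ("CG15666", "GA00001", 1e-20),
              ("CG00002", "GA13881", 1e-30)]
pse_vs_mel = [("GA13881", "CG15666", 1e-48),
              ("GA00001", "CG15666", 1e-18)]
print(reciprocal_best_hits(mel_vs_pse, pse_vs_mel))
```

Only pairs that are each other's best hit survive, which guards against assigning one gene to several paralogs in the other genome.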

Reporter constructs

DNA fragments were amplified from wild-type fly genomic DNA using the Expand Long Template PCR system (Roche). Cloning strategies used primers to clone in frame the gene of interest to the GFP sequence of the PW8-GFP vector [88]. CG13125::GFP plasmid, a 3,547 bp genomic DNA fragment containing the complete coding sequence of CG13125-RA and RB, was amplified from Canton-S using primers starting 1,484 bp upstream of the RB ATG until the penultimate codon of the gene. CG6129::GFP plasmid, a 4,129 bp genomic DNA fragment containing part of the CG6129-RB gene, was amplified by PCR from Charolles genomic DNA using primers starting 2,619 bp upstream of the RB ATG. CG31036::GFP plasmid, a 3,780 bp genomic DNA fragment containing part of the CG31036-RA gene, was amplified by PCR from Canton-S using primers starting 1,800 bp upstream of the ATG. All coding regions cloned were entirely sequenced prior to transgenesis. Transgenic lines were established by P-element mediated germline transformation as described [89].

The following fly stocks were used for experiments: P{mecCP:Gal437a1, P{Osm-1:Gal4}T17#7a1, P{CG9227:Gal4}T32#10a2 and P{CG3259:Gal4}T39#13a1 were gifts from Tomer Avidor-Reiss (Harvard Medical School, Boston, MA, USA). P{UAS-RFP}31 was a gift from Maurice Kernan (Stony Brook University, New-York, NY, USA) and P{GawB:elav}C155 and P{UAS-mCD8::GFP.L}LL6 strains were provided by the Drosophila Bloomington Stock Center, IN, USA.

Fly genetics and observations

Fly genotypes used to extract RNA were st dRfx253e ca/DfS143702 for mutants and st e ca/DfS143702 for control flies [23]. Control and mutant flies presented in Figure 1 share the same genotype with the exception of the third chromosome, which is heterozygous for the dRfx253 carrying chromosome in the controls. Genotypes for flies were: oseg1 flies, y w P{UAS:CD8:GFP}/Y; P{mecCP:Gal437a1}/+; P{UAS:CD8:GFP} Rfx253/Rfx49; osm-1 flies, w; P{Osm-1:Gal4T17#7a1}/+; P{UAS:CD8:GFP} Rfx253/Rfx49; CG3259 flies, y w P{UAS:CD8:GFP}; st Rfx253e P{CG3259:Gal4T39#13a1}/Rfx49; and CG9227 flies, y w P{UAS:CD8:GFP}/Y; st Rfx253 P{CG9227:Gal4T32#10a2}/Rfx49. Control and mutant flies presented in Figure 4 were sibs sorted from the same crosses. For CG6129 expression in a dRfx deficient background, females (w elavc155 P{UAS-RFP31}; Rfx49/TM6B, Tb) were crossed with males (w; P{CG6129:EGFP}M33; Rfx253/TM6B, Tb). For CG31036, the crosses were: females (w; P{CG31036:EGFP}F27; DfS143702/TM6B, Tb) with males (w; P{CG31036:EGFP}F27; Rfx253/TM6B, Tb).

The preparation of embryos for staining assays was carried out according to general methods described previously [90]. Live observations of dechorionated embryos and larvae were performed on mounted material under coverslips in DakoCytomation media. For pupae immunostaining, 72- to 96-hour-old animals were fixed for 20 minutes in 4% paraformaldehyde, 3% Triton X-100 in phosphate-buffered saline. Primary antibodies were rabbit anti-GFP (1:250) from Torrey Pines Biolabs (Houston, TX, USA) or (1:500) from Molecular Probes (Invitrogen, Carlsbad, CA, USA), mouse anti-eys 21A6 and mouse anti-Futch 22C10 (kindly provided by S Benzer), and mouse anti-elav 9F8A9 (1:500) obtained from the Developmental Studies Hybridoma Bank, Iowa City, IA, USA. Secondary conjugated antibodies were A488 and A546-anti-mouse and anti-rabbit (Molecular Probes, Invitrogen, Carlsbad, CA, USA). Images were obtained on a Zeiss Imager Z1 and LSM510 confocal microscope.

Additional data files

The following additional data are available with the online version of this paper. Additional data file 1 is a table listing the full set of 412 X-box genes conserved between D. melanogaster and D. pseudoobscura. Our X-box search across D. melanogaster and D. pseudoobscura species identified 412 genes with a conserved X-box both in sequence and distance upstream of the ATG of homologous genes between the two fly genomes. Additional data file 2 is a table of the DCBB genes list established for D. melanogaster. Genes presented in this table are homologous to proteins identified as putative or confirmed ciliary or basal body components. They are sorted as follows: a first group of genes with annotated molecular functions, a second group of genes for which homologs in vertebrates have been reported, and a third group of genes with no vertebrate homolog. Each category is sorted by the number of studies reporting each gene or its homolog. Additional data file 3 is a table listing the number of Drosophila genes homologous to ciliary genes identified in previously published studies.



BBS, Bardet-Biedl syndrome; bp, base pair; CDS, coding sequence; DCBB, Drosophila cilia and basal body; IFT, intraflagellar transport; RFX, regulatory factor X; TRP, transient receptor potential.


  1. Rosenbaum JL, Witman GB: Intraflagellar transport. Nat Rev Mol Cell Biol. 2002, 3: 813-825. 10.1038/nrm952.

  2. Pan J, Wang Q, Snell WJ: Cilium-generated signaling and cilia-related disorders. Lab Invest. 2005, 85: 452-463. 10.1038/labinvest.3700253.

  3. Marshall WF, Nonaka S: Cilia: tuning in to the cell's antenna. Curr Biol. 2006, 16: R604-614. 10.1016/j.cub.2006.07.012.

  4. Singla V, Reiter JF: The primary cilium as the cell's antenna: signaling at a sensory organelle. Science. 2006, 313: 629-633. 10.1126/science.1124534.

  5. Badano JL, Mitsuma N, Beales PL, Katsanis N: The ciliopathies: an emerging class of human genetic disorders. Annu Rev Genomics Hum Genet. 2006, 7: 125-148. 10.1146/annurev.genom.7.080505.115610.

  6. Snell WJ, Pan J, Wang Q: Cilia and flagella revealed: from flagellar assembly in Chlamydomonas to human obesity disorders. Cell. 2004, 117: 693-697. 10.1016/j.cell.2004.05.019.

  7. Bisgrove BW, Yost HJ: The roles of cilia in developmental disorders and disease. Development. 2006, 133: 4131-4143. 10.1242/dev.02595.

  8. Rosenbaum J: Intraflagellar transport. Curr Biol. 2002, 12: R125. 10.1016/S0960-9822(02)00703-0.

  9. Sloboda RD: A healthy understanding of intraflagellar transport. Cell Motil Cytoskeleton. 2002, 52: 1-8. 10.1002/cm.10035.

  10. Ostrowski LE, Blackburn K, Radde KM, Moyer MB, Schlatzer DM, Moseley A, Boucher RC: A proteomic analysis of human cilia: identification of novel components. Mol Cell Proteomics. 2002, 1: 451-465. 10.1074/mcp.M200037-MCP200.

  11. Pazour GJ, Agrin N, Leszyk J, Witman GB: Proteomic analysis of a eukaryotic cilium. J Cell Biol. 2005, 170: 103-113. 10.1083/jcb.200504008.

  12. Broadhead R, Dawe HR, Farr H, Griffiths S, Hart SR, Portman N, Shaw MK, Ginger ML, Gaskell SJ, McKean PG, et al: Flagellar motility is required for the viability of the bloodstream trypanosome. Nature. 2006, 440: 224-227. 10.1038/nature04541.

  13. Avidor-Reiss T, Maer AM, Koundakjian E, Polyanovsky A, Keil T, Subramaniam S, Zuker CS: Decoding cilia function: defining specialized genes required for compartmentalized cilia biogenesis. Cell. 2004, 117: 527-539. 10.1016/S0092-8674(04)00412-X.

  14. Li JB, Gerdes JM, Haycraft CJ, Fan Y, Teslovich TM, May-Simera H, Li H, Blacque OE, Li L, Leitch CC, et al: Comparative genomics identifies a flagellar and basal body proteome that includes the BBS5 human disease gene. Cell. 2004, 117: 541-552. 10.1016/S0092-8674(04)00450-7.

  15. Stolc V, Samanta MP, Tongprasit W, Marshall WF: Genome-wide transcriptional analysis of flagellar regeneration in Chlamydomonas reinhardtii identifies orthologs of ciliary disease genes. Proc Natl Acad Sci USA. 2005, 102: 3703-3707. 10.1073/pnas.0408358102.

  16. Brody SL, Yan XH, Wuerffel MK, Song SK, Shapiro SD: Ciliogenesis and left-right axis defects in forkhead factor HFH-4-null mice. Am J Respir Cell Mol Biol. 2000, 23: 45-51.

  17. Gresh L, Fischer E, Reimann A, Tanguy M, Garbay S, Shao X, Hiesberger T, Fiette L, Igarashi P, Yaniv M, et al: A transcriptional network in polycystic kidney disease. EMBO J. 2004, 23: 1657-1668. 10.1038/sj.emboj.7600160.

  18. Reith W, Herrero-Sanchez C, Kobr M, Silacci P, Berte C, Barras E, Fey S, Mach B: MHC class II regulatory factor RFX has a novel DNA binding domain and functionally independent dimerization domain. Genes Dev. 1990, 4: 1528-1540. 10.1101/gad.4.9.1528.

  19. Gajiwala KS, Chen H, Cornille F, Roques BP, Reith W, Mach B, Burley SK: Structure of the winged-helix protein hRFX1 reveals a new mode of DNA binding. Nature. 2000, 403: 916-921. 10.1038/35002634.

  20. Emery P, Durand B, Mach B, Reith W: RFX proteins, a novel family of DNA binding proteins conserved in the eukaryotic kingdom. Nucleic Acids Res. 1996, 24: 803-807. 10.1093/nar/24.5.803.

  21. Swoboda P, Adler HT, Thomas JH: The RFX-type transcription factor DAF-19 regulates sensory neuron cilium formation in C. elegans. Mol Cell. 2000, 5: 411-421. 10.1016/S1097-2765(00)80436-0.

  22. Vandaele C, Coulon-Bublex M, Couble P, Durand B: Drosophila regulatory factor X is an embryonic type I sensory neuron marker also expressed in spermatids and in the brain of Drosophila. Mech Dev. 2001, 103: 159-162. 10.1016/S0925-4773(01)00340-9.

  23. Dubruille R, Laurencon A, Vandaele C, Shishido E, Coulon-Bublex M, Swoboda P, Couble P, Kernan M, Durand B: Drosophila regulatory factor X is necessary for ciliated sensory neuron differentiation. Development. 2002, 129: 5487-5498. 10.1242/dev.00148.

  24. Bonnafe E, Touka M, AitLounis A, Baas D, Barras E, Ucla C, Moreau A, Flamant F, Dubruille R, Couble P, et al: The transcription factor RFX3 directs nodal cilium development and left-right asymmetry specification. Mol Cell Biol. 2004, 24: 4417-4427. 10.1128/MCB.24.10.4417-4427.2004.

  25. Baas D, Meiniel A, Benadiba C, Bonnafe E, Meiniel O, Reith W, Durand B: A deficiency in RFX3 causes hydrocephalus associated with abnormal differentiation of ependymal cells. Eur J Neurosci. 2006, 24: 1020-1030. 10.1111/j.1460-9568.2006.05002.x.

  26. 26.

    Ait-Lounis A, Baas D, Barras E, Benadiba C, Charollais A, Nlend Nlend R, Liegeois D, Meda P, Durand B, Reith W: Novel function of the ciliogenic transcription factor RFX3 in development of the endocrine pancreas. Diabetes. 2007, 56: 950-959. 10.2337/db06-1187.

  27.

    Liu Y, Pathak N, Kramer-Zucker A, Drummond IA: Notch signaling controls the differentiation of transporting epithelia and multiciliated cells in the zebrafish pronephros. Development. 2007, 134: 1111-1122. 10.1242/dev.02806.

  28.

    Reith W, Mach B: The bare lymphocyte syndrome and the regulation of MHC expression. Annu Rev Immunol. 2001, 19: 331-373. 10.1146/annurev.immunol.19.1.331.

  29.

    Blackshear PJ, Graves JP, Stumpo DJ, Cobos I, Rubenstein JL, Zeldin DC: Graded phenotypic response to partial and complete deficiency of a brain-specific transcript variant of the winged helix transcription factor RFX4. Development. 2003, 130: 4539-4552. 10.1242/dev.00661.

  30.

    Zarbalis K, May SR, Shen Y, Ekker M, Rubenstein JL, Peterson AS: A focused and efficient genetic screening strategy in the mouse: identification of mutations that disrupt cortical development. PLoS Biol. 2004, 2: E219. 10.1371/journal.pbio.0020219.

  31.

    Araki R, Takahashi H, Fukumura R, Sun F, Umeda N, Sujino M, Inouye ST, Saito T, Abe M: Restricted expression and photic induction of a novel mouse regulatory factor X4 transcript in the suprachiasmatic nucleus. J Biol Chem. 2004, 279: 10237-10242. 10.1074/jbc.M312761200.

  32.

    Zhang D, Zeldin DC, Blackshear PJ: Regulatory factor X4 variant 3: A transcription factor involved in brain development and disease. J Neurosci Res. 2007,

  33.

    Haycraft CJ, Schafer JC, Zhang Q, Taulman PD, Yoder BK: Identification of CHE-13, a novel intraflagellar transport protein required for cilia formation. Exp Cell Res. 2003, 284: 249-261. 10.1016/S0014-4827(02)00089-7.

  34.

    Haycraft CJ, Swoboda P, Taulman PD, Thomas JH, Yoder BK: The C. elegans homolog of the murine cystic kidney disease gene Tg737 functions in a ciliogenic pathway and is disrupted in osm-5 mutant worms. Development. 2001, 128: 1493-1505.

  35.

    Schafer JC, Haycraft CJ, Thomas JH, Yoder BK, Swoboda P: xbx-1 encodes a dynein light intermediate chain (DLIC) required for retrograde intraflagellar transport and cilia assembly in C. elegans. Mol Biol Cell. 2003, 14: 2057-2070. 10.1091/mbc.E02-10-0677.

  36.

    Efimenko E, Bubb K, Mak HY, Holzman T, Leroux MR, Ruvkun G, Thomas JH, Swoboda P: Analysis of xbx genes in C. elegans. Development. 2005, 132: 1923-1934. 10.1242/dev.01775.

  37.

    Blacque OE, Perens EA, Boroevich KA, Inglis PN, Li C, Warner A, Khattra J, Holt RA, Ou G, Mah AK, et al: Functional genomics of the cilium, a sensory organelle. Curr Biol. 2005, 15: 935-941. 10.1016/j.cub.2005.04.059.

  38.

    Chen N, Mah A, Blacque OE, Chu J, Phgora K, Bakhoum MW, Hunt Newbury CR, Khattra J, Chan S, Go A, et al: Identification of ciliary and ciliopathy genes in Caenorhabditis elegans through comparative genomics. Genome Biol. 2006, 7: R126. 10.1186/gb-2006-7-12-r126.

  39.

    Fan Y, Esmail MA, Ansley SJ, Blacque OE, Boroevich K, Ross AJ, Moore SJ, Badano JL, May-Simera H, Compton DS, et al: Mutations in a member of the Ras superfamily of small GTP-binding proteins causes Bardet-Biedl syndrome. Nat Genet. 2004, 36: 989-993. 10.1038/ng1414.

  40.

    Richards S, Liu Y, Bettencourt BR, Hradecky P, Letovsky S, Nielsen R, Thornton K, Hubisz MJ, Chen R, Meisel RP, et al: Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. Genome Res. 2005, 15: 1-18. 10.1101/gr.3059305.

  41.

    Bergman CM, Pfeiffer BD, Rincon-Limas DE, Hoskins RA, Gnirke A, Mungall CJ, Wang AM, Kronmiller B, Pacleb J, Park S, et al: Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome. Genome Biol. 2002, 3: RESEARCH0086. 10.1186/gb-2002-3-12-research0086.

  42.

    Emberly E, Rajewsky N, Siggia ED: Conservation of regulatory elements between two species of Drosophila. BMC Bioinformatics. 2003, 4: 57. 10.1186/1471-2105-4-57.

  43.

    Berman BP, Pfeiffer BD, Laverty TR, Salzberg SL, Rubin GM, Eisen MB, Celniker SE: Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura. Genome Biol. 2004, 5: R61. 10.1186/gb-2004-5-9-r61.

  44.

    Andersen JS, Wilkinson CJ, Mayor T, Mortensen P, Nigg EA, Mann M: Proteomic characterization of the human centrosome by protein correlation profiling. Nature. 2003, 426: 570-574. 10.1038/nature02166.

  45.

    Keller LC, Romijn EP, Zamora I, Yates JR, Marshall WF: Proteomic analysis of isolated Chlamydomonas centrioles reveals orthologs of ciliary-disease genes. Curr Biol. 2005, 15: 1090-1098. 10.1016/j.cub.2005.05.024.

  46.

    Gherman A, Davis EE, Katsanis N: The ciliary proteome database: an integrated community resource for the genetic and functional dissection of cilia. Nat Genet. 2006, 38: 961-962. 10.1038/ng0906-961.

  47.

    Inglis PN, Boroevich KA, Leroux MR: Piecing together a ciliome. Trends Genet. 2006, 22: 491-500. 10.1016/j.tig.2006.07.006.

  48.

    The Ciliary Proteome Database. []

  49.

    The Ciliome Database. []

  50.

    Hou Y, Qin H, Follit JA, Pazour GJ, Rosenbaum JL, Witman GB: Functional analysis of an individual IFT protein: IFT46 is required for transport of outer dynein arms into flagella. J Cell Biol. 2007, 176: 653-665. 10.1083/jcb.200608041.

  51.

    Bell LR, Stone S, Yochem J, Shaw JE, Herman RK: The molecular identities of the Caenorhabditis elegans intraflagellar transport genes dyf-6, daf-10 and osm-1. Genetics. 2006, 173: 1275-1286. 10.1534/genetics.106.056721.

  52.

    Eberl DF, Hardy RW, Kernan MJ: Genetically similar transduction mechanisms for touch and hearing in Drosophila. J Neurosci. 2000, 20: 5981-5988.

  53.

    Bahe S, Stierhof YD, Wilkinson CJ, Leiss F, Nigg EA: Rootletin forms centriole-associated filaments and functions in centrosome cohesion. J Cell Biol. 2005, 171: 27-33. 10.1083/jcb.200504107.

  54.

    Yang J, Liu X, Yue G, Adamian M, Bulgakov O, Li T: Rootletin, a novel coiled-coil protein, is a structural component of the ciliary rootlet. J Cell Biol. 2002, 159: 431-440. 10.1083/jcb.200207153.

  55.

    Yang J, Gao J, Adamian M, Wen XH, Pawlyk B, Zhang L, Sanderson MJ, Zuo J, Makino CL, Li T: The ciliary rootlet maintains long-term stability of sensory cilia. Mol Cell Biol. 2005, 25: 4129-4137. 10.1128/MCB.25.10.4129-4137.2005.

  56.

    Yang J, Adamian M, Li T: Rootletin interacts with C-Nap1 and may function as a physical linker between the pair of centrioles/basal bodies in cells. Mol Biol Cell. 2006, 17: 1033-1040. 10.1091/mbc.E05-10-0943.

  57.

    Andrade YN, Fernandes J, Vazquez E, Fernandez-Fernandez JM, Arniges M, Sanchez TM, Villalon M, Valverde MA: TRPV4 channel is involved in the coupling of fluid viscosity changes to epithelial ciliary activity. J Cell Biol. 2005, 168: 869-874. 10.1083/jcb.200409070.

  58.

    Reiter JF, Skarnes WC: Tectonic, a novel regulator of the Hedgehog pathway required for both activation and inhibition. Genes Dev. 2006, 20: 22-27. 10.1101/gad.1363606.

  59.

    Baron DM, Ralston KS, Kabututu ZP, Hill KL: Functional genomics in Trypanosoma brucei identifies evolutionarily conserved components of motile flagella. J Cell Sci. 2007, 120: 478-491. 10.1242/jcs.03352.

  60.

    Niu Y, Murata T, Watanabe K, Kawakami K, Yoshimura A, Inoue J, Puri RK, Kobayashi N: MIP-T3 associates with IL-13Ralpha1 and suppresses STAT6 activation in response to IL-13 stimulation. FEBS Lett. 2003, 550: 139-143. 10.1016/S0014-5793(03)00860-3.

  61.

    Laoukili J, Perret E, Willems T, Minty A, Parthoens E, Houcine O, Coste A, Jorissen M, Marano F, Caput D, et al: IL-13 alters mucociliary differentiation and ciliary beating of human respiratory epithelial cells. J Clin Invest. 2001, 108: 1817-1824. 10.1172/JCI200113557.

  62.

    Skowron M, Perret E, Marano F, Caput D, Tournier F: Interleukin-13 alters mucociliary differentiation of human nasal epithelial cells. Chest. 2003, 123: 373S-374S. 10.1378/chest.123.3_suppl.373S.

  63.

    Takemaru K, Yamaguchi S, Lee YS, Zhang Y, Carthew RW, Moon RT: Chibby, a nuclear beta-catenin-associated antagonist of the Wnt/Wingless pathway. Nature. 2003, 422: 905-909. 10.1038/nature01570.

  64.

    Li FQ, Singh AM, Mofunanya A, Love D, Terada N, Moon RT, Takemaru K: Chibby promotes adipocyte differentiation through inhibition of β-catenin signaling. Mol Cell Biol. 2007, 27: 4347-4354. 10.1128/MCB.01640-06.

  65.

    Singh AM, Li FQ, Hamazaki T, Kasahara H, Takemaru K, Terada N: Chibby, an antagonist of the Wnt/beta-catenin pathway, facilitates cardiomyocyte differentiation of murine embryonic stem cells. Circulation. 2007, 115: 617-626. 10.1161/CIRCULATIONAHA.106.642298.

  66.

    The Berkeley Drosophila Genome Project. []

  67.

    Uga S, Kuwabara M: On the fine structure of the chordotonal sensillum in antenna of Drosophila melanogaster. J Electron Microsc. 1965, 14: 173-181.

  68.

    Moulins M: Ultrastructure of chordotonal organs. Structure and Function of Proprioceptors in the Invertebrates. Edited by: Mill PJ. 1976, London: Chapman and Hall, 387-426.

  69.

    Husain N, Pellikka M, Hong H, Klimentova T, Choe KM, Clandinin TR, Tepass U: The agrin/perlecan-related protein eyes shut is essential for epithelial lumen formation in the Drosophila retina. Dev Cell. 2006, 11: 483-493. 10.1016/j.devcel.2006.08.012.

  70.

    Dawe HR, Smith UM, Cullinane AR, Gerrelli D, Cox P, Badano JL, Blair-Reid S, Sriram N, Katsanis N, Attie-Bitach T, et al: The Meckel-Gruber Syndrome proteins MKS1 and meckelin interact and are required for primary cilium formation. Hum Mol Genet. 2007, 16: 173-186. 10.1093/hmg/ddl459.

  71.

    Markstein M, Levine M: Decoding cis-regulatory DNAs in the Drosophila genome. Curr Opin Genet Dev. 2002, 12: 601-606. 10.1016/S0959-437X(02)00345-3.

  72.

    Zhao G, Schriefer LA, Stormo GD: Identification of muscle-specific regulatory modules in Caenorhabditis elegans. Genome Res. 2007, 17: 348-357. 10.1101/gr.5989907.

  73.

    Ray K, Perez SE, Yang Z, Xu J, Ritchings BW, Steller H, Goldstein LS: Kinesin-II is required for axonal transport of choline acetyltransferase in Drosophila. J Cell Biol. 1999, 147: 507-518. 10.1083/jcb.147.3.507.

  74.

    Ou G, Koga M, Blacque OE, Murayama T, Ohshima Y, Schafer JC, Li C, Yoder BK, Leroux MR, Scholey JM: Sensory ciliogenesis in Caenorhabditis elegans: assignment of IFT components into distinct modules based on transport and phenotypic profiles. Mol Biol Cell. 2007, 18: 1554-1569. 10.1091/mbc.E06-09-0805.

  75.

    Basto R, Lau J, Vinogradova T, Gardiol A, Woods CG, Khodjakov A, Raff JW: Flies without centrioles. Cell. 2006, 125: 1375-1386. 10.1016/j.cell.2006.05.025.

  76.

    Peel N, Stevens NR, Basto R, Raff JW: Overexpressing centriole-replication proteins in vivo induces centriole overduplication and de novo formation. Curr Biol. 2007, 17: 834-843. 10.1016/j.cub.2007.04.036.

  77.

    Rodrigues-Martins A, Bettencourt-Dias M, Riparbelli M, Ferreira C, Ferreira I, Callaini G, Glover DM: DSAS-6 organizes a tube-like centriole precursor, and its absence suggests modularity in centriole assembly. Curr Biol. 2007, 17: 1465-1472. 10.1016/j.cub.2007.07.034.

  78.

    Rodrigues-Martins A, Riparbelli M, Callaini G, Glover DM, Bettencourt-Dias M: Revisiting the role of the mother centriole in centriole biogenesis. Science. 2007, 316: 1046-1050. 10.1126/science.1142950.

  79.

    Gopfert MC, Robert D: Motion generation by Drosophila mechanosensory neurons. Proc Natl Acad Sci USA. 2003, 100: 5514-5519. 10.1073/pnas.0737564100.

  80.

    Moran DT, Varela FJ, Rowley JC: Evidence for active role of cilia in sensory transduction. Proc Natl Acad Sci USA. 1977, 74: 793-797. 10.1073/pnas.74.2.793.

  81.

    Haycraft CJ, Banizs B, Aydin-Son Y, Zhang Q, Michaud EJ, Yoder BK: Gli2 and Gli3 localize to cilia and require the intraflagellar transport protein polaris for processing and function. PLoS Genet. 2005, 1: e53. 10.1371/journal.pgen.0010053.

  82.

    Huangfu D, Anderson KV: Signaling from Smo to Ci/Gli: conservation and divergence of Hedgehog pathways from Drosophila to vertebrates. Development. 2006, 133: 3-14. 10.1242/dev.02169.

  83.

    Caspary T, Larkins CE, Anderson KV: The graded response to Sonic Hedgehog depends on cilia architecture. Dev Cell. 2007, 12: 767-778. 10.1016/j.devcel.2007.03.004.

  84.

    Rebeiz M, Posakony JW: GenePalette: a universal software tool for genome sequence visualization and analysis. Dev Biol. 2004, 271: 431-438. 10.1016/j.ydbio.2004.04.011.

  85.

    Celniker SE, Wheeler DA, Kronmiller B, Carlson JW, Halpern A, Patel S, Adams M, Champe M, Dugan SP, Frise E, et al: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence. Genome Biol. 2002, 3: RESEARCH0079. 10.1186/gb-2002-3-12-research0079.

  86.

    VISTA. []

  87.

    Marshall WF: Human cilia proteome contains homolog of zebrafish polycystic kidney disease gene qilin. Curr Biol. 2004, 14: R913-914. 10.1016/j.cub.2004.10.011.

  88.

    Loppin B, Lepetit D, Dorus S, Couble P, Karr TL: Origin and neofunctionalization of a Drosophila paternal effect gene essential for zygote viability. Curr Biol. 2005, 15: 87-93. 10.1016/j.cub.2004.12.071.

  89.

    Spradling AC: P element mediated transformation. Drosophila: a Practical Approach. Edited by: Roberts D. 1986, Oxford: IRL Press, 175-189.

  90.

    Rothwell WF, Sullivan W: Fluorescent analysis of Drosophila embryos. Drosophila Protocols. Edited by: Sullivan W, Ashburner M, Hawley RS. 2000, Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, 141-158.

  91.

    Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I: VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004, 32: W273-279. 10.1093/nar/gkh458.

  92.

    Han YG, Kwok BH, Kernan MJ: Intraflagellar transport is required in Drosophila to differentiate sensory cilia but not sperm. Curr Biol. 2003, 13: 1679-1686. 10.1016/j.cub.2003.08.034.

  93.

    Murayama T, Toh Y, Ohshima Y, Koga M: The dyf-3 gene encodes a novel protein required for sensory cilium formation in Caenorhabditis elegans. J Mol Biol. 2005, 346: 677-687. 10.1016/j.jmb.2004.12.005.

  94.

    Efimenko E, Blacque OE, Ou G, Haycraft CJ, Yoder BK, Scholey JM, Leroux MR, Swoboda P: Caenorhabditis elegans DYF-2, an orthologue of human WDR19, is a component of the intraflagellar transport machinery in sensory cilia. Mol Biol Cell. 2006, 17: 4801-4811. 10.1091/mbc.E06-04-0260.

  95.

    Yu H, Pretot RF, Burglin TR, Sternberg PW: Distinct roles of transcription factors EGL-46 and DAF-19 in specifying the functionality of a polycystin-expressing sensory neuron necessary for C. elegans male vulva location behavior. Development. 2003, 130: 5217-5227. 10.1242/dev.00678.

  96.

    Winkelbauer ME, Schafer JC, Haycraft CJ, Swoboda P, Yoder BK: The C. elegans homologs of nephrocystin-1 and nephrocystin-4 are cilia transition zone proteins involved in chemosensory perception. J Cell Sci. 2005, 118: 5575-5587. 10.1242/jcs.02665.



Work in the laboratory of BD was supported by the CNRS, the ACI "Jeune Chercheur" and ACI "Biologie du Développement" and by a Grant "ANR Maladies Rares" (n° 930 AR 17). R Dubruille was supported by a doctoral fellowship from the French Ministry of Education and Research and a fellowship from the Fondation pour la Recherche Médicale. G Grenier was supported by a doctoral fellowship from the Région Rhône-Alpes and the Fondation pour la Recherche Médicale. Work in the laboratory of PS was supported by the Swedish Research Council (VR) and by the Swedish Foundation for Strategic Research (SSF). Special thanks go to Stephen Richards and William Gilbert, who provided files prior to public release. We are indebted to Laurent Duret for precious help with genome comparisons and programming. We wish to thank Mélodie Robach, Maria Ouzounova and Abdelkader Selmi for their technical contribution during undergraduate internships and Jérome Schmitt for helpful technical assistance with the fly stocks. Confocal microscopy observations were performed at the CTμ of the Université Lyon 1. We also wish to thank Joelle Thomas for careful reading of the manuscript.

Author information



Corresponding author

Correspondence to Anne Laurençon.

Additional information

Authors' contributions

A.L. and B.D. designed the experiments and wrote, revised and completed the manuscript. A.L., E.E. and P.S. performed the genome-wide X-box search. A.L., R.D. and G.G. identified the first set of target genes. A.L., R.B., V.R. and E.C. carried out the fly genetic experiments. A.L., V.R. and R.B. performed the cytology.

Electronic supplementary material

Additional data file 1: Our X-box search across D. melanogaster and D. pseudoobscura species identified 412 genes with a conserved X-box both in sequence and distance upstream of the ATG of homologous genes between the two fly genomes. (XLS 102 KB)

Additional data file 2: Genes presented in this table are homologous to proteins identified as putative or confirmed ciliary or basal body components. They are sorted as follows: a first group of genes with annotated molecular functions, a second group of genes for which homologs in vertebrates have been reported, a third group of genes with no vertebrate homolog. Each category is sorted by the number of studies reporting each gene or its homolog. (XLS 258 KB)

Additional data file 3: Number of Drosophila genes homologous to ciliary genes identified in previously published studies. (DOC 54 KB)

About this article

Cite this article

Laurençon, A., Dubruille, R., Efimenko, E. et al. Identification of novel regulatory factor X (RFX) target genes by comparative genomics in Drosophila species. Genome Biol 8, R195 (2007).


  • Green Fluorescent Protein
  • Additional Data File
  • Green Fluorescent Protein Expression
  • Chordotonal Organ
  • Sensory Cilium