Sequencing three crocodilian genomes to illuminate the evolution of archosaurs and amniotes
- John A St John1,
- Edward L Braun2,
- Sally R Isberg3, 5,
- Lee G Miles5,
- Amanda Y Chong5,
- Jaime Gongora5,
- Pauline Dalzell4, 5,
- Christopher Moran5,
- Bertrand Bed'Hom6,
- Arkhat Abzhanov7,
- Shane C Burgess8,
- Amanda M Cooksey8,
- Todd A Castoe9,
- Nicholas G Crawford10,
- Llewellyn D Densmore11,
- Jennifer C Drew12,
- Scott V Edwards7,
- Brant C Faircloth13,
- Matthew K Fujita7,
- Matthew J Greenwold14,
- Federico G Hoffmann8, 15,
- Jonathan M Howard16,
- Taisen Iguchi17,
- Daniel E Janes18, 19,
- Shahid Yar Khan1,
- Satomi Kohno20,
- AP Jason de Koning9,
- Stacey L Lance21,
- Fiona M McCarthy22,
- John E McCormack23,
- Mark E Merchant24,
- Daniel G Peterson8, 25,
- David D Pollock9,
- Nader Pourmand1,
- Brian J Raney26,
- Kyria A Roessler1,
- Jeremy R Sanford16,
- Roger H Sawyer14,
- Carl J Schmidt27,
- Eric W Triplett28,
- Tracey D Tuberville21,
- Miryam Venegas-Anaya11,
- Jason T Howard29,
- Erich D Jarvis29,
- Louis J GuilletteJr20,
- Travis C Glenn30,
- Richard E Green1 and
- David A Ray8, 15Email author
© BioMed Central Ltd. 2012
Published: 31 January 2012
The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described.
KeywordsGenomics evolution Crocodylia Archosauria amniote
The importance of reptilian genomics
Order Crocodylia is a key group within Reptilia and genome drafts from crocodilians would provide insights into ancestral reptilian and amniote genomes. These genome assemblies will also enable more detailed inferences on the evolution of three additional lineages of substantial interest to vertebrate biologists: dinosaurs, pterosaurs and birds. Crocodilians and birds are the only extant members of Archosauria (a clade that also includes dinosaurs and pterosaurs along with several extinct lineages) . Among archosaurs, only the genomes of chicken (Gallus gallus ), turkey (Meleagris gallopavo ) and zebra finch (Taeniopygia guttata ) have been sequenced, although several additional avian genomes, such as the Mallard duck (Anas platyrhynchos , budgerigar (Melopsittacus undulatus, a type of parrot) and a set of other avian taxa  are currently underway . Crocodilians are the best extant outgroup for comparative analysis of avian genomes, and, as such, would substantially enhance analyses of the large set of bird genomes that are expected to be available shortly. Avian and crocodilian genomes provide the best hope for elucidating the gene and genomic properties of dinosaurs and other extinct archosaurs, about which we have learned surprising amounts (for example, genome size and limited protein sequences) considering we have no access to the DNA of these organisms [15–19]. In the broadest sense, Crocodylia represent an important vertebrate clade, and their genomes hold information that will illuminate the underlying relationships among all amniotes. In addition, crocodilians present several interesting biological questions that can be approached from a genomic perspective, many of these will be discussed below.
Background on crocodilians and project justification
The order Crocodylia, which typically refers to the clade that includes the extant crocodilians , is an ecologically successful group of reptiles that originated in the mid- to upper-Cretaceous period (approximately100 Mya) [21, 22]. Crocodilians are apex predators in the marine and freshwater habitats where they reside. They play a major role in warm-water ecosystems throughout the world. Extant crocodilians are members of a larger group, termed the Crocodylomorpha, that appeared in the fossil record by the upper Triassic (about 200-250 Mya) [8, 1], a date coincident with molecular estimates of the avian-crocodilian divergence [2, 22, 23]. Crocodylia is divided into three families with extant members, Alligatoridae (alligators and caimans), Crocodylidae (crocodiles) and Gavialidae (gharials) [21, 23]; the Gavialidae are traditionally thought to be the outgroup of a clade comprising Alligatoridae and Crocodylidae . However, recent phylogenetic analyses of both molecular data [22, 24] and combined molecular and morphological data  support a closer relationship between Crocodylidae and Gavialidae (Figure 1).
Crocodilians have been a part of the human narrative for centuries, appearing in modern popular culture (for example, the wildlife documentary series The Crocodile Hunter), scientific documentaries, as ancient mummies and in cave paintings. They are prized for their hides and meat, and some species, such as the American alligator, the Nile crocodile (Crocodylus niloticus) and the saltwater crocodile, are ranched (that is their eggs are brought in from the wild) and/or farmed (in which captive breeding stock produce the eggs). Globally, crocodilians are a source of trade worth more than $US500 million . However, crocodilians likely have their most profound economic impact as tourist attractions [27, 28]. Thoughtful ecotourism could be the best hope for saving endangered crocodilians, such as the critically endangered gharial, from extinction and their habitats from destruction.
Given their popularity, their status as the sister group of dinosaurs, and their inherent public fascination, efforts focused on crocodilian genomics are ideally suited for education and outreach focused on evolution and comparative genomics. Indeed, the preliminary data from our efforts has been used in a pilot genomics course at the University of Florida that integrates with undergraduate research. The consortium plans to make material for genomics pedagogy and public outreach available in parallel with the release of the genome assemblies.
In addition to their ecological, sociological and economic significance, crocodilians have genomes that will be useful sources of data for biological and biomedical research. Alligator serum has been shown to contain broad spectrum antibiotic peptides [29–32]. The American alligator has been used extensively as a model for examining the environmental impact of various contaminants, including endocrine disrupting xenobiotics [33–36]. Crocodilians represent important research organisms for diverse fields that include evolution and phylogenetics [25, 37–39], functional morphology [37, 40], osmoregulation , sex determination [41–45], hybridization [46–48] and population genetics [49–51]. To provide the genomic resources necessary to expand our understanding of these fascinating organisms, the ICGWG is obtaining and assembling genome sequences for the American alligator, saltwater crocodile, and gharial, one representative from each of the extant crocodilian families. For further information about the project and preliminary assemblies, see Ref. .
Properties of crocodilian genomes and available genomic resources
Short of whole genome sequencing, much work has been done on crocodilian genomes, especially the American alligator and Australian saltwater crocodile. The genome of the American alligator is approximately 2.5 gigabases  comprising 16 pairs of chromosomes [54, 55]. The genome size of the saltwater crocodile is around 2.78 gigabases  with 17 pairs of chromosomes [54, 57]. The genome size of the gharial is currently unknown, although it is likely to be approximately 2-3 gigabases, given the genome sizes of other crocodilians. Like the American alligator, the gharial has 16 chromosomes . Unlike organisms with genetic sex-determination systems, crocodilians are not thought to have sex chromosomes . Instead sex is determined by incubation temperature of the egg . Although microchromosomes are common among other reptiles (including birds), and there is striking variation in chromosome sizes within crocodilians, the smallest crocodilian chromosomes are not generally regarded as small enough to be classified as microchromosomes [54, 58, 57, 55].
Libraries of bacterial artificial chromosomes (BACs) are available for all three species of interest and these will be used for each genome project. The American alligator BAC library currently has about 10× clone coverage , the saltwater crocodile library has approximately 3.7× clone coverage  and the gharial library has about 5.7× clone coverage, assuming it is a 2.7 gigabase genome (X. Shan, unpublished data). Several large-scale nucleotide datasets have been collected for the American alligator, including 21 assembled BAC sequences completed through the NISC Comparative Sequencing Initiative , and 3,276 Sanger BAC-end reads . A linkage map based on microsatellite loci  for the saltwater crocodile is also available. Additionally some saltwater crocodile microsatellite loci have been mapped by fluorescence in situ hybridization (FISH) to physical chromosomes using fosmids and BACs ( and P. Dalzell unpublished data), which will facilitate anchoring portions of the genome assembly to chromosomes.
In addition to genomic sequences and mapping information, both Sanger and 454 transcriptome data for the crocodile and alligator are available [63, 64]. Transcriptome data will be further augmented by a diversity of tissue-specific cDNA libraries from multiple species that will be sequenced using Illumina RNA-seq to assist gene annotations. The cDNA sequences will also enable further scaffold ordering and orientation for transcripts that are split between multiple genomic fragments . We will use these legacy and new data to further improve the initial de novo assemblies. To view the preliminary assemblies, see Ref. .
Sequencing strategy for the three crocodilian genomes
Owing to the availability of diverse legacy data, we are pursuing different strategies for the sequencing and assembly of each genome, as described below.
For the American alligator genome, we are following the Allpaths-LG recommended pipeline  of a combination of high coverage pairs of overlapping reads with a second, moderate coverage, longer insert mate-pair library. This pipeline has yielded good results with a variety of assemblies including de novo reassemblies of mouse and human , and was successfully employed in an independently evaluated genome assembly contest . We have combined approximately 50× coverage from an overlapping, Illumina, short-insert library with about 20× coverage from an Illumina 2 kbp mate-pair library. To investigate genetic variation and increase coverage, we will combine these reads with a set of short, non-overlapping 2 × 100 bp Illumina reads at approximately 50× coverage. In addition to providing deeper coverage, these data will also provide information about genetic variation in American alligators due to single nucleotide polymorphism differences between the diploid chromosomes of an individual. We will further scaffold the assembly using low coverage BAC-end sequences, and we will carry out FISH mapping to assign scaffolds to chromosomes.
To sequence the saltwater crocodile genome, we are combining high coverage Illumina short insert sequencing with low coverage 454 libraries in a hybrid approach, similar to that used for the turkey genome . We currently have about 80× coverage from a non-overlapping, short-insert library and an additional 40× from an overlapping short-insert library. We also plan to generate about 20× coverage from an Illumina 2 kbp mate-pair library. To supplement the Illumina data, we have generated 1× coverage of unpaired 454 reads (about 700 bp in length), and plan to generate an additional 2× coverage from 3 kbp and 6 kbp paired 454 reads. We will also end-sequence the crocodile BAC library using a method similar to the fosmid-based ShARC method described by Gnerre et al. . Some of these BACs are known to contain microsatellite DNA markers used in the crocodile linkage map  and others have already been FISH mapped to chromosomes in the crocodile . We will integrate this information for scaffolding and assigning scaffolds to chromosomes. As with the American alligator genome, we are also generating transcriptome data for the saltwater crocodile for both annotation and scaffolding purposes. We will also use the 454 brain transcriptome data that exists for the American alligator  and the Nile crocodile  in our analyses. We will use these EST and RNA-seq data, along with the other resources described above, to further order and orient scaffolds within the assembly.
Finally, we will assemble the gharial genome using a hybrid approach similar to that used for the saltwater crocodile. To do this, we have generated 40× coverage from an overlapping short-insert library. This will be combined with sequences from 400 bp and 700 bp paired-end Illumina libraries sequenced to give approximately 30× coverage, as well as 2-3× genome coverage consisting of 454 shotgun reads and 3 kbp and 6 kbp paired-end 454 libraries with FLX+ reads. Finally we will generate approximately 20× coverage from an Illumina 2 kbp mate-pair library. The gharial is a critically endangered species, making it nearly impossible to collect a wide variety of tissues for transcriptome data. Nonetheless, we have collected blood, which will be used to generate Illumina RNA-seq data. As with the American alligator and saltwater crocodile, we will use de novo assembled transcripts to improve the assembly.
Project timeline and goals
The first phase of our sequencing effort, in which we generate high coverage short insert and overlapping libraries, has been completed for American alligator and saltwater crocodile and is ongoing for the Indian gharial. The data generated for alligator and crocodile were used to generate early draft assemblies for those genomes. The second phase will involve generating longer distance mate-pair libraries and BAC-end sequences to improve the assemblies. We plan to have the data gathered for this phase by mid-March 2012. The third and final phase will involve FISH mapping the BACs to assign scaffolds to chromosomes. When all three phases are completed the assemblies should be as contiguous as possible, given the combination of high coverage short distance information generated in phase one with lower coverage long distance information generated in phase two. The third phase is not critical for the most pressing questions involving crocodilian genomics; individual genes and their regulatory regions will be of primary interest, as opposed to the long-range linkage required for identifying selective sweeps. Thus we will proceed with this third phase in parallel with our other comparative genomic analyses. Once the three genomes are assembled, we will perform comparative genomic analyses both within Order Crocodylia, and among crocodilians and other members of Reptilia.
The completion of each of these phases will be publicly communicated via the website, and links to the data and assemblies will be available to researchers with restrictions as detailed below. We anticipate data collection and initial analyses to be complete by June 2012, and we plan to submit the genome paper within one year of finalizing these initial analyses. The Toronto Statement  suggests that there be a one-year period of initial analyses and publication, after which the broader community would be free to use this data in an unrestricted manner. Precise dates at which we complete data collection and initial analysis, and thus the beginning of the embargo period on the genome data, will be promptly posted on the website .
Status of the current preliminary genome assemblies
Overview of the current draft assembliesa.
Estimated Length (Gbp)
Assembly Length (Gbp)
Estimated % Coverage
Contig N50 (Kbp)
Contig N90 (Kbp)
Scaffold N50 (Kbp)
Scaffold N90 (Kbp)
Quality control of intermediate assemblies and raw data
We will employ additional quality metrics to detect and describe the collapse of segmental duplications within our assemblies. Specifically, read-depth is a sensitive measure of this assembly artifact. Preliminary analysis suggests that such artifacts are not common in alligator or crocodile genomes (data not shown). We will employ a final form of quality control by examining the relative synteny of our three crocodilian candidate assemblies. Because alligators, crocodiles, and gharials appear to have undergone few chromosome-level rearrangements , we expect a high level of synteny between accurate assemblies. Once we begin scaffolding all of our assemblies with longer mate-pair and BAC data, we will assess their relative quality by measuring the effect on overall crocodilian synteny.
Planned analyses and experiments
Here we outline major questions, types of analyses and analytical goals that will be included in the core publication of these completed genomes. The Toronto Statement  suggests these questions should be articulated to identify these topics as embargoed during preparation of the genome publication. The ICGWG will address a number of research questions at both the level of genome evolution and crocodilian biology that we describe below.
A crucial step in making genome resources useful to the scientific community is generating gene annotations. We will perform gene finding for crocodilians using the Ensembl  and Augustus  annotation pipelines and combine the output. We will also partner with groups sequencing additional avian genomes and update the crocodile annotations as needed. Gene finders will initially be trained using the chicken genome and the results from the pipelines will be compared to identify accuracy at both the gene and exon level. Genes will be assigned standardized gene nomenclature based on chicken gene names where there is an unambiguous 1:1 functional ortholog, or a gene identifier in cases where this is not possible. We will also provide preliminary functional annotation for proteins and transcripts using standard Gene Ontology Consortium methods, including functional analysis of motifs and domains and manual curation of orthologs. The ICGWG will perform these analyses to complement and extend those performed by NCBI and Ensembl once the draft genomes are submitted to those organizations.
One major focus will be the large-scale structure of crocodilian genomes, focusing on the degree of syntenic conservation at different scales within these genomes. Karyotype analysis suggests a remarkable conservation of synteny among crocodilians, with the alligator and crocodile having undergone fewer than five chromosomal rearrangements visible at the microscopic level  despite 80 million years of evolutionary divergence. However, the level of syntenic conservation at small scales within these genomes remains unclear, and we expect our genome assemblies to illuminate this topic. Microchromosomes are absent in crocodilians [54, 55, 59] but present in birds, lizards and snakes, tuatara, and turtles [4, 84]. This absence in crocodilians almost certainly represents a derived feature of crocodilians. We will examine the fate of these genetic units within crocodilian genomes. Do microchromosomes comprise linked components within the genomes of the only major reptilian clade without microchromosomes?
Recent work showed that the lizard, Anolis carolinensis, unlike other amniotes sequenced to date (with the possible exception of turtles ), has a homogeneous genome that lacks GC-rich isochores [76, 4]. Our preliminary analyses indicate that crocodilians have a higher GC-content and greater heterogeneity than Anolis (Figure 4), but these analyses are less clear regarding the scale of the observed GC-content variation. Do crocodilians have GC-rich isochores that are similar to those in mammals and birds or do the patterns of GC-content heterogeneity appear distinct?
We will also carry out a number of traditional analyses of genome content using the crocodilian genomes, focusing on repeated sequences and gene families. These analyses include the evolution of repeat families and patterns of TE proliferation. We will compare the repeat family content within crocodilian genomes and with other reptiles and amniotes. Additionally, we will conduct analyses of gene family evolution within reptiles and crocodilians to identify specific genes and other functional elements, including the identification of ultra-conserved regions and potential micro RNA sequences, with a special focus on those sequences that could have been gained or lost both within the crocodilians and in comparison to the other relevant lineages that are now available for investigation.
We will use these three crocodilian genomes to infer their ancestral genome. This, combined with existing and soon to be released bird genomes, will enable some inference of the ancestral archosaur genome. Reconstructing the ancestral archosaur genome has obvious implications for expanding our understanding of the genomes of extinct archosaurs, like the non-bird dinosaurs and pterosaurs (Figure 1).
There are also several biological questions specific to crocodilians that we will address by analyzing genomic and RNA-seq data and via experimental techniques. For example, despite having a temperature-dependent sex-determination system seemingly without sex chromosomes, the sexes of crocodilians have been shown to have very different recombination rates . Identification of the genes that are differentially expressed in the male and female crocodilian gonads might provide insight into the perplexing observation.
SNP discovery arising from the genome sequencing is particularly relevant to farm-bred saltwater crocodiles. Large panels of SNP markers will enable more refined linkage maps , more precise mapping of quantitative trait loci (QTL) than is currently possible with microsatellite markers  and eventually the implementation of genomic selection in crocodile breeding programs.
Eventually members of the ICGWG hope to address additional questions beyond the scope of the initial genome paper. These might be presented in satellite publications. One of these involves the sex determination system of American alligators. Which genes are the initial temperature sensitive regulators that trigger the downstream, largely conserved  sex-determination system? Having the genome sequences available for these three crocodilians will enable a new wave of discoveries about the evolutionary histories of crocodilians, non-avian reptiles and birds, and amniotes generally.
How other groups can join the consortium, or publish independently with our early release data
This project is affiliated with the Genome 10 K (G10K) initiative . We invite other G10K affiliates and the broader scientific community to access and make use of the draft assembly and raw read data that we have produced. Any group performing non-genome-scale analyses that are sufficiently independent of the analyses described above are welcome to use these data without restriction. As a matter of courtesy and to avoid duplicated effort, we request that competing genome-scale projects or analyses that overlap with the areas stated above disclose their status to the ICGWG consortium (formal inquiries and requests to join the working group should be made to D.A.R.) and cite this and subsequent papers that provide the data. Versioned assemblies, further project description, and a complete list of current ICGWG members can be accessed on the website dedicated to this project .
This work was supported by grants to D.A.R. (MCB-1052500, MCB-0841821, DEB-1020865 from the U.S. National Science Foundation) and funds from the Institute for Genomics, Biocomputing and Biotechnology at Mississippi State University. E.L.B., E.W.T., and collaborators at the University of Florida were supported by funds from the U.S. National Science Foundation (DUE-0920151). T.I. received financial support from the National Institute for Basic Biology and Grants-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology of Japan. S.R.I., L.G.M., J.G., P.D. and C.M. were supported by Australian Rural Industries Research and Development Corporation grants (RIRDC PRJ-000549, RIRDC PRJ-005355, RIRDC PRJ-002461). M.K.F. received financial support from a U.S. National Science Foundation Biological Informatics Postdoctoral Fellowship (DBI-0905714). R.E.G. is a Searle Scholar and a Sloan Fellow. E.D.J. was supported by the Howard Hughes Medical Institute and the National Institutes of Health. We are grateful to Kent Vliet (University of Florida) and the Alligator Farm (St. Augustine, Florida) for providing access to fresh gharial blood.
- Hedges SB, Kumar S: The Timetree of Life. 2009, Oxford University Press, USAGoogle Scholar
- Katsu Y, Braun EL, Guillette LJ, Iguchi T: From reptilian phylogenomics to reptilian genomes: analyses of c-Jun and DJ-1 proto-oncogenes. Cytogenet Genome Res. 2009, 127: 79-93. 10.1159/000297715.PubMedView ArticleGoogle Scholar
- Janes DE, Organ CL, Fujita MK, Shedlock AM, Edwards SV: Genome evolution in Reptilia, the sister group of mammals. Annu Rev Genomics Hum Genet. 2010, 11: 239-264. 10.1146/annurev-genom-082509-141646.PubMedView ArticleGoogle Scholar
- Alföldi J, di Palma F, Grabherr M, Williams C, Kong L, Mauceli E, Russell P, Lowe CB, Glor RE, Jaffe JD, Ray DA, Boissinot S, Shedlock AM, Botka C, Castoe TA, Colbourne JK, Fujita MK, Moreno RG, Hallers ten BF, Haussler D, Heger A, Heiman D, Janes DE, Johnson J, de Jong PJ, Koriabine MY, Lara M, Novick PA, Organ CL, Peach SE, et al: The genome of the green anole lizard and a comparative analysis with birds and mammals. Nature. 2011, 477: 587-591. 10.1038/nature10390.PubMedPubMed CentralView ArticleGoogle Scholar
- NHGRI Genome Sequencing Proposals. [http://www.genome.gov]
- Castoe TA, Bronikowski AM, Brodie ED, Edwards SV, Pfrender ME, Shapiro MD, Pollock DD, Warren WC: A proposal to sequence the genome of a garter snake (Thamnophis sirtalis). Stand Genomic Sci. 2011, 4: 257-270. 10.4056/sigs.1664145.PubMedPubMed CentralView ArticleGoogle Scholar
- Castoe TA, de Koning AJ, Hall KT, Yokoyama KD, Gu W, Smith EN, Feschotte C, Uetz P, Ray DA, Dobry J, Bogden R, Mackessy SP, Bronikowski AM, Warren WC, Secor SM, Pollock DD: Sequencing the genome of the Burmese python (Python molurus bivittatus) as a model for studying extreme adaptations in snakes. Genome Biol. 2011, 12: 406-10.1186/gb-2011-12-7-406.PubMedPubMed CentralView ArticleGoogle Scholar
- Brusatte S, Benton M, Desojo J, Langer M: The higher-level phylogeny of Archosauria (Tetrapoda: Diapsida). J Syst Paleontol. 2010, 8: 3-47. 10.1080/14772010903537732.View ArticleGoogle Scholar
- International Chicken Genome Sequencing Consortium: Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432: 695-716. 10.1038/nature03154.View ArticleGoogle Scholar
- Dalloul RA, Long JA, Zimin AV, Aslam L, Beal K, Blomberg LA, Bouffard P, Burt DW, Crasta O, Crooijmans RPMA, Cooper K, Coulombe RA, De S, Delany ME, Dodgson JB, Dong JJ, Evans C, Frederickson KM, Flicek P, Florea L, Folkerts O, Groenen MAM, Harkins TT, Herrero J, Hoffmann S, Megens H-J, Jiang A, de Jong P, Kaiser P, Kim H, et al: Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 2010, 8: e1000475-10.1371/journal.pbio.1000475.PubMedPubMed CentralView ArticleGoogle Scholar
- Warren WC, Clayton DF, Ellegren H, Arnold AP, Hillier LW, Künstner A, Searle S, White S, Vilella AJ, Fairley S, Heger A, Kong L, Ponting CP, Jarvis ED, Mello CV, Minx P, Lovell P, Velho TAF, Ferris M, Balakrishnan CN, Sinha S, Blatti C, London SE, Li Y, Lin Y-C, George J, Sweedler J, Southey B, Gunaratne P, Watson M, et al: The genome of a songbird. Nature. 2010, 464: 757-762. 10.1038/nature08819.PubMedPubMed CentralView ArticleGoogle Scholar
- Mallard duck (Anas platyrhynchos) project. [http://pre.ensembl.org/Anas_platyrhynchos/Info/Index]
- The Avian Genomes Project. [http://aviangenomes.org]
- Genome 10K Community of Scientists: Genome 10K: A proposal to obtain whole-genome sequence for 10 000 vertebrate species. J Hered. 2009, 100: 659-674.PubMed CentralView ArticleGoogle Scholar
- Organ CL, Shedlock AM, Meade A, Pagel M, Edwards SV: Origin of avian genome size and structure in non-avian dinosaurs. Nature. 2007, 446: 180-184. 10.1038/nature05621.PubMedView ArticleGoogle Scholar
- Organ CL, Brusatte SL, Stein K: Sauropod dinosaurs evolved moderately sized genomes unrelated to body size. Proc R Soc Lond B: Biol Sci. 2009, 276: 4303-4308. 10.1098/rspb.2009.1343.View ArticleGoogle Scholar
- Organ CL, Janes DE, Meade A, Pagel M: Genotypic sex determination enabled adaptive radiations of extinct marine reptiles. Nature. 2009, 461: 389-392. 10.1038/nature08350.PubMedView ArticleGoogle Scholar
- Schweitzer MH, Zheng W, Organ CL, Avci R, Suo Z, Freimark LM, Lebleu VS, Duncan MB, Vander Heiden MG, Neveu JM, Lane WS, Cottrell JS, Horner JR, Cantley LC, Kalluri R, Asara JM: Biomolecular characterization and protein sequences of the Campanian hadrosaur B. canadensis. Science. 2009, 324: 626-631. 10.1126/science.1165069.PubMedView ArticleGoogle Scholar
- Organ CL, Shedlock AM: Palaeogenomics of pterosaurs and the evolution of small genome size in flying vertebrates. Biol Lett. 2009, 5: 47-50. 10.1098/rsbl.2008.0491.PubMedPubMed CentralView ArticleGoogle Scholar
- Brochu CA, Wagner JR, Jouve S, Sumrall CD, Densmore LD: A correction corrected: consensus over the meaning of Crocodylia and why it matters. Syst Biol. 2009, 58: 537-543. 10.1093/sysbio/syp053.PubMedView ArticleGoogle Scholar
- Brochu C: Phylogenetic approaches toward crocodylian history. Annual Review of Earth and Planetary Sciences. 2003, 31: 357-397. 10.1146/annurev.earth.31.100901.141308.View ArticleGoogle Scholar
- Roos J, Aggarwal RK, Janke A: Extended mitogenomic phylogenetic analyses yield new insight into crocodylian evolution and their survival of the Cretaceous-Tertiary boundary. Molecular Phylogenetics and Evolution. 2007, 45: 663-673. 10.1016/j.ympev.2007.06.018.PubMedView ArticleGoogle Scholar
- Densmore LD: Biochemical and immunological systematics of the Order Crocodilia. Evol Biol. 1983, 16: 397-465.View ArticleGoogle Scholar
- Harshman J, Huddleston CJ, Bollback JP, Parsons TJ, Braun MJ: True and false gharials: a nuclear gene phylogeny of crocodylia. Syst Biol. 2003, 52: 386-402. 10.1080/10635150390197028.PubMedView ArticleGoogle Scholar
- Gatesy J, Amato G, Norell M, Desalle R, Hayashi C: Combined support for wholesale taxic atavism in gavialine crocodylians. Syst Biol. 2003, 52: 403-422. 10.1080/10635150390197037.PubMedView ArticleGoogle Scholar
- Ross JP: Crocodiles. Status Survey and Conservation Action Plan. 1998, IUCN, Gland, Switzerland and Cambridge, UK: ICUN/SSC Crocodile Specialist Group, 2Google Scholar
- Ryan C: Saltwater crocodiles as tourist attractions. J Sustain Tour. 1998, 6: 314-327. 10.1080/09669589808667319.View ArticleGoogle Scholar
- Ryan C, Harvey K: Who likes saltwater crocodiles? Analysing socio-demographics of those viewing tourist wildlife attractions based on saltwater crocodiles. J Sustain Tour. 2000, 8: 426-433. 10.1080/09669580008667377.View ArticleGoogle Scholar
- Merchant M, Thibodeaux D, Loubser K, Elsey RM: Amoebacidal effects of serum from the American alligator (Alligator mississippiensis). J Parasitol. 2004, 90: 1480-1483. 10.1645/GE-3382.PubMedView ArticleGoogle Scholar
- Merchant ME, Roche C, Elsey RM, Prudhomme J: Antibacterial properties of serum from the American alligator (Alligator mississippiensis). Comp Biochem Physiol B. 2003, 136: 505-513. 10.1016/S1096-4959(03)00256-2.PubMedView ArticleGoogle Scholar
- Merchant ME, Pallansch M, Paulman RL, Wells JB, Nalca A, Ptak R: Antiviral activity of serum from the American alligator (Alligator mississippiensis). Antiviral Res. 2005, 66: 35-38. 10.1016/j.antiviral.2004.12.007.PubMedView ArticleGoogle Scholar
- Merchant ME, Leger N, Jerkins E, Mills K, Pallansch MB, Paulman RL, Ptak RG: Broad spectrum antimicrobial activity of leukocyte extracts from the American alligator (Alligator mississippiensis). Vet Immunol Immunopath. 2006, 110: 221-228. 10.1016/j.vetimm.2005.10.001.View ArticleGoogle Scholar
- Milnes MR, Guillette LJ: Alligator tales: New lessons about environmental contaminants from a sentinel species. BioScience. 2008, 58: 1027-1036. 10.1641/B581106.View ArticleGoogle Scholar
- Campbell KR: Ecotoxicology of crocodilians. Appl Herpetol. 2003, 1: 45-163. 10.1163/157075403766451225.View ArticleGoogle Scholar
- Guillette LJ, Gross TS, Masson GR, Matter JM, Percival HF, Woodward AR: Developmental abnormalities of the gonad and abnormal sex hormone concentrations in juvenile alligators from contaminated and control lakes in Florida. Environ Health Perspect. 1994, 102: 680-688. 10.1289/ehp.94102680.PubMedPubMed CentralView ArticleGoogle Scholar
- Wu TH, Cañas JE, Rainwater TR, Platt SG, McMurry ST, Anderson TA: Organochlorine contaminants in complete clutches of Morelet's crocodile (Crocodylus moreletii) eggs from Belize. Environ Pollut. 2006, 144: 151-157. 10.1016/j.envpol.2005.12.021.PubMedView ArticleGoogle Scholar
- Grigg GC, Seebacher F, Franklin CE: Crocodilian Biology and Evolution. 2001, Surrey Beatty & SonsGoogle Scholar
- Brochu CA: Calibration age and quartet divergence date estimation. Evolution. 2004, 58: 1375-1382.PubMedView ArticleGoogle Scholar
- Brochu CA: Morphology, fossils, divergence timing, and the phylogenetic relationships of Gavialis. Syst Biol. 1997, 46: 479-522. 10.1093/sysbio/46.3.479.PubMedView ArticleGoogle Scholar
- Rayfield E: Establishing a framework for archosaur cranial mechanics. Paleobiology. 2008, 34: 494-515. 10.1666/07006.1.View ArticleGoogle Scholar
- Deeming DC, Ferguson MWJ: The mechanism of temperature dependent sex determination in Crocodilians: A hypothesis. Integr Comp Biol. 1989, 29: 973-985. 10.1093/icb/29.3.973.Google Scholar
- Lang JW, Andrews HV: Temperature-dependent sex determination in crocodilians. J Exp Zool. 1994, 270: 28-44. 10.1002/jez.1402700105.View ArticleGoogle Scholar
- Western PS, Harry JL, Marshall Graves JA, Sinclair AH: Temperature-dependent sex determination in the American alligator: expression of SF1, WT1 and DAX1 during gonadogenesis. Gene. 2000, 241: 223-232. 10.1016/S0378-1119(99)00466-7.PubMedView ArticleGoogle Scholar
- Pieau C, Dorizzi M, Richard-Mercier N: Temperature-dependent sex determination and gonadal differentiation in reptiles. Cell Mol Life Sci. 1999, 55: 887-900. 10.1007/s000180050342.PubMedView ArticleGoogle Scholar
- Ferguson MW, Joanen T: Temperature of egg incubation determines sex in Alligator mississippiensis. Nature. 1982, 296: 850-853. 10.1038/296850a0.PubMedView ArticleGoogle Scholar
- Cedeño-Vázquez JR, Rodriguez D, Calmé S, Ross JP, Densmore LD, Thorbjarnarson JB: Hybridization between Crocodylus acutus and Crocodylus moreletii in the Yucatan Peninsula: I. Evidence from mitochondrial DNA and morphology. J Exp Zool A Ecol Genet Physiol. 2008, 309: 661-673.PubMedView ArticleGoogle Scholar
- Ray DA, Dever JA, Platt SG, Rainwater TR, Finger AG, McMurry ST, Batzer MA, Barr B, Stafford PJ, McKnight J, Densmore LD: Low levels of nucleotide diversity in Crocodylus moreletii and evidence of hybridization with C. acutus. Conserv Genet. 2004, 5: 449-462.View ArticleGoogle Scholar
- Weaver JP, Rodriguez D, Venegas-Anaya M, Cedeño-Vázquez JR, Forstner MRJ, Densmore LD: Genetic characterization of captive Cuban crocodiles (Crocodylus rhombifer) and evidence of hybridization with the American crocodile (Crocodylus acutus). J Exp Zool A Ecol Genet Physiol. 2008, 309: 649-660.PubMedView ArticleGoogle Scholar
- Davis LM, Glenn TC, Elsey RM, Dessauer HC, Sawyer RH: Multiple paternity and mating patterns in the American alligator, Alligator mississippiensis. Mol Ecol. 2001, 10: 1011-1024. 10.1046/j.1365-294X.2001.01241.x.PubMedView ArticleGoogle Scholar
- Davis LM, Glenn TC, Strickland DC, Guillette LJ, Elsey RM, Rhodes WE, Dessauer HC, Sawyer RH: Microsatellite DNA analyses support an east-west phylogeographic split of American alligator populations. J Exp Zool. 2002, 294: 352-372. 10.1002/jez.10189.PubMedView ArticleGoogle Scholar
- Ryberg WA, Fitzgerald LA, Honeycutt RL, Cathey JC: Genetic relationships of American alligator populations distributed across different ecological and geographic scales. J Exp Zool. 2002, 294: 325-333. 10.1002/jez.10207.PubMedView ArticleGoogle Scholar
- The International Crocodilian Genomes Working Group. [http://crocgenomes.org]
- Krishan A, Dandekar P, Nathan N, Hamelik R, Miller C, Shaw J: DNA index, genome size, and electronic nuclear volume of vertebrates from the Miami Metro Zoo. Cytometry A. 2005, 65: 26-34.PubMedView ArticleGoogle Scholar
- Cohen MM, Gans C: The chromosomes of the order Crocodilia. Cytogenet Genome Res. 1970, 9: 81-105. 10.1159/000130080.View ArticleGoogle Scholar
- Valleley EM, Harrison CJ, Cook Y, Ferguson MW, Sharpe PT: The karyotype of Alligator mississippiensis, and chromosomal mapping of the ZFY/X homologue, Zfc. Chromosoma. 1994, 103: 502-507. 10.1007/BF00337388.PubMedView ArticleGoogle Scholar
- Shan X, Ray DA, Bunge JA, Peterson DG: A bacterial artificial chromosome library for the Australian saltwater crocodile (Crocodylus porosus) and its utilization in gene isolation and genome characterization. BMC Genomics. 2009, 10 (Suppl 2): S9-10.1186/1471-2164-10-S2-S9.PubMedPubMed CentralView ArticleGoogle Scholar
- King M, Honeycutt R, Contreras N: Chromosomal repatterning in crocodiles: C, G and N-banding and the in situ hybridization of 18S and 26S rRNA cistrons. Genetica. 1986, 70: 191-201. 10.1007/BF00122186.View ArticleGoogle Scholar
- Dalzell P, Miles LG, Isberg SR, Glenn TC, King C, Murtagh V, Moran C: Standardized Reference Ideogram for Physical Mapping in the Saltwater Crocodile (Crocodylus porosus). Cytogenet Genome Res. 2009, 127: 204-212. 10.1159/000293286.PubMedView ArticleGoogle Scholar
- Shedlock AM, Botka CW, Zhao S, Shetty J, Zhang T, Liu JS, Deschavanne PJ, Edwards SV: Phylogenomics of nonavian reptiles and the structure of the ancestral amniote genome. P Natl Acad Sci Usa. 2007, 104: 2767-2772. 10.1073/pnas.0606204104.View ArticleGoogle Scholar
- Miyake T, Amemiya CT: BAC libraries and comparative genomics of aquatic chordate species. Comp Biochem Physiol C Toxicol Pharmacol. 2004, 138: 233-244. 10.1016/j.cca.2004.07.001.PubMedView ArticleGoogle Scholar
- NISC Comparative Sequencing Initiative. [http://www.nisc.nih.gov]
- Miles LG, Isberg SR, Glenn TC, Lance SL, Dalzell P, Thomson PC, Moran C: A genetic linkage map for the saltwater crocodile (Crocodylus porosus). BMC Genomics. 2009, 10: 339-10.1186/1471-2164-10-339.PubMedPubMed CentralView ArticleGoogle Scholar
- Chojnowski JL, Franklin J, Katsu Y, Iguchi T, Guillette LJ, Kimball RT, Braun EL: Patterns of vertebrate isochore evolution revealed by comparison of expressed mammalian, avian, and crocodilian genes. J Mol Evol. 2007, 65: 259-266. 10.1007/s00239-007-9003-2.PubMedView ArticleGoogle Scholar
- Nabholz B, Künstner A, Wang R, Jarvis ED, Ellegren H: Dynamic evolution of base composition: causes and consequences in avian phylogenomics. Mol Biol Evol. 2011, 28: 2197-2210. 10.1093/molbev/msr047.PubMedPubMed CentralView ArticleGoogle Scholar
- Mortazavi A, Schwarz EM, Williams B, Schaeffer L, Antoshechkin I, Wold BJ, Sternberg PW: Scaffolding a Caenorhabditis nematode genome with RNA-seq. Genome Res. 2010, 20: 1740-1747. 10.1101/gr.111021.110.PubMedPubMed CentralView ArticleGoogle Scholar
- Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, Sharpe T, Hall G, Shea TP, Sykes S, Berlin AM, Aird D, Costello M, Daza R, Williams L, Nicol R, Gnirke A, Nusbaum C, Lander ES, Jaffe DB: High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA. 2011, 108: 1513-1518. 10.1073/pnas.1017351108.PubMedPubMed CentralView ArticleGoogle Scholar
- Earl D, Bradnam K, St John J, Darling A, Lin D, Fass J, Yu HOK, Buffalo V, Zerbino DR, Diekhans M, Nguyen N, Ariyaratne PN, Sung W-K, Ning Z, Haimel M, Simpson JT, Fonseca NA, Birol I, Docking TR, Ho IY, Rokhsar DS, Chikhi R, Lavenier D, Chapuis G, Naquin D, Maillet N, Schatz MC, Kelley DR, Phillippy AM, Koren S, et al: Assemblathon 1: A competitive assessment of de novo short read assembly methods. Genome Res. 2011, 21: 2224-2241. 10.1101/gr.126599.111.PubMedPubMed CentralView ArticleGoogle Scholar
- Tzika AC, Helaers R, Schramm G, Milinkovitch MC: Reptilian-transcriptome v1.0, a glimpse in the brain transcriptome of five divergent Sauropsida lineages and the phylogenetic position of turtles. EvoDevo. 2011, 2: 19-10.1186/2041-9139-2-19.PubMedPubMed CentralView ArticleGoogle Scholar
- Toronto International Data Release Workshop Authors, Birney E, Hudson TJ, Green ED, Gunter C, Eddy S, Rogers J, Harris JR, Ehrlich SD, Apweiler R, Austin CP, Berglund L, Bobrow M, Bountra C, Brookes AJ, Cambon-Thomsen A, Carter NP, Chisholm RL, Contreras JL, Cooke RM, Crosby WL, Dewar K, Durbin R, Dyke SOM, Ecker JR, Emam El K, Feuk L, Gabriel SB, Gallacher J, Gelbart WM, et al: Prepublication data sharing. Nature. 2009, 461: 168-170.View ArticleGoogle Scholar
- Li R, Fan W, Tian G, Zhu H, He L, Cai J, Huang Q, Cai Q, Li B, Bai Y, Zhang Z, Zhang Y, Wang W, Li J, Wei F, Li H, Jian M, Li J, Zhang Z, Nielsen R, Li D, Gu W, Yang Z, Xuan Z, Ryder OA, Leung FC-C, Zhou Y, Cao J, Sun X, Fu Y, et al: The sequence and de novo assembly of the giant panda genome. Nature. 2010, 463: 311-317. 10.1038/nature08696.PubMedPubMed CentralView ArticleGoogle Scholar
- Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J: Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005, 110: 462-467. 10.1159/000084979.PubMedView ArticleGoogle Scholar
- Price AL, Jones NC, Pevzner PA: De novo identification of repeat families in large genomes. Bioinformatics. 2005, 21 (Suppl 1): i351-i358. 10.1093/bioinformatics/bti1018.PubMedView ArticleGoogle Scholar
- Shedlock A: Phylogenomic investigation of CR1 LINE diversity in reptiles. Syst Biol. 2006, 55: 902-911. 10.1080/10635150601091924.PubMedView ArticleGoogle Scholar
- Kordis D: Transposable elements in reptilian and avian (Sauropsida) genomes. Cytogenet Genome Res. 2009, 127: 94-111. 10.1159/000294999.PubMedView ArticleGoogle Scholar
- Ray D, Hedges D, Herke S, Fowlkes J, Barns E, LaVie D, Goodwin L, Densmore L, Batzer M: Chompy: An infestation of MITE-like repetitive elements in the crocodilian genome. Gene. 2005, 362: 1-10.PubMedView ArticleGoogle Scholar
- Fujita MK, Edwards SV, Ponting CP: The Anolis lizard genome: An amniote genome without isochores. Genome Biol Evol. 2011, 3: 974-984. 10.1093/gbe/evr072.PubMedPubMed CentralView ArticleGoogle Scholar
- Chojnowski JL, Braun EL: Turtle isochore structure is intermediate between amphibians and other amniotes. Integr Comp Biol. 2008, 48: 454-462. 10.1093/icb/icn062.PubMedView ArticleGoogle Scholar
- OASES. [http://www.ebi.ac.uk/~zerbino/oases]
- Zerbino DR, Birney E: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18: 821-829. 10.1101/gr.074492.107.PubMedPubMed CentralView ArticleGoogle Scholar
- Bairoch A, Apweiler R: The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000, 28: 45-48. 10.1093/nar/28.1.45.PubMedPubMed CentralView ArticleGoogle Scholar
- Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AFA, Roskin KM, Baertsch R, Rosenbloom K, Clawson H, Green ED, Haussler D, Miller W: Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004, 14: 708-715. 10.1101/gr.1933104.PubMedPubMed CentralView ArticleGoogle Scholar
- Flicek P, Amode MR, Barrell D, Beal K, Brent S, Chen Y, Clapham P, Coates G, Fairley S, Fitzgerald S, Gordon L, Hendrix M, Hourlier T, Johnson N, Kähäri A, Keefe D, Keenan S, Kinsella R, Kokocinski F, Kulesha E, Larsson P, Longden I, McLaren W, Overduin B, Pritchard B, Riat HS, Rios D, Ritchie GRS, Ruffier M, Schuster M, et al: Ensembl 2011. Nucleic Acids Res. 2011, 39: D800-806. 10.1093/nar/gkq1064.PubMedPubMed CentralView ArticleGoogle Scholar
- Stanke M, Diekhans M, Baertsch R, Haussler D: Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008, 24: 637-644. 10.1093/bioinformatics/btn013.PubMedView ArticleGoogle Scholar
- Norris TB, Rickards GK, Daugherty CH: Chromosomes of tuatara, Sphenodon, a chromosome heteromorphism and an archaic reptilian karyotype. Cytogenet Genome Res. 2004, 105: 93-99. 10.1159/000078014.PubMedView ArticleGoogle Scholar
- Haag ES, Doty AV: Sex determination across evolution: Connecting the dots. PLoS Biol. 2005, 3: e21-10.1371/journal.pbio.0030021.PubMedPubMed CentralView ArticleGoogle Scholar
- Sereno PC: The Evolution of Dinosaurs. Science. 1999, 284: 2137-2147. 10.1126/science.284.5423.2137.PubMedView ArticleGoogle Scholar
- Chiappe LM: Glorified Dinosaurs: The Origin and Early Evolution of Birds. 2007, Wiley-Liss, 1Google Scholar
- Shen X-X, Liang D, Wen J-Z, Zhang P: Multiple genome alignments facilitate development of NPCL markers: A case study of tetrapod phylogeny focusing on the position of turtles. Mol Biol Evol. 2011, 28: 3237-3252. 10.1093/molbev/msr148.PubMedView ArticleGoogle Scholar
- Lyson TR, Sperling EA, Heimberg AM, Gauthier JA, King BL, Peterson KJ: MicroRNAs support a turtle + lizard clade. Biol Lett. 2011, 8: 104-107.PubMedPubMed CentralView ArticleGoogle Scholar
- Dolezel J, Bartos J, Voglmayr H, Greilhuber J: Nuclear DNA content and genome size of trout and human. Cytometry A. 2003, 51: 127-128.PubMedView ArticleGoogle Scholar