The draft genome of the C3 panicoid grass species Dichanthelium oligosanthes
- Anthony J. Studer†1, 2Email author,
- James C. Schnable†1, 3,
- Sarit Weissmann1,
- Allison R. Kolbe4,
- Michael R. McKain1,
- Ying Shao1, 5,
- Asaph B. Cousins4,
- Elizabeth A. Kellogg1 and
- Thomas P. Brutnell1Email author
© The Author(s). 2016
Received: 23 September 2015
Accepted: 5 October 2016
Published: 28 October 2016
Comparisons between C3 and C4 grasses often utilize C3 species from the subfamilies Ehrhartoideae or Pooideae and C4 species from the subfamily Panicoideae, two clades that diverged over 50 million years ago. The divergence of the C3 panicoid grass Dichanthelium oligosanthes from the independent C4 lineages represented by Setaria viridis and Sorghum bicolor occurred approximately 15 million years ago, which is significantly more recent than members of the Bambusoideae, Ehrhartoideae, and Pooideae subfamilies. D. oligosanthes is ideally placed within the panicoid clade for comparative studies of C3 and C4 grasses.
We report the assembly of the nuclear and chloroplast genomes of D. oligosanthes, from high-throughput short read sequencing data and a comparative transcriptomics analysis of the developing leaf of D. oligosanthes, S. viridis, and S. bicolor. Physiological and anatomical characterizations verified that D. oligosanthes utilizes the C3 pathway for carbon fixation and lacks Kranz anatomy. Expression profiles of transcription factors along developing leaves of D. oligosanthes and S. viridis were compared with previously published data from S. bicolor, Zea mays, and Oryza sativa to identify a small suite of transcription factors that likely acquired functions specifically related to C4 photosynthesis.
The phylogenetic location of D. oligosanthes makes it an ideal C3 plant for comparative analysis of C4 evolution in the panicoid grasses. This genome will not only provide a better C3 species for comparisons with C4 panicoid grasses, but also highlights the power of using high-throughput sequencing to address questions in evolutionary biology.
KeywordsDichanthelium oligosanthes PACMAD Panicoid grass Photosynthesis Carbonic anhydrase
The availability of complete genome sequences from multiple lineages is enabling a much deeper understanding of both the mechanistic basis of evolution and the diversification of gene regulatory networks. Furthermore, the breadth of genome sequences available provides opportunities to utilize non-model species in comparative genomics [1, 2]. Comparative approaches are made more powerful by sampling across the phylogenetic tree, particularly in cases of convergent evolution, and provide insight into the networks that underpin complex traits [3, 4]. High-throughput sequencing facilitates deep transcriptomic and genomic surveys, which can be leveraged to deduce the evolution of gene families by duplication and subsequent neofunctionalization and subfunctionalization of individual gene copies.
Growing concern over food and energy security has spurred translational research to increase the productivity and sustainability of crops. Optimization of photosynthesis is one approach that has the potential to greatly increase crop yields [5, 6]. Specifically, several groups are investigating the evolution of C4 from C3 photosynthesis with the objective of installing C4 traits into C3 species to improve yield . Enhanced photosynthetic efficiency associated with C4 photosynthesis not only increases productivity (i.e. grain or biomass yield), but also nutrient and water use efficiency . These benefits are the result of a carbon concentrating mechanism (CCM) that evolved to increases the CO2 concentration around the carboxylating enzyme ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco). Concentrating CO2 around Rubisco reduces its oxygenase activity, thereby significantly decreasing the amount of energy lost to photorespiration. The CCM of the majority of C4 species is achieved through partitioning of the biochemical reactions of photosynthesis into two cell types (mesophyll, M, and bundle sheath, BS) .
Despite being a complex trait, C4 photosynthesis has independently evolved over 60 times in the angiosperms  and at least 22 times in the grasses . The grasses are one of the most ecologically and economically significant plant clades and thus insights into the origins of C4 should provide opportunities for breeding and engineering improved germplasm. However, to date, comparative genomic approaches to studying the evolution of C4 photosynthesis have been limited to comparisons between crop species. These include C4 crops such as Zea mays, Sorghum bicolor, Setaria italica, and Saccharum officinarum from the clade containing the subfamilies Panicoideae, Arundinoideae, Chloridoideae, Micrairoideae, Aristidoideae, and Danthonioideae (PACMAD) and C3 crops such as Oryza sativa and Triticum aestivum from the clade containing the subfamilies Bambusoideae, Ehrhartoideae, and Pooideae (BEP). The limitations of PACMAD-BEP comparisons are that these two groups of grasses diverged more than 50 million years ago and the BEP clade contains no C4 species .
Distant evolutionary relationships sometimes fail to identify the genomic changes associated with the evolutionary emergence of C4 photosynthesis because differences in the photosynthetic pathway are confounded with the many other changes that occurred in the long independent history of the two lineages. The use of PACMAD-BEP comparisons has been driven by the availability of genomic resources. Currently the only published panicoid genome sequences are for panicoid species that utilize the C4 pathway for carbon fixation [12–15].
As the number of sequenced genomes increases, a more comprehensive understanding of the genes involved in C4 photosynthesis can be achieved. To this end, we sequenced and assembled a draft genome of D. oligosanthes. Histological, biochemical, and transcriptomic analyses confirm the C3 nature of D. oligosanthes and demonstrate its usefulness as a C3 panicoid grass for evolutionary comparisons. Furthermore, characteristics of D. oligosanthes also make it a potentially suitable genetic model for dissecting traits such as perenniality, cold tolerance, and flowering time. We demonstrate here how this high quality draft genome provides novel insights into the evolution and diversification of C4 photosynthesis in the grasses.
Results and discussion
Life history and phylogeny
The genus Dichanthelium includes ca. 72 species, which collectively are known as rosette grasses . All species are perennial and plants overwinter as a rosette that grows to produce sparsely branched culms in the spring (Fig. 1) [18, 19] (AJS and EAK, personal observations). In many species the rosette leaves senesce late in the growing season, the culms develop more branches, and a second round of flowering occurs—hence the genus name, which means “twice-flowering.” Cleistogamy is common in Dichanthelium species . While some of the inflorescences are borne well above the leaf sheath, others, particularly those from culm branches, never fully exert and self-pollinate without opening  (EAK, personal observations).
Dichanthelium is a member of the grass subfamily Panicoideae, tribe Paniceae. Like other members of Panicoideae, it has spikelets with two flowers, the upper one bisexual and the lower staminate or sterile (EAK, personal observations). Like other members of the tribe Paniceae, its chromosomes are in multiples of 9 . Species of Dichanthelium are similar to but morphologically distinct from species of Panicum, so for many years Dichanthelium was treated as a subgenus of Panicum , a treatment that is still followed by some authors . However, it was recognized as a distinct genus by Gould and Clark in 1978 .
Phylogenetic data support the distinction of Dichanthelium as a separate genus [11, 23, 24], showing that it is clearly not a lineage in the Panicum clade. Within the tribe Paniceae, the position of Dichanthelium is uncertain. Chloroplast sequences show that it is closely related to the large clade (the MPC clade) that includes groups of species utilizing different C4 photosynthesis subtypes: Melinidinae (PEPCK), Panicinae (NAD-ME), and Cenchrinae (NADP-ME) . Depending on the sample of taxa and chloroplast sequences, Dichanthelium is either sister to the MPC plus the Australian species Homopholis and Walwhalleya  or as part of a larger clade including mostly C3 species but also the C4 members of the Australian Neurachninae . In contrast, data from a single nuclear gene sequence place it sister to all Paniceae except Echinochloa . In either case, it is more closely related to the C4 panicoid species than any of the C3 grasses for which complete genome assemblies are currently available (e.g. O. sativa and B. distachyon). The close relationship between Dichanthelium and C4 Paniceae is confirmed when sequences from the D. oligosanthes chloroplast genes rbcL, ndhF, and matK were used in maximum likelihood and Bayesian analyses with select species previously used to construct a phylogeny of the grasses (Fig. 1c) .
Anatomy and physiology of Dichanthelium oligosanthes
D. oligosanthes physiology and biochemical characteristics
D. oligosanthes has a distinctively C3 isotopic signature (Table 1; ), which is consistent with other Dichanthelium species that have been previously reported [17, 39, 40]. This value reflects strong isotopic discrimination by Rubisco, demonstrating that CO2 fixation occurs via the C3 cycle. Both C4 species and Type II C3-C4 intermediates have distinct isotopic signatures indicating CO2 fixed by PEPC. However, type I intermediates have a C3-like isotopic signature, but can be differentiated by anatomical and biochemical characteristics that are more similar to C4 species [9, 41]. Taken together, the leaf anatomy, gas exchange, and biochemical measurements corroborate previous reports that D. oligosanthes is a C3 species .
Nuclear and chloroplast genome assembly and annotation
To estimate the genome size of D. oligosanthes, flow cytometry and k-mer abundance assays were performed. Flow cytometry of D. oligosanthes accession Kellogg 1175 produced an estimated genome size of approximately 960 Mb, placing it within the range for diploid panicoid grasses. Single copy sequences present a distinctive peak in histograms of k-mer abundance centered at the average depth of sequencing. Sequences repeated twice in the genome form a second peak at twice the average sequencing depth and so on. Based on k-mer analysis, the estimated genome size of D. oligosanthes was revised downward to 750 Mb of which approximately 360 Mb is single copy sequence.
Sequence analysis was performed on a single individual derived from self-pollination of a wild-collected individual (Kellogg 1175), collected at the Shaw Nature Reserve in Gray Summit, MO. D. oligosanthes is a predominately self-pollinating species, so heterozygosity was expected to be low. A D. oligosanthes draft assembly was generated using data from libraries with median 180 bp insert and 5 kb insert sizes. Sequencing was performed on an Illumina HiSeq 2000 platform with 100 bp paired end sequencing. Approximately 90 Gb and 86 Gb of sequence was generated from the 180 bp and 5 kb libraries, respectively. These data were assembled using Allpaths-LG . Additional scaffolding was conducted using two mate pair insert libraries (5 kb median and 6.3 kb median insert size) and the software package SSPACE  and sequence present in a number of gaps was determined using GapCloster . The final assembly consisted of 17,441 scaffolds (589 megabases), which were constructed from 76,905 contigs (476 megabases of sequence). The assembly we present here therefore covers 78 % of the estimated total genome of D. oligosanthes, including determined sequence for 63 % of the genome. Based on the alignment of a set of low copy genes conserved in S. bicolor, S. italica, and Panicum hallii, we determined that our assembly contains at least 98 % of the D. oligosanthes gene space (3358/3430 genes identified). A total of 30,153 genes were annotated through a combination of homology-based and de novo annotation using Maker2 . For these genes, 1 kb of promoter sequence was recovered 94.2 % of the time, and 5 kb of promoter sequence was recovered 80.5 % of the time. 86.5 % of all annotated genes were present on multi-gene scaffolds, enabling syntenic comparisons to other grass genomes.
D. oligosanthes chloroplast genome assembly statistics
Comparative analysis of genes expression across leaf development
One of the unique features of monocot leaves is that developmental processes proceed linearly, with the base segments being the least and the tip being the most differentiated . This continuous gradient has been exploited previously to investigate the expression of genes related to photosynthesis in Z. mays and O. sativa  and S. bicolor and S. viridis . As a C3 panicoid grass, D. oligosanthes provides a unique opportunity to examine the diversification of genes and networks associated with C4 photosynthesis using comparative transcriptomic approaches. To expand the gradient analyses, developmental leaf gradients were constructed for D. oligosanthes and the closely related C4 species S. viridis. The S. viridis gradient from  was not used because the data were not replicated.
Hierarchical clustering of global gene expression profiles using Spearman correlation values (see Additional file 1: Figure S2) indicates that the replicates of each segment are strongly correlated and each segment clusters separately (Fig. 5d). Pearson correlation analysis produced similar results. The strong correlation between segments 3 and 4 suggests that the tip of the leaf may be fully mature. These new leaf gradients provide an opportunity to investigate a variety of biological processes, including changes in gene regulation linked to the evolution of C4 photosynthesis.
C4 carbon shuttle gene expression
Consistent with the results of the anatomical and physiological analyses, the expression profile of all six core C4 enzymes indicates that D. oligosanthes utilizes the ancestral C3 carbon fixation pathway. Large amounts of CA protein are known to be present in the leaves of C3 plants  and of the six genes encoding enzymes in the C4 pathway, only carbonic anhydrase1 (ca1) was expressed at a high level in D. oligosanthes (Fig. 6). Significant accumulation of transcripts encoded by four other C4 genes was observed in S. viridis and S. bicolor. Expression levels of the C4 genes were similar in S. bicolor and S. viridis except for ca1 and nicotinamide adenine dinucleotide phosphate malic enzyme (nadp-me), both of which are twofold lower in S. bicolor (Fig. 6). Although comparing absolute expression levels across species introduces numerous potential sources of bias and error, a difference in the number of tandemly duplicated ca gene copies likely explains the expression difference between S. bicolor and S. viridis. Unlike ca, nadp-me does not have highly expressed paralogs. Protein blot and enzyme activity assays would be needed to determine whether differences in enzymatic efficiency and/or differences in translational regulation compensate for the difference in transcriptional abundance for this gene.
Three major subtypes of C4 photosynthesis are recognized as (1) NADP-ME, (2) NAD-ME, and (3) PEPCK, named for the primary decarboxylating enzyme employed by each subtype. While traditionally these pathways have been viewed as independent, biochemical data and recent modeling of the C4 pathways revealed that the PEPCK pathway could be complementary in NADP-ME subtype species, such as Z. mays, S. viridis, and S. bicolor [58, 59]. Accordingly, although Z. mays is classified as an NADP-ME C4 subtype, the PEPCK pathway is likely active and contributes to total photosynthesis [60, 61]. This is reflected in the expression of pepck1 in Z. mays, which accumulates to 2000 rpkm in the developing leaf tip and follows the expected expression profile of a gene involved in photosynthesis . Interestingly, very low pepck expression levels were observed in both S. viridis and S. bicolor (Fig. 6). The reported lack of PEPCK protein in S. bicolor  confirms the expression result and provides further evidence that the PEPCK pathway is not active in these species. Z. mays and S. bicolor are believed to share a recent common ancestor , suggesting the acquisition of the PEPCK pathway in Z. mays may be a relatively new evolutionary innovation or, alternatively, that this secondary pathway was lost in S. bicolor after it diverged from Z. mays.
C4 transcription factor identification
The four TFs identified that have expression profiles specific to Z. mays most likely result from the use of the Z. mays leaf developmental gradient as the initial filtering step and do not reflect a difference in the number of TFs with lineage-specific gene expression patterns among the species included in this analysis. However, given that the PEPCK pathway is specific to Z. mays in this small sample of C4 species, it is tempting to speculate that the Z. mays specific TFs contribute to regulation of the PEPCK pathway. Interestingly, utilizing data on cell type specific expression from the Z. mays eFP browser (http://bar.utoronto.ca/efp_maize/cgi-bin/efpWeb.cgi), a single TF was identified as preferentially expressed in bundle sheath cells, where PEPCK is needed for C4 photosynthesis. Taken together, these results suggest that the myb TF encoded by GRMZM2G130149 may be one of the genes that regulate the transcription of pepck in Z. mays.
C4 specific amino acids under selection
In addition to probing the evolution of C4 photosynthesis using expression data, we also investigated amino acid substitutions in key C4 enzymes. Specific residues in several of the C4 carbon shuttle genes have been previously shown to be under positive selection in C4 lineages. These include PEPC , which encodes the first carboxylation reaction in C4 photosynthesis, the decarboxylating enzymes NADP-ME  and PEPCK , as well as the large subunit of Rubisco (rbcL) . Peptide sequences for these genes in D. oligosanthes were compared to the known amino acid sequences across the grass family to identify signatures of selection in the amino acid sequences prior to the divergence of the C4 lineages from D. oligosanthes.
Christin et al. identified a set of nine amino acid substitutions in the PEPCK gene that exhibited positive selection in species classified as the PEPCK C4 subtype, such as Urochloa maximus . Of the two PEPCK genes present in Z. mays, the paralog that showed an expression pattern in our data consistent with a role in C4 photosynthesis (PEPCK1) contains two of the nine amino acid substitutions shown to be under positive selection in C4 species that utilize the PEPCK pathway. However, the copy that does not show a photosynthesis-linked pattern of expression (PEPCK2) does not contain any of the positively selected amino acid substitutions. In S. bicolor and S. viridis, where no expression evidence links PEPCK to C4 photosynthesis, none of the positively selected amino acid substitutions are present (Fig. 8b).
Structural evolution of the carbonic anhydrase gene family in the grasses
The first biochemical step in C4 photosynthesis is catalyzed by a beta-carbonic anhydrase (CA), which hydrates CO2 to produce bicarbonate. The grass lineage contains a locus with multiple tandemly arranged ca genes. The ca genes comprising this locus in Z. mays were previously defined and functionally characterized in Z. mays . Rapid Amplification of complementary DNA (cDNA) Ends (RACE) was used to define the ca1 transcription start site in D. oligosanthes, S. viridis, and S. bicolor, as well as Brachypodium distachyon and Oryza sativa. These experiments confirmed the expression of a ca1 transcript in D. oligosanthes predicted to be plastid localized, which includes a ~3 kb first intron that is also present in B. distachyon, O. sativa, and S. bicolor. This long transcript predicted to encode a plastid-targeted isoform was also present, but not predicted in S. viridis. Transcript data from RACE experiments were used to improve the ca1 gene annotation for D. oligosanthes. Furthermore, RNA-seq data revealed that ca1 is the most highly expressed of the ca gene copies in both C3 and C4 species.
To enable functional comparisons between species, the orthologous/paralogous relationship of the tandemly arranged ca genes was examined for grass species with available genomes. ca1 coding sequences were also obtained for Panicum virgatum (switchgrass), Hordeum vulgare (barley), Triticum aestivum (wheat), and Musa acuminata (banana). A maximum-likelihood tree of the ca gene copies produced distinct clades and within clades the phylogenetic relationships were preserved (Additional file 1: Figure S4). This analysis provides comprehensive naming of the gene copies across species (Fig. 9).
Based on these comparative analyses, the following model describes the evolution of the ca gene family (Fig. 9). The base copy number for the ca tandem array in the grasses is two (ca1 and ca4) and in the BEP clade these two tandem gene copies have been retained without modification. In the lineage leading to the panicoid grasses, a duplication of ca1 generated ca3 prior to the divergence of the Andropogoneae and Paniceae lineages. The ca2 gene copy appears to have arisen from a second duplication of ca1 in the lineage leading to the Andropogoneae tribe and may have been linked with the duplication/fusion event that resulted in two active site domains of ca1 for species in the same lineage. A deletion of the ca4 gene is only observed in Z. mays. It is currently not clear where ca4 is expressed or if it is functional; however, the conservation of this gene for 50 million years in both the BEP and (some) PACMAD lineages suggests it likely retained a function for at least some portion of this time. As the number of available grass genomes increases, a better estimate of the timing of gene duplication and loss events will be obtained. As gene duplication provides a foundation for subfunctionalization and neofunctionalization of gene function, comparative genomic approaches that exploit synteny relationships should increase our power to detect signatures of selection.
The parallel evolution of C4 photosynthesis, a complex trait that requires the co-option and redeployment of enzymes from a wide range of biochemical pathways, as well as significant modifications of leaf anatomy, has long been a scientific puzzle. Here, we present the genome sequence of a C3 species from within a clade rich in C4 origins, the PACMAD grasses. We definitively show that despite having many C4 relatives, D. oligosanthes uses the C3 pathway for carbon fixation by coupling anatomical and physiological characteristics with sequence-based approaches. We demonstrate the usefulness of D. oligosanthes as a C3 species for investigating C4 evolution in the panicoid grasses through a variety of analyses. D. oligosanthes is also well placed for future studies of abiotic stress tolerance and adaption to temperate climates. The majority of panicoid grasses are native to tropical or subtropical environments. As a close relative indigenous to the continental United States, D. oligosanthes may be able to provide insight into the mechanisms employed by natural selection to develop cold and freezing tolerance in panicoid grass species, knowledge that would be of great value to cold sensitive panicoid crops such as Z. mays, S. bicolor, and S. officinarum. To the best of our knowledge, D. oligosanthes is only the third non-cultivated grass species with a sequenced genome, and the first C3 grass in the otherwise C4-rich PACMAD clade.
Importantly, the results presented here highlight how comparisons among multiple closely related species produce smaller candidate gene lists that may be linked to a trait than pairwise comparisons between individual species. This is especially true when traits have evolved in parallel. Until recently, the time-consuming and expensive process of genome sequencing meant genome assemblies were only available for economically significant plant species (and a small number of widely used models). The uneven evolutionary distribution and small number of model/crop species limited the usefulness of comparative approaches. Today, researchers can select species based on informative phenotypes or phylogenetic positions and generate genomic data as needed. As scientists move beyond pairwise comparisons and begin to seek answers from large clusters of related species with sequenced genomes, new comparative genomics tools and statistical approaches will need to be developed.
A wild-grown plant was collected from the Shaw Nature Reserve, in Gray’s Summit, MO. A voucher specimen, Kellogg 1175, is deposited at the herbarium of the Missouri Botanical Garden (MO). Locality data are available at http://www.tropicos.org/Specimen/100315254.
Chloroplast gene sequences from rbcL, ndhF, and matK used for the phylogenetic analysis were aligned manually, and the best-fit model was estimated as GTR + G + I by RAxML using all sites . The Bayesian analysis was performed using MrBayes v3.2, with 5 million generations, sampling every 1000 generations. Substitution rates and state frequencies used a Dirichlet prior. The gamma distribution was approximated using four categories.
Two-millimeter strips were cut crosswise to the leaf of mature D. oligosanthes plants and were fixed for 2 h in 2 % gluteraldehyde in 100 mM, pH 6.8, PIPES buffer at room temperature. The leaves were then washed three times in 100 mM, pH 6.8, PIPES buffer. The leaves were post-fixed in buffered osmium tetroxide for 1.5 h and rinsed three times in water. The leaves were dehydrated in an ethanol/acetone series as follow: 5, 10, 20, 30, 50, 75, 95 % ETOH for 20 min each, followed by 30 min in 100 % ETOH, 15 min in 100 % acetone, and a second 45-min incubation in fresh 100 % acetone. The leaves were infiltrated with Spur’s resin (Cat. no. RT14300, Electron Microscopy Science) dissolved in 100 % acetone as follows: 5 % 12 h, 10 % 12 h, 25 % 24 h, 50 % 24 h, 75 % 24 h, 100 % 24 h. The leaves were then embedded in 100 % resin and incubated at 60 °C for two days. Resin blocks were cut into 1 μM sections using a Leica Ultracut UCT microtome. Slides were stained in 1 % toluidine blue solution (Cat. No. T-140, Spectrum), and then sealed with a No. 1.5 cover glass and Permount (Cat. No. SP15-100, Fisher Scientific). Images were acquired using a Nikon E800 microscope with a 60X PCAN APO, 1.4NA oil immersion phase objective, Digital capture Q-imaging Ratiga 1300 camera, coupled with an LCD color filter, and Q-imaging software.
Gas exchange measurements
Net rates of CO2 assimilation were measured on young, fully expanded leaves using the LI-COR 6400XT gas exchange system (LI-COR Biosciences). Measurements were made at 25 °C and an irradiance of 1500 μmol quanta m–2 s–1 . CO2 response curves were measured at 25, 19.4, 16.6, 13.5, 7.6, 4.7, 36.7, 60.4, and 140.3 Pa. After gas exchange measurements were complete, leaf material was flash frozen and stored at –80 ° C for enzyme assays.
Approximately 2 cm2 leaf material was ground on ice using a mortar and pestle in 1 mL of 50 mM HEPES (pH 7.8), 1 % (v/v) polyvinylpolypyrrolidone, 1 mM EDTA, 10 mM dithiothreitol, 0.1 % (v/v) Triton X-100, and 2 % (v/v) protease inhibitor cocktail (Sigma-Aldrich). Crude extracts were centrifuged at 4 ° C for 1 min at 17,000 g, and supernatant was collected for the CA, Rubisco, and PEPC assays.
CA assays were performed at 25 °C in 2 mL of assay buffer (CO2-free 100 mM EPPS-NaOH, pH 8.0, 10 mM DTT). CA activity was measured using a membrane inlet mass spectrometer to measure the rates of 18O2 exchange from labeled 13C18O2 to H2 16O with a total carbon concentration of 1 mM [69–71]. The hydration rates were calculated from the enhancement in the rate of 18O loss over the uncatalyzed rate by applying the non-enzymatic first-order rate constant .
Rubisco activity was spectrophotometrically measured in 1 mL of assay buffer (100 mM EPPS-NaOH (pH 8.0), 20 mM MgCl2, 1 mM EDTA, 1 mM ATP, 5 mM creatine phosphate, 20 mM NaHCO3, 0.5 mM ribulose 1,5-bisphosphate, 0.2 mM NADH) containing coupling enzymes (12.5 units mL–1 creatine phosphokinase, 250 units mL–1 CA, 22.5 units mL–1 phosphoglycerokinase, 20 units mL–1 glyceraldehyde-3-phosphodehydrogenase, 56 units mL–1 triose isomerase, and 20 units mL–1 glycerol-3-phosphodehydrogenase). Rubisco activity was calculated from the consumption of NADH, which was monitored via the change in absorption at 340 nm .
PEPC activity was assayed in 1 mL of assay buffer (100 mM EPPS-NaOH (pH 8.0), 20 mM MgCl2, 1 mM EDTA, 5 mM NaHCO3, 0.2 mM NADH, 5 mM D-glucose-6-phosphate, 12 units mL–1 malate dehydrogenase, and 4 mM phosphoenolpyruvate). NADH consumption was monitored at 340 nm .
The CO2 response curves for C3 and C4 photosynthesis were modeled according to von Caemmerer (2000). The C3 model was matched to gas exchange data from D. oligosanthes using a V cmax = 77 μmol m–2 s–1 and a J max = 144 μmol m–2 s–1. C4 photosynthesis was modeled using V cmax = 35 μmol m–2 s–1 to have comparable maximum photosynthetic rates to those measured in D. oligosanthes. All other modeling parameters were taken from von Caemmerer (; see Tables 2.3 and 4.1).
Dried leaf material was placed in tin capsules and combusted in a hydrogen/carbon/nitrogen elemental analyzer (ECS 4010; Costech Analytical) to determine carbon isotopic composition.
Growth conditions for physiological measurements
Plants were grown in a controlled-environment growth chamber (Biochambers; GC-16, Winnipeg, Manitoba, Canada) with a 14-h photoperiod and a photosynthetic photon flux density of 500 μmol m–2 s–1 at leaf height. Relative humidity was maintained at approximately 40 %. Day/night air temperatures were 28 and 18 °C, respectively. Plants were watered as needed and fertilized weekly.
Genome sequencing and assembly
All DNA used for sequencing was taken from a single F2 plant descended from the original collection, Kellogg 1175. Following the recommended Allpaths-LG sequencing protocol , a 180 bp insert DNA-seq library and 5 kb mate pair DNA libraries were prepared and sequenced at the Cornell University Sequencing Core. Each library was sequenced twice in 2 × 100 bp paired end sequencing lanes. Both libraries were quality trimmed using Trimmomatic . The mate pair libraries were subject to a second quality control step in which polymerase chain reaction duplicate reads (defined as cases where the first 20 base pairs for both forward and reverse reads were identical between independently sequenced clusters) were removed using a simple python script. The resulting dataset was assembled using Allpaths-LG , with the ploidy file set to 2 (diploid), and otherwise default settings. Final scaffolding was conducted with SSPACE  utilizing long mate pair libraries sequenced on an Illumina MiSeq using 2 × 150 sequencing.
A subset of the total sequencing data (2.3 Gigabytes) was used to assemble the chloroplast genome. SPAdes v.3.1.0 was used to make the initial assembly using the “only-assembler” option with k-mer sizes of 55 and 87. The SPAdes contigs were blasted (blastn) against the Z. mays chloroplast genome (NC_001666.2) and mitochondrial genome (NC_007982.1) with an e-value cutoff of 1e-40. The contigs were identified as plastid or mitochondrial using a custom script to determine the optimal e-value and longest single hit length. Plastid-like sequences were then put into Sequencher (Genecodes— 5.2.4). Ends of contigs were identified and were trimmed by 100 base pairs to remove potentially misassembled regions. Twenty base pair sequences from the end of trimmed contigs were used to search all raw reads for exact matching sequence using “grep.” This was repeated with the reverse complement of the sequence. The matching reads were aligned to the contigs in Sequencher and used to extend the contig ends. This process continued until all gaps were closed. The sequence was orientated for the typical representation of large single copy, inverted repeat B, small single copy, and inverted repeat A. Jellyfish v.2.1.3  was used to estimate 20-mer abundance from the reads and these abundance values were matched to the assembled plastome to determine a sliding window coverage across the assembly. No anomalies in coverage were identified and the assembly was considered complete. Annotation of the plastome was completed using DOGMA . A graphical representation of the annotated plastome was created using Circos v.0.66  as implemented in Verdant (verdant.iplantcollaborative.org).
Gene annotation was conducted using Maker2 . For each of O. sativa, Brachypodium distachyon, S. bicolor, Z. mays, and S. italica, the “primaryTranscriptOnly” files for cDNA and protein sequences were downloaded from phytozome and alignments of these files against the D. oligosanthes genome assembly were used for the evidence based portion of Maker2’s analysis (Maker2 also incorporates ab initio gene prediction approaches into its final analysis).
Leaf gradient construction
Plants were sown in MetroMix360 and grown in Conviron BDW-40 controlled chambers under 500 μmol/m2/s of light with 12-h light/dark cycles at 31 °C and 22 °C, respectively, and a constant 50 % relative humidity. Leaf tissue was sampled after the second leaf ligule formed but before the third leaf had fully expanded. This corresponds to three weeks after planting for D. oligosanthes, and nine days after planting for S. viridis. Samples were collected in the morning 2 h after the lights turned on and were immediately flash frozen in liquid nitrogen. Four plants were pooled for each segment replicate of D. oligosanthes and ten plants were pooled for each segment replicate of S. viridis. TriPure Isolation Reagent (Sigma) was used to extract total RNA following the manufacturer’s recommendations. Libraries were prepared according to Wang et al. . D. oligosanthes libraries were sequenced on a HiSeq2000 using a 100 bp paired end run and S. viridis libraries were sequenced on a HiSeq2500 using a 100 bp single end run.
Gene expression analysis
Raw Illumina reads from RNA-seq libraries were subjected to quality trimming using CutAdapt  with a minimum quality score of 20 and a minimum read length of 25 bp after trimming. Trimmed reads were aligned to reference genomes (S. bicolor v1.4, D. oligosanthes v1.0, and S. italica v2.1) using , allowing splicing over canonical RNA-splice sites, a maximum number of reported alignments per read of 10, a maximum mismatch rate of 3 (allowing up to three SNPs or one SNP and one InDel). FPKM values were calculated using Cufflinks .
Leaf gradient and TF analysis
A minimum FPKM threshold was applied to the Cufflinks output. This threshold stipulates that a gene must have an FPKM value equal to or greater than one in at least one segment of any replicate. Correlation matrices and heat map visualizations were constructed using R software packages (code available upon request). CoGe Blast and CoGe SynFind tools were used to find core C4 gene and TF orthologs for the different grass species [47, 48]. Segments 1, 4, 9, and 14 from Z. mays and 1, 3, 6, and 9 from O. sativa from previously published datasets  were compared to the four segment gradient data for D. oligosanthes and S. viridis generated in this study. Transcription factor leaf gradient expression profiles from the different species were overlaid and compared by eye.
PEPC amino acid tree
The coding sequence for Do021545.1 contained an alternative splice site and had to be manually curated with raw sequencing reads. Amino acid sequences are listed in Supplementary Material. Amino acid sequences were aligned using the Muscle aligner. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model. The bootstrap consensus tree inferred from 500 replicates was taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 50 % bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. The analysis involved 12 amino acid sequences. All positions containing gaps and missing data were eliminated. There was a total of 863 positions in the final dataset. Evolutionary analyses were conducted in MEGA (version 6.06) .
ca gene family
RACE was performed using the GeneRacer kit (Invitrogen, cat#L1502-01) using RNA extracted from leaf tissue. Ten independent clones were sequenced per RACE reaction. ca gene coding sequences were aligned using the Muscle codon aligner in MEGA (version 6.06) . ca1 sequences from Z. mays and S. bicolor have duplicated active site domains. These gene fusions were treated as two separate genes in the analysis. Alignment gaps were removed and then the alignment was loaded into Cipres Science Gateway (http://www.phylo.org/), where RaxML  was used to generate a maximum-likelihood tree.
Carbon concentrating mechanism
Dual Organellar GenoMe Annotator
Nicotinamide adenine dinucleotide malic enzyme
Nicotinamide adenine dinucleotide phosphate malic enzyme
Rapid Amplification of cDNA Ends
Rubisco large subunit
We thank the Danforth Center greenhouse staff for their assistance cultivating the D. oligosanthes accessions and C. Shyu for generously providing the S. viridis leaf image.
This work was supported by a Department of Energy - Division of Biosciences Fellowship of the Life Sciences Research Foundation to AJS (DE-FG02-12ER16337), a NSF PGRP fellowship to JCS (IOS-1306738), a grant from the National Science Foundation (MCB-1127017) and funding from the Donald Danforth Plant Science Center to TPB, a grant from the Division of Chemical Sciences, Geosciences and Biosciences, Office of Basic Energy Sciences, Photosynthetic Systems (DE-FG02-09ER16062) and with instrumentation from an NSF Major Research Instrumentation grant to ABC (#0923562).
Availability of data and materials
The Dichanthelium v.1 genome assembly and gene model annotations are freely available from NCBI under accession number LWDX00000000. The raw sequencing reads used for the genome assembly have been submitted to NCBI’s SRA under accession SRP067405, SRS1207314, SRX1483064, SRR3018854, SRX1488867, SRR3018860. RNA-seq data from the D. oligosanthes, and S. viridis leaf gradients have been deposited in the NCBI SRA (SRP058858 and SRP063517, respectively).
AS designed experiments and analyzed data, JS designed experiments and analyzed data, SW performed the anatomical analysis and collected leaf gradient data, AK performed physiology experiments and analyzed data, MM assembled and annotated the chloroplast genome, YS conceived the sequencing strategy and collected data, AC designed physiology experiments and analyzed data, EK provided the accession, designed experiments, and did phylogenetic analyses, TB designed experiments and analyzed data. All authors contributed, read, and approved the final manuscript.
The Donald Danforth Plant Science Center is pursuing a patent on behalf of AS and TB on aspects related to the findings in this manuscript.
Ethics approval and consent to participate
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Ellegren H. Genome sequencing and population genomics in non-model organisms. Trends Ecol Evol. 2014;29:51–63.View ArticlePubMedGoogle Scholar
- Lyons E, Pedersen B, Kane J, Freeling M. The value of nonmodel genomes and an example using SynMap within CoGe to dissect the hexaploidy that predates the rosids. Trop Plant Biol. 2008;1:181–90.View ArticleGoogle Scholar
- Koonin EV, Aravind L, Kondrashov AS. The impact of comparative genomics on our understanding of evolution. Cell. 2000;101:573–6.View ArticlePubMedGoogle Scholar
- Christin PA, Salamin N, Savolainen V, Duvall MR, Besnard G. C4 Photosynthesis evolved in grasses via parallel adaptive genetic changes. Curr Biol. 2007;17:1241–7.View ArticlePubMedGoogle Scholar
- Zhu XG, Long SP, Ort DR. Improving photosynthetic efficiency for greater yield. Annu Rev Plant Biol. 2010;61:235–61.View ArticlePubMedGoogle Scholar
- Ort DR, Merchant SS, Alric J, Barkan A, Blankenship RE, Bock R, et al. Redesigning photosynthesis to sustainably meet global food and bioenergy demand. Proc Natl Acad Sci U S A. 2015;112:8529–36.View ArticlePubMedPubMed CentralGoogle Scholar
- Sage RF, Zhu XG. Exploiting the engine of C4 photosynthesis. J Exp Bot. 2011;62:2989–3000.View ArticlePubMedGoogle Scholar
- Sage RF, Sage TL, Kocacinar F. Photorespiration and the evolution of C4 photosynthesis. Annu Rev Plant Biol. 2012;63:19–47.View ArticlePubMedGoogle Scholar
- Sage RF. The evolution of C4 photosynthesis. New Phytol. 2004;161:341–70.View ArticleGoogle Scholar
- Sage RF, Christin PA, Edwards EJ. The C4 plant lineages of planet Earth. J Exp Bot. 2011;62:3155–69.View ArticlePubMedGoogle Scholar
- Grass Phylogeny Working Group II. New grass phylogeny resolves deep evolutionary relationships and discovers C4 origins. New Phytol. 2012;193:304–12.View ArticleGoogle Scholar
- Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326:1112–5.View ArticlePubMedGoogle Scholar
- Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, et al. The Sorghum bicolor genome and the diversification of grasses. Nature. 2009;457:551–6.View ArticlePubMedGoogle Scholar
- Zhang G, Liu X, Quan Z, Cheng S, Xu X, Pan S, et al. Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential. Nature Biotechnol. 2012;30:549–54.View ArticleGoogle Scholar
- Cannarozzi G, Plaza-Wuthrich S, Esfeld K, Larti S, Wilson YS, Girma D, et al. Genome and transcriptome sequencing identifies breeding targets in the orphan crop tef (Eragrostis tef). BMC Genomics. 2014;15:581.View ArticlePubMedPubMed CentralGoogle Scholar
- Gould FW. Chromosome numbers of southwest gasses. Am J Bot. 1958;45:757–67.View ArticleGoogle Scholar
- Brautigam A, Schliesky S, Kulahoglu C, Osborne CP, Weber AP. Towards an integrative model of C4 photosynthetic subtypes: insights from comparative transcriptome analysis of NAD-ME, NADP-ME, and PEP-CK C4 species. J Exp Bot. 2014;65:3579–93.View ArticlePubMedPubMed CentralGoogle Scholar
- Freckmann RW, Lelong MG. Dichanthelium. In: Barkworth ME, editor. Flora of North America, vol. 25. New York: Oxford University Press; 2003.Google Scholar
- Mohlenbrock RH. Vascular flora of Illinois. Carbondale: SIU Press; 2002.Google Scholar
- Clayton WD, Renvoize SA. Genera graminum. Grasses of the world. Kew Bulletin Additional Series. 1986;13:1–389.Google Scholar
- Clayton WD, Harman KT, Williamson H. GrassBase - The online world grass flora. 2006 onwards. http://www.kew.org/data/grasses-db.html.
- Gould FW, Clark CA. Dichanthelium (Poaceae) in the United States and Canada. Ann Missouri Bot Gard. 1978;65:1088–132.View ArticleGoogle Scholar
- Giussani LM, Cota-Sanchez JH, Zuloaga FO, Kellogg EA. A Molecular phylogeny of the grass subfamily Panicoideae (Poaceae) shows multiple origins of C4 photosynthesis. Am J Bot. 2001;88:1993–2012.View ArticlePubMedGoogle Scholar
- Aliscioni SS, Giussani LM, Zuloaga FO, Kellogg EA. A Molecular phylogeny of Panicum (Poaceae: Paniceae): tests of monophyly and phylogenetic placement within the Panicoideae. Am J Bot. 2003;90:796–821.View ArticlePubMedGoogle Scholar
- Washburn JD, Schnable JC, Davidse G, Pires JC. Phylogeny and photosynthesis of the grass tribe Paniceae. Amer J Bot. 2015;102:1493–505.View ArticleGoogle Scholar
- Vicentini A, Barber JC, Aliscioni SS, Giussani LM, Kellogg EA. The age of the grasses and clusters of origins of C4 photosynthesis. Glob Chang Biol. 2008;14:2963–77.View ArticleGoogle Scholar
- von Caemmerer S, Quick WP, Furbank RT. The development of C4 rice: current progress and future challenges. Science. 2012;336:1671–2.View ArticleGoogle Scholar
- Brown WV. Variations in anatomy, associations, and origins of Kranz tissue. Amer J Bot. 1975;62:395–402.View ArticleGoogle Scholar
- Hattersley PW. Characterization of C4 type leaf anatomy in grasses (Poaceae). Mesophyll: bundle sheath area ratios. Amer J Bot. 1984;53:163–79.Google Scholar
- Hattersley PW. Variations in photosynthetic pathway. In: Soderstrom TR, Hilu KW, Campbell CS, Barkworth ME, editors. Grass systematics and evolution. Washington, DC: Smithsonian Institution Press; 1987. p. 49–64.Google Scholar
- Hattersley PW, Watson L. Diversification of photosynthesis. In: Chapman GP, editor. Grass evolution and domestication. Cambridge: Cambridge University Press; 1992.Google Scholar
- Hattersley PW, Watson L. Anatomical parameters for predicting photosynthetic pathways of grass leaves: The “maximum lateral cell count” and the “maximum cells distant count”. Phytomorphol. 1975;25:325–33.Google Scholar
- Brown WV. The Kranz syndrome and its subtypes in grass systematics. Mem Torrey Bot Club. 1977;23:1–97.Google Scholar
- von Caemmerer S. Biochemical models of leaf photosynthesis. Collingwood: CSIRO Publishing; 2000.Google Scholar
- Gillon J, Yakir D. Influence of carbonic anhydrase activity in terrestrial vegetation on the 18O content of atmospheric CO2. Science. 2001;291:2584–7.View ArticlePubMedGoogle Scholar
- Edwards GE, Walker DA. C3, C4: mechanisms, and cellular and environmental regulation, of photosynthesis (1983). Oxford: Blackwell Scientific Publications; 1983.Google Scholar
- Svensson P, Blasing OE, Westhoff P. Evolution of C4 phosphoenolpyruvate carboxylase. Arch Biochem Biophys. 2003;414:180–8.View ArticlePubMedGoogle Scholar
- O’Leary M. Carbon isotope fractionation in plants. Phytochemistry. 1981;21:553–67.View ArticleGoogle Scholar
- Smith BN, Brown WV. The Kranz syndrome in the Gramineae as indicated by carbon isotopic ratios. Am J Bot. 1973;60:505–13.View ArticleGoogle Scholar
- Brown WV, Smith BN. The genus Dichanthelium (Gramineae). Bull Torrey Bot Club. 1975;102:10–3.View ArticleGoogle Scholar
- Edwards GE, Ku MSB. The biochemistry of C3-C4 intermediates. In: Hatch MD, Boardman K, editors. The Biochemistry of Plants, vol. 14. New York: Academic Press; 1987. p. 275–325.Google Scholar
- Moss DN, Krenzer EG, Brun WA. Carbon dioxide compensation points in related plant species. Science. 1969;164:187–8.View ArticlePubMedGoogle Scholar
- Gnerre S, Maccallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A. 2011;108:1513–8.View ArticlePubMedGoogle Scholar
- Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–9.View ArticlePubMedGoogle Scholar
- Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience. 2012;1:18.View ArticlePubMedPubMed CentralGoogle Scholar
- Holt C, Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics. 2011;12:491.View ArticlePubMedPubMed CentralGoogle Scholar
- Lyons E, Freeling M. How to usefully compare homologous plant genes and chromosomes as DNA sequences. Plant J. 2008;53:661–73.View ArticlePubMedGoogle Scholar
- Lyons E, Pedersen B, Kane J, Alam M, Ming R, Tang H, et al. Finding and comparing syntenic regions among Arabidopsis and the outgroups papaya, poplar, and grape: CoGe with rosids. Plant Physiol. 2008;148:1772–81.View ArticlePubMedPubMed CentralGoogle Scholar
- Lyons E, Freeling M, Kustu S, Inwood W. Using genomic sequencing for classical genetics in E. coli K12. Plos One. 2011;6:e16717.View ArticlePubMedPubMed CentralGoogle Scholar
- Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.View ArticlePubMedPubMed CentralGoogle Scholar
- Marcais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27:764–70.View ArticlePubMedPubMed CentralGoogle Scholar
- Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–5.View ArticlePubMedGoogle Scholar
- Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19:1639–45.View ArticlePubMedPubMed CentralGoogle Scholar
- Li P, Ponnala L, Gandotra N, Wang L, Si Y, Tausta SL, et al. The developmental dynamics of the maize leaf transcriptome. Nat Genet. 2010;42:1060–7.View ArticlePubMedGoogle Scholar
- Wang L, Czedik-Eysenberg A, Mertz RA, Si Y, Tohge T, Nunes-Nesi A, et al. Comparative analyses of C4 and C3 photosynthesis in developing leaves of maize and rice. Nat Biotechnol. 2014;32:1158–65.View ArticlePubMedGoogle Scholar
- Ding Z, Weissmann S, Wang M, Du B, Huang L, Wang L, et al. Identification of photosynthesis-associated C4 candidate genes through comparative leaf gradient transcriptome in multiple lineages of C3 and C4 species. Plos One. 2015;10:e0140629.View ArticlePubMedPubMed CentralGoogle Scholar
- Okabe K, Yang S, Tsuziki M, Miyachi S. Carbonic anhydrase: Its content in spinach leaves and its taxonomic diversity studied with anti-spinach leaf carbonic anhydrase antibody. Plant Sci Lett. 1984;33:145–53.View ArticleGoogle Scholar
- Wang Y, Brautigam A, Weber AP, Zhu XG. Three distinct biochemical subtypes of C4 photosynthesis? A modelling analysis. J Exp Bot. 2014;65:3567–78.View ArticlePubMedPubMed CentralGoogle Scholar
- Furbank RT. Evolution of the C4 photosynthetic mechanism: are there really three C4 acid decarboxylation types? J Exp Bot. 2011;62:3103–8.View ArticlePubMedGoogle Scholar
- Hatch MD. The C4-pathway of photosynthesis. Evidence for an intermediate pool of carbon dioxide and the identity of the donor C4-dicarboxylic acid. Biochem J. 1971;125:425–32.View ArticlePubMedPubMed CentralGoogle Scholar
- Wingler A, Walker RP, Chen Z-H, Leegood RC. Phosphoenolpyruvate carboxykinase is involved in the decarboxylation of aspartate in the bundle sheath of maize. Plant Physiol. 1999;120:539–45.View ArticlePubMedPubMed CentralGoogle Scholar
- Koteyeva NK, Voznesenskaya EV, Edwards GE. An assessment of the capacity for phosphoenolpyruvate carboxykinase to contribute to C4 photosynthesis. Plant Sci. 2015;235:70–80.View ArticlePubMedGoogle Scholar
- Aubry S, Kelly S, Kümpers BM, Smith-Unna RD, Hibberd JM. Deep evolutionary comparison of gene expression identifies parallel recruitment of trans-factors in two independent origins of C4 photosynthesis. Plos One. 2014;10:e1004365.Google Scholar
- Christin PA, Samaritani E, Petitpierre B, Salamin N, Besnard G. Evolutionary insights on C4 photosynthetic subtypes in grasses from genomics and phylogenetics. Genome Biol Evol. 2009;1:221–30.View ArticlePubMedPubMed CentralGoogle Scholar
- Christin PA, Petitpierre B, Salamin N, Buchi L, Besnard G. Evolution of C4 phosphoenolpyruvate carboxykinase in grasses, from genotype to phenotype. Mol Biol Evol. 2009;26:357–65.View ArticlePubMedGoogle Scholar
- Christin PA, Salamin N, Muasya AM, Roalson EH, Russier F, Besnard G. Evolutionary switch and genetic convergence on rbcL following the evolution of C4 photosynthesis. Mol Biol Evol. 2008;25:2361–8.View ArticlePubMedGoogle Scholar
- Studer AJ, Gandin A, Kolbe AR, Wang L, Cousins AB, Brutnell TP. A limited role for carbonic anhydrase in C4 photosynthesis as revealed by a ca1ca2 double mutant in maize. Plant Physiol. 2014;165:608–17.View ArticlePubMedPubMed CentralGoogle Scholar
- Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.View ArticlePubMedPubMed CentralGoogle Scholar
- Badger MR, Price GD. Carbonic anhydrase activity associated with the cyanobacterium Synechococcus PCC7942. Plant Physiol. 1989;89:51–60.View ArticlePubMedPubMed CentralGoogle Scholar
- von Caemmerer S, Quinn V, Hancock NC, Price GD, Furbank RT, Ludwig M. Carbonic anhydrase and C4 photosynthesis: a transgenic analysis. Plant Cell Enivron. 2004;27:697–703.View ArticleGoogle Scholar
- Cousins AB, Badger MR, von Caemmerer S. C4 photosynthetic isotope exchange in NAD-ME- and NADP-ME-type grasses. J Exp Bot. 2008;59:1695–703.View ArticlePubMedGoogle Scholar
- Jenkins CLD, Furbank RT, Hatch MD. Inorganic carbon diffusion between C4 mesophyll and bundle sheath cells - direct bundle sheath CO2 assimilation in intact leaves in the presence of an inhibitor of the C4 pathway. Plant Physiol. 1989;91:1356–63.View ArticlePubMedPubMed CentralGoogle Scholar
- Walker B, Ariza LS, Kaines S, Badger MR, Cousins AB. Temperature response of in vivo Rubisco kinetics and mesophyll conductance in Arabidopsis thaliana: comparisons to Nicotiana tabacum. Plant Cell Environ. 2013;36:2108–19.View ArticlePubMedGoogle Scholar
- Sun W, Ubierna N, Ma JY, Cousins AB. The influence of light quality on C4 photosynthesis under steady-state conditions in Zea mays and Miscanthusxgiganteus: changes in rates of photosynthesis but not the efficiency of the CO2 concentrating mechanism. Plant Cell Environ. 2012;35:982–93.View ArticlePubMedGoogle Scholar
- Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.View ArticlePubMedPubMed CentralGoogle Scholar
- Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal. 2011;17:10–2.Google Scholar
- Wu TD, Nacu S. Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics. 2010;26:873–81.View ArticlePubMedPubMed CentralGoogle Scholar
- Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Pinto H, Sharwood RE, Tissue DT, Ghannoum O. Photosynthesis of C3, C3-C4, and C4 grasses at glacial CO2. J Exp Bot. 2014;65:3669–81.View ArticlePubMedPubMed CentralGoogle Scholar