Intraspecific variation of recombination rate in maize
Genome Biology volume 14, Article number: R103 (2013)
In sexually reproducing organisms, meiotic crossovers ensure the proper segregation of chromosomes and contribute to genetic diversity by shuffling allelic combinations. Such genetic reassortment is exploited in breeding to combine favorable alleles, and in genetic research to identify genetic factors underlying traits of interest via linkage or association-based approaches. Crossover numbers and distributions along chromosomes vary between species, but little is known about their intraspecies variation.
Here, we report on the variation of recombination rates between 22 European maize inbred lines that belong to the Dent and Flint gene pools. We genotype 23 doubled-haploid populations derived from crosses between these lines with a 50 k-SNP array and construct high-density genetic maps, showing good correspondence with the maize B73 genome sequence assembly. By aligning each genetic map to the B73 sequence, we obtain the recombination rates along chromosomes specific to each population. We identify significant differences in recombination rates at the genome-wide, chromosome, and intrachromosomal levels between populations, as well as significant variation for genome-wide recombination rates among maize lines. Crossover interference analysis using a two-pathway modeling framework reveals a negative association between recombination rate and interference strength.
To our knowledge, the present work provides the most comprehensive study on intraspecific variation of recombination rates and crossover interference strength in eukaryotes. Differences found in recombination rates will allow for selection of high or low recombining lines in crossing programs. Our methodology should pave the way for precise identification of genes controlling recombination rates in maize and other organisms.
In sexually reproducing organisms, crossovers (COs) stabilize the pairing of homologous chromosomes during meiosis and ensure their correct segregation. By reciprocal exchange of parental genetic material, the COs also lead to new allelic combinations and thus play an important role in creating genetic diversity. Meiotic recombination occurs during prophase I of meiosis, when DNA double-strand breaks (DSBs) catalyzed by the topo-isomerase-related enzyme SPO11  are repaired via reciprocal exchange of genetic material between homologous chromosomes. The final number of recombination events depends on (1) the number of DSBs, and (2) the proportion of DSBs that are repaired as COs. The remaining DSBs may be repaired via other pathways leading to small conversions called non-crossovers, or may even be repaired using the sister chromatid instead of the homologous chromosome . In organisms such as mice, humans, or plants, the number of DSBs is at least 10 times higher than the number of COs [3–5]. The number of COs can vary under the control of genetic factors  that affect recombination rate, but there is also some homeostasis of the CO number that modulates the CO/DSB ratio . CO interference is a phenomenon observed in almost all organisms, by which two successive COs on a chromosome are rarely very close to each other [7, 8]. The standard picture considers that close to a DSB that is repaired as a CO, other DSBs are preferentially repaired as non-crossovers [9–11], so interference may play a role in the regulation of CO numbers and distributions. Two distinct pathways of CO formation have been found to coexist in most species investigated, including Saccharomyces cerevisiae[12, 13], Solanum lycopersicum, Arabidopsis thaliana, and Mus musculus. In plants, the majority of COs are formed via pathway 1, which depends on genes of the ZMM family (hereafter referred to as P1), and which are subject to interference [11, 14, 17]. The remaining COs are formed via pathway 2 (hereafter referred to as P2), which depends on the Mus81 gene, and in which there is little or no interference .
The meiotic recombination rate is known to vary among species [19, 20]. In mammals, some intraspecific variation of local recombination rate has been revealed by sperm typing and has been explained by cis and trans genetic factors. In particular, the allelic diversity in the zinc finger domain of PRDM9, a DNA-binding protein, has been shown to influence recombination hot-spot activity [21, 22]. In plants no PRDM9 homologs have been identified so far and little is known about the variation of the genome-wide recombination rates (GWRRs) within species. Most data come from chiasmata counts or linkage mapping experiments [20, 23]. However, to our knowledge, there are hardly any cases in plants where many crosses involving distantly related parents have been used to compare recombination rates and infer the patterns of recombination along chromosomes based on high-density linkage maps. The same holds for CO interference: even though it plays an important role in determining recombination rate, intraspecific variation of interference has rarely been investigated in plants. Understanding the landscape of recombination within a species is of intrinsic interest, and it is also important in the context of genome-based prediction  and genome-wide association studies , since recombination determines the extent of linkage disequilibrium in populations under study. The extent of linkage disequilibrium has an impact on the linkage phase between predictive markers and quantitative trait loci (QTL) and on the marker densities required to find significant associations between markers and traits of interest.
Maize (Zea mays L.) has been the subject of genetic studies for more than a hundred years . It is a diploid species (n = 10) with ancient polyploid origin . The first genetic linkage map in maize based on DNA markers was constructed using restriction fragment length polymorphism markers . Since then, many more genetic maps have been published based on restriction fragment length polymorphism, amplified fragment length polymorphism, or simple sequence repeat markers [29, 30]. With increasing amounts of maize sequence information, single nucleotide polymorphism (SNP) markers were developed and used for diversity analysis and genetic mapping in maize [31, 32]. Recently, the Illumina® MaizeSNP50 genotyping array comprising around 50,000 SNPs was developed and the first high-density genetic maps of maize were published using two intermated recombinant inbred line populations . These maps comprised up to 21,000 SNP markers and were compared with the maize genome reference sequence generated from the US inbred line B73, which is the largest and most complex plant genome sequenced so far . The B73 AGP v2 assembly covers 2.07 Gb of the approximately 2.5 Gb maize genome.
For studying intraspecific variation of recombination rate in maize, we created two half-sib panels, one for Dent and one for Flint inbred lines. Dent and Flint are two maize gene pools that are important for hybrid breeding in Europe . The two half-sib panels comprised a total of 24 full-sib families. The Dent panel comprises 10 crosses of a central Dent inbred line (F353) with diverse Dent founder inbreds of the European Dent gene pool that were developed in Europe and North America. In the Flint panel a central Flint inbred line (UH007) was crossed to 11 genetically diverse Flint founder inbreds of the European Flint breeding material. The Flint founder lines originate from early maize introductions to Spain as well as from so-called Northern Flints from France and Germany. In addition, two reciprocal crosses between the two central Dent and Flint lines were analyzed and in each panel one cross with the US Dent line B73 was included as a connection to the US nested association mapping (NAM) population .
Here, we report on the analysis of the variation of recombination rate in 23 populations of maize using SNPs from the MaizeSNP50 BeadChip . All populations consist of doubled haploid (DH) lines obtained by in vivo haploid induction . DH lines produced with this method reflect female meiosis. Unlike F2 or RILs, DH lines have the great advantage that the genetic information of each gamete is directly observed. Our objectives were to analyze intraspecific variation for (1) GWRR and chromosome-wide recombination rate, (2) recombination landscape along chromosomes, and (3) CO interference, using high-density genetic linkage maps. We found significant differences of GWRR between individual populations as well as between Dent × Dent versus Flint × Flint populations, but not between the Dent and Flint gene pools in general. GWRR was not correlated with the genetic structure of the lines as inferred from admixture analysis. We analyzed the recombination landscapes in all populations and found significant differences between individual populations and between pools. Finally, we characterized quantitatively the interfering and non-interfering CO formation pathways and found a negative correlation over all chromosomes between interference intensity of pathway P1 and GWRR.
We constructed two half-sib panels comprising a total of 24 maize full-sib families of DH lines for the analysis of intraspecific variation of recombination rates. The full-sib families within the two panels represent the diversity of important founder lines of the European Dent and Flint germplasm, respectively (Table 1; Additional file 1). In the Dent panel (prefix CFD), all full-sib families have the same common parental line F353 which was crossed to diverse Dent founder lines. The common parent of the Flint full-sib families (prefix CFF) is line UH007, which was crossed to diverse Flint founder lines. In all crosses, the haploid inducer line was used as male parent when crossed with the F1 plants for in vivo haploid induction, so our maps reflect only female meioses. Details on the diversity analysis of the parental lines are given in Additional file 2.
High-density genetic map construction in maize full-sib families
As a first step in the analysis of recombination, individual genetic maps were constructed for 23 out of the 24 populations and each of the 10 chromosomes. Map statistics are given in Table 1 with more details in Additional file 3, and the complete list of marker positions in all maps with the raw segregation data is given in Additional file 4. In total, 39,439 SNPs of the MaizeSNP50 array could be mapped in at least one population. Over all populations, around 33,884 COs were observed in 2,233 female meioses and, on average, 11,988 SNPs were mapped in one given population. The average for Dent × Dent populations (all CFD except CFD01) was 13,247 markers mapped and for Flint × Flint populations (all CFF except CFF01 and CFF02) the average was 9,874. For population CFF05, which comprised only 34 DHs, no stable map could be obtained. The total length of genetic maps ranged from 1,180 centiMorgan (cM) in population CFD05 to 1,893 cM in CFF03 with a mean of 1,508 ± 185 cM (mean ± standard deviation). The average genetic map length in the 10 Dent × Dent populations (CFD02 to CFD12) was 1,353 ± 99 cM, while the 11 genetic maps from Flint × Flint populations (CFF03 to CFF15) were, on average, longer at 1,645 ± 154 cM. The three maps from crosses between Dent and Flint parents (CFD01, CFF01, CFF02) had an intermediate average length of 1,570 ± 85 cM. The largest gap in a genetic map was 53.7 cM on the long arm of chromosome 7 (7L) in population CFF13, where no SNPs were polymorphic between the parents, most likely due to identity-by-descent (IBD). Details about genome-wide diversity and IBD segments between parents of the mapping populations can be found in Additional file 2. Synteny and colinearity of the genetic maps was compared with the B73 AGPv2 genome assembly. In general, a high agreement was found, demonstrating the high consistency of marker order across populations, with few exceptions as described in Additional file 2.
We observed distorted segregation in all populations. Deviations from the expected allele frequencies of 0.5 are displayed in Additional file 5. All 10 chromosomes carried regions with distorted segregation. Populations and chromosomes differed in their patterns of distortion. Only in some rare cases shared features were visible - for example, in the Flint (CFF) populations, chromosome 2 tends to be distorted towards the common parent allele in the centromeric regions and towards the founder line alleles in the distal regions of the long arm. In none of the regions where known gametophytic (ga) genes are located (chromosome 3L, ga7; chromosome 4S, ga1; chromosome 5L, ga2; chromosome 9S, ga8; data from ) was a consistent segregation towards either central or founder line allele across the full-sib families observed.
Intraspecific variation for genome-wide and chromosome-specific recombination rates in maize
The average GWRR can be expressed as the ratio of total genetic map length in centiMorgans divided by the genome size in megabase pairs. The genome size of maize is approximately 2.5 Gb, of which 2.1 Gb are sequenced and assembled in the B73 AGPv2 reference sequence. We used the 2.1 Gb B73 AGPv2 assembly as the basis of our calculations. Similarly, the average recombination rate of one chromosome is the ratio between the genetic map length in centiMorgans and the physical length in megabase pairs of this chromosome. The physical lengths of the 10 maize chromosomes in the B73 AGPv2 assembly are between 301 Mbp for chromosome 1 and 150 Mbp for chromosome 10. Genetic map lengths were measured for each population based on the same physical positions of extremities, correcting for particular regions (for example, IBD segments) where mapping information was not available or not reliable. The average GWRR over all populations was 0.74 ± 0.09 cM/Mbp (mean ± standard deviation; Table 1). Figure 1 (details in Additional file 3) shows that the GWRR and chromosome-wide recombination rate varied among chromosomes and among populations. In particular, chromosome 9 recombined the most and chromosome 4 the least. Statistical tests for pairwise comparisons of average recombination rates between chromosomes and between populations confirm these observations (Additional file 6).
The differences in GWRR between Flint × Flint populations and Dent × Dent populations were much larger than the intrinsic statistical uncertainties, leading us to consider that there is a genetic source of the variability observed for GWRR. To investigate to what extent some alleles more frequent in one pool than in the other may explain the diversity of GWRR, we first estimated for each parent its degree of 'flintness', based on admixture analysis performed on a large panel of lines including Dent and Flint accessions . We then analyzed the correlation between the average 'flintness' of the two parents of a cross and the GWRR of the resulting population. The results are shown in Figure 2, with the probability of belonging to the Dent and the Flint groups for each parental line (Figure 2A) and the highly significant correlation (P value = 4.6 × 10-5) observed between the GWRR in a population and the average 'flintness' of its parents (Figure 2B). This population structure effect explained 55% of the variance of the GWRR.
Nevertheless, we observed large differences in the GWRR of the two crosses of both central lines with B73 (CFD02 and CFF02, respectively). The GWRR in CFD02 was 0.642 cM/Mbp, whereas in CFF02 the GWRR was 0.811 cM/Mbp. We thus asked whether the significantly higher GWRR in the pooled Flint × Flint populations might follow from differences in the central parents F353 and UH007. To test this, we took an additive model whereby the genetic length of a genetic map is the average of effects contributed by each of its parental lines. These parent-specific effects were determined using the fact that the line B73 was crossed to both central lines (F353 in CFD02, and UH007 in CFF02), thus connecting the two half-sib panels. Then we tested for a correlation between founder line effects on GWRR and their 'flintness'. As shown in Figure 2C, no significant correlation was found (P value 0.45), indicating that in the founder lines studied here, no general difference in GWRR can be observed between Dent and Flint inbred lines. We conclude that the broad range of variation observed in Figure 2B originates from (1) a strong difference in GWRR between the two central lines F353 and UH007, and (2) additional variation between the different founder lines, as observed along the y-axis of Figure 2C.
Intraspecific differences in recombination landscapes
In addition to GWRR, we analyzed the patterns of recombination rate along the chromosomes, hereafter referred to as recombination landscapes, and compared them for each pairwise combination of linkage maps and between Dent and Flint pools. We wanted to test whether there were intraspecific differences for recombination landscapes given the large intraspecific diversity in maize and the fact that we observe a clear differentiation between lines in genetic diversity analyses (see Additional file 2 for details) and GWRR. To compare recombination landscapes, genetic positions of the markers were plotted against their physical positions to obtain Marey maps. Most such maps showed a rather smooth and monotonic pattern (for example, CFD01 chromosome 1 in Additional file 7). However, in some maps, there were large regions void of polymorphic markers (for example, CFF07 chromosome 1 in Additional file 7), or the map yielded two parallel horizontal lines of markers (for example, CFF01 chromosome 3 in Additional file 7). Such parallel lines could be explained by duplicated chromosomal segments, possible errors in the B73 assembly, and/or non-copy-specific SNP markers that would detect paralogous sequences. In such cases, the missing information was imputed from all other maps. After smoothing and forcing monotonicity (Additional file 8), the derivative was calculated to indicate the local value of recombination rate. The shape of the recombination landscape varied from chromosome to chromosome, and for a given chromosome this shape varied among the populations. This is illustrated in Figure 3 for six populations representing the extremes and the median GWRR in each pool for chromosomes 2 and 6. All chromosomes are shown in Additional file 9. On all chromosomes, recombination was close to zero around the centromeres (compare the almost horizontal curves in the Marey maps and minima in the curves of recombination rates) and increased towards the telomeres. Other structural elements in the maize genome also lead to locally reduced recombination rates, such as the nucleolus organizer region (NOR) on chromosome 6 (Figure 3; Additional file 9). Up to 34 heterochromatic knobs have been described in different maize accessions; however, for only a few of those is the physical position available and knob size can vary across lines . Although no cytological data about knob positions and size variation in our lines are available, some variation can be expected. In several cases recombination rates were reduced in one or more populations around the known knob positions - for example, on chromosomes 1L, 2L, 3L, 6L, 7L and 9L (Figure 3; Additional file 9) - but in some other cases, knobs were in highly recombining regions.
To compare the shapes of the recombination landscapes independently of the chromosome-wide rate of recombination, we normalized the local rates by the chromosome-wide rate. Using these normalized data, statistical tests based on 10 bins of equal genetic length per chromosome revealed significant differences in the shape of the recombination landscapes. In a population-wise comparison considering all chromosomes of a population together (Additional file 10), four populations, CFF03, CFF07, CFF12, and CFF13, clearly showed recombination landscapes significantly different from those of most other populations. Furthermore, when looking at individual chromosomes (Additional file 10), some populations, such as CFD11 for chromosome 1, CFD04 for chromosome 4, or CFF08 for chromosome 9, showed significantly different patterns in their recombination landscapes. In addition, COs were pooled across all Flint × Flint and across all Dent × Dent populations to compare the recombination landscapes between pooled Flint and Dent populations. For chromosomes 2, 4, 5, and 6, there were highly significant differences between recombination landscapes in the Dent and in the Flint pools (Bonferroni-corrected P value < 0.01). Significant local differences in the shape of the curves between the pooled Dent and the pooled Flint populations were found, for example, in the centromeric bin 5 and in bin 8 on chromosome 2 or in bin 6 on chromosome 6 (Additional file 11).
Finally, to reveal a possible correlation between local recombination rates and the pairwise genetic similarity of parental genomes, we used a sliding window of 10 Mbp within which we calculated a similarity index as the fraction of shared SNP alleles between the two parents. Overall, no consistent relationship was observed between local parental genome similarity and recombination rate. Peaks of recombination rates were associated with regions of high and low genetic similarity (Figure 3, Additional file 9). In all centromeric regions, recombination rates were very low, but values for pairwise parental similarity were highly variable.
Crossover interference analysis across populations
Crossover interference leads to a more regular spacing between adjacent COs than in the absence of interference, where COs are positioned randomly. Interference also affects the distribution of CO numbers per gamete by reducing its variance. In our data, the observed distributions for CO numbers and inter-CO distances departed from the ones expected without interference, allowing us to reject the hypothesis that there is no interference. We also characterized interference quantitatively by estimating the two parameters of the Gamma-sprinkling model : (1) the intensity nu of interference in the interfering pathway, hereafter referred to as P1, and (2) the proportion p of COs formed through the non-interfering pathway, hereafter referred to as P2. The parameters nu and p were estimated for each chromosome and each population. These parameters greatly varied among chromosomes and among populations, with values of nu ranging from 2 to 50, and p ranging from 0 to 0.4. Since the population sizes are modest, part of this variation is expected to be due to statistical noise. We thus focused on comparing pooled data, either over chromosomes or over populations.
Figure 4A shows the values estimated for nu and p when pooling all chromosomes together for each population and for the pooled Dent × Dent and Flint × Flint populations. Confidence intervals for estimates of nu and p from individual populations were large as expected given the population sizes (data not shown). Statistical tests showed that differences between individual populations were not significant at the 5% threshold level - hardly any pairs of individual populations had significantly different values for nu (Additional file 12). For the pairwise comparisons of populations pooling all chromosomes, only CFF06 and CFF13 had significantly lower values for nu than four or six other populations, respectively. For p, however (Additional file 13), its value was found to be significantly higher in CFF02 than in almost all other populations.
Figure 4B presents the values for nu and p for each chromosome estimated from the Dent × Dent and Flint × Flint pools. Dent × Dent populations tended to have both higher nu and higher p than Flint × Flint populations for most chromosomes. On the other hand, chromosomes 3, 4, and 7 showed a reversed pattern, while for chromosome 10 the Dent × Dent pool had higher nu but lower p. Pooling all chromosomes together, the interference strength nu observed across our populations ranged approximately between 2 and 8. To provide a qualitative understanding of the meaning behind these values, one can ask how much such interference levels reduce the probability of having a second CO near a first one in the same meiosis for COs formed within the interfering pathway P1. We found that the probability of having a second P1 CO at 40 cM from a first one is reduced by a factor between 1.8 (for nu = 2) and 20 (for nu = 8). At 10 cM from the first CO, this reduction factor ranges between 5 (for nu = 2) and 46,000 (for nu = 8). For nu > 3 there is almost no chance to find two P1 COs separated by less than 10 cM. Interference intensity in P1 (nu) was significantly negatively correlated with GWRR for chromosome 7. For other chromosomes, this correlation was not significant, but it was when pooling all chromosomes together (Additional file 14). On the other hand, no correlation was found between GWRR and the proportion p of non-interfering P2 COs, irrespective of the chromosome and pooled over all chromosomes (P values always > 0.18).
Recombination and reassortment of parental genomes lead to an increase of genetic variation. Understanding the mechanisms regulating recombination rate during meiosis would allow their manipulation to increase or decrease recombination rates according to specific requirements described in . Although many genes controlling the basic process of CO formation have been identified, little is known about factors influencing CO numbers and distribution, and GWRR. To our knowledge the present work is the most comprehensive study comparing meiotic recombination rates and CO interference within one species and between gene pools of the same species. We used two panels of maize half-sib families comprising 23 full-sib populations with a total of 2,233 DH lines to analyze intraspecific variation of recombination rates and recombination landscapes. The parents of the two panels for Dent and Flint maize were chosen to represent the diversity present in European maize germplasm. We analyzed DH lines produced by in vivo haploid induction, which reflect a single female meiosis. As expected, the average number of crossovers per DH line in our study (15.1) was about half the number observed in maize RILs (28.9) . This drawback of DH lines is counterbalanced by the faster development of the DH populations compared to RILs and by the complete homozygosity of DH lines, which are an immortal resource. Moreover, working on DH populations offers the unique possibility to analyze CO interference, while this is impossible in RILs because the successive independent meioses superpose the COs arising during each meiosis. The design of our connected half-sib panels comprising a large number of populations allowed for comparisons of GWRR, recombination landscape, and interference (1) across parents within each panel and (2) across panels via crosses of the central lines with line B73.
As a prerequisite for our approach, we constructed high-density genetic maps for a large number of populations. Genetic map lengths of the 23 populations varied from 1,180 cM to 1,893 cM with a mean of 1,508 cM, which is in the range observed in other high-density genetic maps of maize [32, 33]. Due to IBD regions in some of our populations, gaps were observed in the genetic maps. However, since most of these gaps were in pericentromeric regions, where recombination rates are low, most of them were not larger than 15 cM, the most extreme gap (53.7 cM) being on chromosome 7 in CFF13. For 76 markers, we found chromosome assignments different from their annotation in the B73 AGPv2. We provided genetic coordinates for 118 markers where no annotation was available. These results may help to improve future B73 genome assemblies. In addition, genetic maps for several chromosomal regions were found non-colinear with the B73 sequence, as was previously detected from two other maize populations . These discrepancies may be the consequence of mis-assemblies in the B73 genome, or due to lack of locus specificity for some SNP markers that would fall into duplicated genomic regions. The hypothesis of structural rearrangements between some of the parental lines and B73 cannot be excluded. However, given the design of the experiment, the possibility that both parents of a cross within one of the half-sib panels share a structural variation absent from both parents of another cross in the same panel is very limited.
Intraspecific variation of recombination rate
The average GWRR in our populations was 0.73 cM/Mbp, which is in the same range as can be calculated from other maize maps . This value is about 5-fold lower than in A. thaliana (3.6 cM/Mb)  which has a 20-fold smaller genome than maize, reflecting the well-known negative correlation of recombination rate with genome size among species . We found clear differences between chromsome-wide recombination rates, with chromosome 4 having the lowest average value (0.60 cM/Mbp) and chromosome 9 the highest (0.88 cM/Mb). We also observed a negative correlation between recombination rate and the physical length of the chromosomes (r = 0.66, P value 0.003), similar to what arises in human, mouse and rat . Such correlations suggest that the mechanisms regulating CO formation in these organisms tend to enforce some level of homeostasis in the number of COs per meiosis and per chromosome. For very short chromosomes, the obligatory CO ensures that there is at least one CO for each bivalent. We observed clear intraspecific variation for GWRR in this study. Similar levels of variation for GWRR were observed in both pools once the effect of the central line was removed.
Calculating GWRR assumes constant genome size in maize. Genome size differences were reported in maize, with temperate maize lines having up to 10% smaller genomes relative to B73 and these differences were correlated with the number of knob repeats . Genome sizes for the lines in our study are unknown; however, the up to 60% difference in map lengths (1,180 to 1,893 cM) in our study is much larger than the genome size differences recently reported . Therefore, it can be assumed that beyond possible genome size variation genetic factors influencing GWRR play a strong role in our plant material. Differences in recombination frequencies between maize inbred lines have been described based on genetic linkage maps and by using cytological methods to detect recombination nodules and to calculate CO rates [3, 43]. Trans-acting QTL affecting GWRR were identified in A. thaliana, maize, mouse, and wheat . In cattle, several QTL were mapped for male GWRR . For two QTL, putative causal variants were detected in the genes REC8 (a member of the kleisin family of 'structural maintenance of chromosome' proteins), and RNF212, a putative homolog of the yeast ZIP3 gene, which is involved in meiotic recombination. RNF212 is also known to be associated with genome-wide recombination in humans . Our findings on different GWRR between individual maize lines pave the way for development of specific crosses to identify genetic factors influencing GWRR by QTL mapping . Given the advances in high-throughput genotyping and genome sequencing, such QTL can be a starting point for fine-mapping and subsequent cloning of genes determining GWRR in plants. Characterizing the structural and functional variation of such genes would promote our understanding of the molecular mechanisms regulating meiotic recombination in plants and would be a highly valuable tool for plant breeders and geneticists.
Recombination landscapes in maize
Not only GWRR but also the landscapes of recombination along chromosomes are highly variable in our populations. We observed characteristic shapes of Marey map curves for each of the 10 maize chromosomes. The overall recombination profiles for each chromosome tend to follow gene density  but this does not explain local differences in recombination rate between populations. Structural variation between parental lines may be one mechanism influencing local recombination rates, as shown for a 26 kb retrotransposon cluster that reduced local recombination rate around the bz1 locus by a factor two . For the same genomic region it was shown that haplotype structure, as defined by the presence of helitrons and retrotransposons, strongly affected the occurrence of recombination events in heterozygous plants . The extensive structural variation in the maize genome can be seen already at the karyotype level, where large-scale variation was reported , but even more at the sequence level, where large variation was observed for repetitive element content, presence-absence or copy number variants [42, 51, 52]. Such structural variations may influence the pairing of homologs and recombination , although inverted regions may also pair normally in pachytene . Apart from the low recombination rates in the heterochromatic pericentromeric and NOR regions and a general increase of recombination rates towards the telomeres, we observed kinks in our Marey map curves in regions where heterochromatic knobs have been mapped in maize. This is the case, for example, on chromosome 4L in all populations, but only in some populations on chromosome 1L, suggesting that variation in knob regions may exist in our lines. Although 34 distinct knob regions were described in maize and its wild progenitor teosinte, most maize lines contain fewer than 12 such knobs, for which in addition polymorphisms are observed between lines [38, 55]. Due to a lack of data on knob positions in our parental lines, we could not examine the influence of knob polymorphism on the shape of the recombination landscapes for individual chromosomes more closely. Since knobs are often located in gene-dense regions and suppress local recombination  it is likely that some differences in the shapes of the recombination landscape are caused by knob polymorphism. Also outside putative knob regions, we observed many significant local differences in the pairwise comparison of recombination profiles between populations and for chromosomes 2, 4, 5, and 6 between the pooled Dent and Flint populations. We found no significant correlation between parental genetic similarity as determined by SNP markers and recombination rate, a result corroborated by a recent study in A. thaliana. Thus, factors influencing local recombination rates other than gene density and similarity at the DNA level must exist. It must be stated though, that for characterizing the influence of genomic features such as nucleotide diversity on local recombination rate, the 10 Mbp scale may be too coarse, so much higher resolution at the kilobase scale might be required, as recently shown in the model plant Medicago truncatula. In addition, the parental diversity as assessed by the mainly genic SNPs of the MaizeSNP50 array may not well reflect the structural differences that can have a major impact on local recombination rates . Our study has identified genomic regions with large differences in local recombination rates between inbred lines. This is an important basis for future studies to identify recombination hotspots and to study the influence of structural variation and genome diversity in defined crosses and genomic regions in maize.
Interference was previously shown to occur in maize , based on numbers and positions of late recombination nodules. That work found two pathways to be operating in maize, one interfering (P1) and the other (P2) contributing a proportion p of non-interfering crossovers. In the present study, we also found in most cases values of p significantly different from zero, with values averaging 0.1. This conclusion is compatible with the previous estimations that reported an average value of 0.15 . The populations where we found p = 0 (CFF04, CFF06, CFF13) may mostly reflect a low power due to limited population sizes: here, individual populations have between 50 and 134 DH lines, whereas in , the data set had more than 200 pachytene synaptonemal complexes (SCs), each of them giving about four times more power to the analysis than one DH line. It should be noticed, however, that our statistical tests were very conservative due to Bonferroni correction. Still, we found that CFF02 had a proportion p of non-interfering (P2) crossovers significantly higher than almost all other populations when considering all chromosomes pooled. To our knowledge, differences in interference features between different genotypes of the same species have not been shown so far. Values of p between 0 and 0.2 for different chromosomes were reported in A. thaliana, and between 0 and 0.21 in humans . Based on comparisons between MLH1 foci and late recombination nodules, p values around 0.3 were found in tomato . Considering the intensity nu of interference in the interfering pathway P1, our results indicated values ranging between 2.5 and 8 for all chromosomes pooled, which is similar to the range 4 to 10 found previously in maize . In Arabidopsis, nu was in the 10 to 21 range , whereas nu was estimated to be in the 6.9 to 7.9 range in tomato . Finally, based on the 23 full-sib populations, we found a significant negative correlation between the chromosome-pooled nu and the genome-wide recombination rate. This result is consistent with the hypothesis that interference may be one of the mechanisms at work to regulate the level of meiotic recombination, biasing the repair of DSBs towards non-COs rather than COs. Compared with the highly significant differences found for GWRR in our study, the variation for CO interference is detected here with much less statistical power. In future works with higher population sizes, providing smaller confidence intervals, but using the same half-sib design it should be possible to estimate the effects of the founder parents alone on interference parameters by removing the effects of the central lines, as we did for GWRR. Similarly as for GWRR, this should also enable the identification and localization of genetic factors influencing interference parameters.
Impact of variation in recombination rates on genetic studies and applied breeding programs
Covering the whole genome using dense genetic maps increases the chance to detect marker-trait associations, both in linkage analyses and association mapping. The construction of high-density genetic maps is now feasible for many crop species in a very cost-efficient way, either by SNP genotyping arrays or through genotyping-by-sequencing approaches [33, 58]. For precise estimation of QTL effects in genome-wide association studies and high accuracy genome-wide prediction of breeding values, it is a prerequisite that markers tag either the causal alleles or they must be in high linkage disequilibrium with the QTL of interest [24, 59]. The linkage phase between marker and favorable QTL alleles is crucial when predicting breeding values across breeds or gene pools . Recombination events may invert linkage phases of marker and QTL alleles between unrelated pools, and thus lead to reduced accuracies in prediction of breeding values and marker effects. In the context of genomic prediction or genome-wide association studies, understanding the landscape of recombination within a species is of particular interest, since regions with high recombination rates require higher marker densities. In addition, a detailed genome-wide and local picture of recombination rates permits adequate dimensioning of map-based cloning projects, marker-assisted selection strategies for specific traits, and crossing programs in cases where unfavorable linkage between traits needs to be broken. Recombination is a key factor determining the success with introgression of new variability from distantly related plant genetic resources or poorly adapted material, since introgression of new alleles or traits such as resistance genes in recurrent selection is often accompanied by undesired linkage drag. These effects of linkage drag may be drastic if the regions of interest are located in (peri)centromeric, recombination-poor regions. Choosing elite lines with high GWRR as recurrent backcross parents may help to speed up the introgression process. Finally, identifying genotypes carrying alleles for higher recombination rate may guide the choice of adequate parents for optimizing the number of generations required in breeding schemes. Variability in interference strength may also be of interest in breeding programs because interference is believed to mechanistically affect recombination rates. Just as for the selection of lines with higher or lower recombination rates , it should be feasible to develop maize lines with different levels of CO interference or proportion of P2 COs.
Meiosis and recombination fundamentally influence genome structure and evolution of species. Although key components of the meiotic pathway are known in plants, further research is needed to better understand the interplay of genetic factors controlling recombination, the role and mechanism of CO interference and the influence of chromatin features such as methylation and structural variation on recombination in plants. Recent advances in high-throughput genotyping and next-generation sequencing offer new opportunities to analyze at a large scale the genome-wide distribution of COs and to characterize sequence motifs around CO hotspots. The present study revealed large intraspecific variation for GWRR and recombination landscapes in maize. Our findings pave the way for the selection of proper crossing parents to analyze genetic factors influencing GWRR and local recombination rates. Finally, using appropriate experimental designs, QTL or association studies will enable the identification of QTL for GWRR.
Materials and methods
Two half-sib panels of 11 and 13 half-sib families were established in the Plant-KBBE project CornFed, one for European Dent and one for European Flint maize. The lines used in this study, their origins, and assignment to Dent or Flint pools are listed in Additional file 1. Each of the two panels consists of a central line (or common parent) that was crossed to founder lines that represent important and diverse breeding lines of the European maize germplasm. The central line (F353) of the Dent panel was crossed with 10 Dent founder lines. For the Dent panel the prefix for populations is CFD. In the Flint panel, the central line UH007 was crossed with 11 Flint founder lines and the prefix for populations of this panel is CFF. In addition, each of the founder lines was crossed with B73 and also the reciprocal populations F353 × UH007 and UH007 × F353 were generated. These additional populations were made to connect the two panels with each other and with the US NAM population  via the parental line B73. The crossing scheme of the two half-sib panels and their connection is shown in Additional file 15. All progenies were homozygous DH lines obtained from F1 plants. The resulting 24 DH populations consisted of 35 to 129 lines (Table 1). In total, 2,267 DH lines were used for analysis in this study.
Crosses between central and founder lines were made by hand-pollinations using F353 or UH007 as female lines and the founder lines as males. F1 plants were pollinated with an inducer line for in vivo haploid production followed by chromosome doubling and selfing of D0 plants, and subsequent multiplication to obtain D1 plants . Atypical lines within a cross or atypical plants within rows of DH lines were eliminated based on phenotypic observations.
Bulk samples of dried leaves or kernels from up to eight D1 plants derived from the same D0, were used for DNA extraction using the cetyl trimethylammonium bromide (CTAB) procedure. DNA samples were adjusted to 50 to 70 ng/μl and 200 ng per sample were used for genotyping. DH line purity and integrity was first checked using a custom 96plex VeraCode assay (Illumina®, San Diego, CA, USA) with genome-wide SNP markers to ensure that the lines carried only one of the parental alleles at each SNP, that they did not carry alleles of the inducer line and that they were derived from true F1 plants. For a subset of DH lines, 13 proprietary SNP markers assayed with the KASP™ technology (LGC Genomics, Berlin, Germany) were used for testing line purity and integrity. True DH lines were then used for genotyping with the Illumina® MaizeSNP50 BeadChip  on an Illumina® iScan platform. Array hybridization and raw data processing were performed according to manufacturer’s instructions (Illumina®). Raw data were analyzed in Illumina®’s Genome Studio software version v2011 (Illumina®) using an improved version of the public cluster file (MaizeSNP50_B.egt, ). SNP data were filtered based on the GTscore using a threshold of 0.7. Heterozygous SNPs were set to missing values (NA) and only markers with a minor allele frequency >0.1 per population were used for mapping. For each population, the allele of the central line was coded as the 'A' allele, and the allele of the founder line was coded as 'B' allele (Additional file 4). Raw genotyping data of parents and DH lines are available at NCBI Gene Expression Omnibus as dataset GSE50558 .
Analysis of parental genetic diversity
Genetic diversity between parental lines was assessed with genome-wide SNP markers by principal coordinate analysis, cluster analysis, and by a pairwise genome scan for polymorphism between the parents of each population. For details, see Additional file 8.
Genetic map construction
Genetic maps were constructed for each individual population as described earlier  using CarthaGene  called from custom R scripts. In the first step, statistically robust scaffold maps were constructed with marker distances of at least 10 cM. In a second step, marker density was increased to produce framework maps containing as many markers as possible, while keeping a LOD score >3.0 for the robustness of marker orders. Finally, the complete maps were obtained by placement of additional markers using bin-mapping . CentiMorgan (cM) distances were calculated using Haldane’s mapping function . Individual genetic maps and genotypic data used for construction of the maps (Additional file 4) were deposited at MaizeGDB under the project acronym CORNFED .
Physical map coordinates of SNPs
Chromosome and position assignments of SNPs of the MaizeSNP50 BeadChip supplied by the manufacturer (Illumina®, San Diego, CA, USA), are based on the B73 AGPv1 assembly with many markers lacking a chromosome and/or position information. We therefore performed a new mapping of the SNPs on the B73 AGPv2 assembly  using BWA . The new assignments were used for all analyses involving the physical mapping information. Assignments are available in Additional file 4.
Construction of bare and masked Marey maps
Given a chromosome and the associated genetic map of an individual population, we determined the marker positions on the B73 assembly. From these physical and genetic positions, we constructed a first Marey map  containing all syntenic markers. This Marey map was smoothed using cubic spline interpolations , producing a 'bare' Marey map that was forced to be monotonic. Then regions where mapping information was lacking (for example, segments IBD in the parents) were masked, producing 'masked' Marey maps (Additional file 9). The detailed procedure is explained in Additional file 8.
Once a bare Marey map was constructed, we defined the recombination landscape function as its derivative. Since the bare map was monotonic by construction, the recombination rates were positive as they should be. In effect, this landscape function provided the local recombination rate (in cM per Mbp) for any physical position of the B73 assembly. Note that this procedure did not distinguish the regions where these recombination rates were estimated reliably from those where they were not (unmasked versus masked regions). For comparison tests, it was thus necessary to resort to imputation, that is, to infer missing data from other maps in a conservative way.
Imputed Marey maps for comparison tests
To compare the genetic lengths and the recombination landscapes between two different populations or pools of populations, we used 'imputed' Marey maps where the information missing in the masked Marey map of either population was replaced by the information available in the other population. If a region had masked data in both populations, its content was imputed using the averaged data of all other populations. In all cases, the imputation procedure was designed so that missing or unreliable data in either population to be compared never induced artificial differences. The detailed procedure used for imputation is explained in Additional file 8.
Comparing genetic map lengths
For any given population, mapping data led to an estimate of the genetic length for a given chromosome. We examined pairwise differences in genetic length between populations as well as differences between pools of populations and tested them for their level of statistical significance. Such comparisons were performed from the imputed Marey maps with a significance threshold of 5%, using the welch.test() function in the R software and the conservative Bonferroni correction for multiple testing. The origin of the information (original or imputed data along the chromosomes) was taken into account when comparing populations or pools of populations. For details see Additional file 8.
Effect of population structure on recombination rate
Genetic map lengths tended to be longer in populations involving Flint parents, suggesting that some alleles of factors controlling recombination rate may be differentially fixed in the two pools. To use a solid and objective measure of degree of 'flintness', we estimated the probability of the 22 parental lines to belong to one of the two main groups (Dent or Flint). To do this, we estimated admixture in a combined analysis of the Dent and Flint diversity panels described earlier , which included also the lines of our study. This analysis was done with the Admixture software (version 1.22) , using 25,237 PANZEA SNPs and 559 maize lines. We chose k = 2 for the number of groups assuming the two pools Dent and Flint, and used the probability of each of the 22 parental lines to belong to the Flint pool for a correlation analysis with recombination rates. More precisely, we analyzed the correlation between the GWRRs of the 23 populations and the average of the 'flintness' of the two parents of each population using the function lm() of the R software. The associated R2 specifies what fraction of the variance in the GWRR is explained by the group structure of the parental lines. The function lm() also provides the P value for testing the absence of correlation.
Individual additive effects for recombination rate
The 23 genetic map lengths we estimated showed a clear positive correlation with the average 'flintness' of the parents in the crosses. However, the two central lines could be driving this correlation. To remove effects coming from the two central lines, we considered an additive model whereby the GWRR of a population produced by a cross is given by the average of two effects, one from each parent in the cross. For each founder line except for B73, there is a single cross in which it is involved. We took the GWRR of that cross and subtracted the GWRR of the cross involving the same central line and B73. This difference gives the individual additive effect of the founder line minus that of B73 up to a constant. This constant does not affect a putative correlation between individual-specific 'flintness' and GWRR. We performed the statistical test for significance of this correlation using the same procedures as in the previous section, central lines and B73 omitted.
Comparing recombination landscapes
Just as the genetic length of chromosome maps may differ, two recombination landscapes can have different features (different shapes of the Marey maps). To test whether these differences were statistically significant, we normalized the genetic lengths of the two maps or pools of maps to be compared, by rescaling both of them to their mean value. Then, to compare the shape of both normalized Marey maps, our approach was based on binning the landscapes, representing each as a histogram and then applying a chi-squared test with a conservative Bonferroni-corrected significance threshold of 5% (Additional file 11). The detailed procedure is explained in Additional file 8.
CO interference was modeled in the framework of the Gamma model , including a second pathway using the sprinkling procedure  whereby non-interfering pathway P2 COs are simply added to those of P1. So the features of CO distributions along chromosomes were modeled using two parameters: the intensity nu of interference in the interfering pathway P1, and the proportion p of COs formed through the non-interfering pathway P2. The detailed implementation of the maximum-likelihood method used to estimate the values of the two parameters nu and p of the model was described earlier .
To analyze interference in a pool of maps instead of an individual map, we used a similar maximum-likelihood approach, but the likelihood to be maximized was the product of likelihoods calculated for the individual maps. To test for differences between values of nu or p, we applied the Welch test (function welch.test() in R) with significance thresholds of 5%, using variances estimated from the Fisher information matrix obtained at the optimal values of the parameters.
Genome-wide recombination rate
Nested association mapping
Nucleolus organizer region
Quantitative trait loci
Single nucleotide polymorphism.
Keeney S, Giroux CN, Kleckner N: Meiosis-specific DNA double-strand breaks are catalyzed by Spo11, a member of a widely conserved protein family. Cell. 1997, 88: 375-384. 10.1016/S0092-8674(00)81876-0.
Hunter N, Kleckner N: The single-end invasion: an asymmetric intermediate at the double-strand break to double-holliday junction transition of meiotic recombination. Cell. 2001, 106: 59-70. 10.1016/S0092-8674(01)00430-5.
Anderson LK, Doyle GG, Brigham B, Carter J, Hooker KD, Lai A, Rice M, Stack SM: High-resolution crossover maps for each bivalent of Zea mays using recombination nodules. Genetics. 2003, 165: 849-865.
Moens PB, Kolas NK, Tarsounas M, Marcon E, Cohen PE, Spyropoulos B: The time course and chromosomal localization of recombination-related proteins at meiosis in the mouse are compatible with models that can resolve the early DNA-DNA interactions without reciprocal recombination. J Cell Sci. 2002, 115: 1611-1622.
Baudat F, Massy B: Regulating double-stranded DNA break repair towards crossover or non-crossover during mammalian meiosis. Chromosome Res. 2007, 15: 565-577. 10.1007/s10577-007-1140-3.
Martini E, Diaz RL, Hunter N, Keeney S: Crossover homeostasis in yeast meiosis. Cell. 2006, 126: 285-295. 10.1016/j.cell.2006.05.044.
Sturtevant AH: The behaviour of the chromosomes as studied through linkage. Mol Gen Genet. 1915, 13: 234-287. 10.1007/BF01792906.
Muller HJ: The mechanism of crossing-over. Am Nat. 1916, 50: 193-221. 10.1086/279534.
Szostak JW, Orrweaver TL, Rothstein RJ, Stahl FW: The double-strand-break repair model for recombination. Cell. 1983, 33: 25-35. 10.1016/0092-8674(83)90331-8.
Foss E, Lande R, Stahl FW, Steinberg CM: Chiasma interference as a function of genetic distance. Genetics. 1993, 133: 681-691.
Copenhaver GP, Housworth EA, Stahl FW: Crossover interference in arabidopsis. Genetics. 2002, 160: 1631-1639.
Hollingsworth NM, Brill SJ: The Mus81 solution to resolution: generating meiotic crossovers without Holliday junctions. Genes Dev. 2004, 18: 117-125. 10.1101/gad.1165904.
Stahl FW, Foss HM, Young LS, Borts RH, Abdullah MFF, Copenhaver GP: Does crossover interference count in Saccharomyces cerevisiae?. Genetics. 2004, 168: 35-48. 10.1534/genetics.104.027789.
Lhuissier FGP, Offenberg HH, Wittich PE, Vischer NOE, Heyting C: The mismatch repair protein MLH1 marks a subset of strongly interfering crossovers in tomato. Plant Cell. 2007, 19: 862-876. 10.1105/tpc.106.049106.
Higgins JD, Armstrong SJ, Franklin FCH, Jones GH: The Arabidopsis MutS homolog AtMSH4 functions at an early step in recombination: evidence for two classes of recombination in Arabidopsis. Genes Dev. 2004, 18: 2557-2570. 10.1101/gad.317504.
Guillon H, Baudat F, Grey C, Liskay RM, de Massy B: Crossover and noncrossover pathways in mouse meiosis. Mol Cell. 2005, 20: 563-573. 10.1016/j.molcel.2005.09.021.
Falque M, Anderson LK, Stack SM, Gauthier F, Martin OC: Two types of meiotic crossovers coexist in maize. Plant Cell. 2009, 21: 3915-3925. 10.1105/tpc.109.071514.
Mercier R, Jolivet S, Vezon D, Huppe E, Chelysheva L, Giovanni M, Nogué F, Doutriaux M-P, Horlow C, Grelon M, Mézard C: Two meiotic crossover classes cohabit in Arabidopsis: one is dependent on MER3, whereas the other one is not. Curr Biol. 2005, 15: 692-701. 10.1016/j.cub.2005.02.056.
Jensen-Seaman MI, Furey TS, Payseur BA, Lu Y, Roskin KM, Chen C-F, Thomas MA, Haussler D, Jacob HJ: Comparative recombination rates in the rat, mouse, and human genomes. Genome Res. 2004, 14: 528-538. 10.1101/gr.1970304.
Henderson IR: Control of meiotic recombination frequency in plant genomes. Curr Opin Plant Biol. 2012, 15: 556-561. 10.1016/j.pbi.2012.09.002.
Berg IL, Neumann R, Lam K-WG, Sarbajna S, Odenthal-Hesse L, May CA, Jeffreys AJ: PRDM9 variation strongly influences recombination hot-spot activity and meiotic instability in humans. Nat Genet. 2010, 42: 859-863. 10.1038/ng.658.
Ségurel L, Leffler EM, Przeworski M: The case of the fickle fingers: how the PRDM9 zinc finger protein specifies meiotic recombination hotspots in humans. PLoS Biol. 2011, 9: e1001211-10.1371/journal.pbio.1001211.
Sanchez-Moran E, Armstrong SJ, Santos JL, Franklin FCH, Jones GH: Variation in chiasma frequency among eight accessions of Arabidopsis thaliana. Genetics. 2002, 162: 1415-1422.
Meuwissen THE, Hayes BJ, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001, 157: 1819-1829.
Wang WYS, Barratt BJ, Clayton DG, Todd JA: Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet. 2005, 6: 109-118. 10.1038/nrg1522.
Coe E: East, Emerson, and the birth of maize genetics. Maize Handbook. Volume II: Genetics and Genomics. Edited by: Bennetzen JL, Hake S. 2009, New York, USA: Springer, 3-15.
Gaut BS, Doebley JF: DNA sequence evidence for the segmental allotetraploid origin of maize. Proc Natl Acad Sci U S A. 1997, 94: 6809-6814. 10.1073/pnas.94.13.6809.
Helentjaris T, Slocum M, Wright S, Schaefer A, Nienhuis J: Construction of genetic linkage maps in maize and tomato using restriction fragment length polymorphisms. Theor Appl Genet. 1986, 72: 761-769.
Lawrence CJ, Seigfried TE, Brendel V: The Maize Genetics and Genomics Database. The community resource for access to diverse maize data. Plant Physiol. 2005, 138: 55-58. 10.1104/pp.104.059196.
Maize Genetics and Genomics Database (MaizeGDB). http://www.maizegdb.org/,
Jones E, Chu W-C, Ayele M, Ho J, Bruggeman E, Yourstone K, Rafalski A, Smith O, McMullen M, Bezawada C, Warren J, Babayev J, Basu S, Smith S: Development of single nucleotide polymorphism (SNP) markers for use in commercial maize (Zea mays L.) germplasm. Mol Breed. 2009, 24: 165-176. 10.1007/s11032-009-9281-z.
McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H, Sun Q, Flint-Garcia S, Thornsberry J, Acharya C, Bottoms C, Brown P, Browne C, Eller M, Guill K, Harjes C, Kroon D, Lepak N, Mitchell SE, Peterson B, Pressoir G, Romero S, Rosas MO, Salvo S, Yates H, Hanson M, Jones E, Smith S, Glaubitz JC, Goodman M, Ware D, et al: Genetic properties of the maize nested association mapping population. Science. 2009, 325: 737-740. 10.1126/science.1174320.
Ganal MW, Durstewitz G, Polley A, Berard A, Buckler ES, Charcosset A, Clarke JD, Graner EM, Hansen M, Joets J, Le Paslier MC, McMullen MD, Montalent P, Rose M, Schön CC, Sun Q, Walter H, Martin OC, Falque M: A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PLoS ONE. 2011, 6: e28334-10.1371/journal.pone.0028334.
Schnable PS, Ware D, Fulton RS, Stein JC, Wei FS, Pasternak S, Liang CZ, Zhang JW, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du FY, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al: The B73 maize genome: Complexity, diversity, and dynamics. Science. 2009, 326: 1112-1115. 10.1126/science.1178534.
Stich B, Melchinger AE, Frisch M, Maurer HP, Heckenberger M, Reif JC: Linkage disequilibrium in European elite maize germplasm investigated with SSRs. Theor Appl Genet. 2005, 111: 723-730. 10.1007/s00122-005-2057-x.
Röber FK, Gordillo GA, Geiger HH: In vivo haploid induction in maize - performance of new inducers and significance of doubled haploid lines in hybrid breeding. Maydica. 2005, 50: 275-283.
Rincent R, Laloë D, Nicolas S, Altmann T, Brunel D, Revilla P, Rodriguez VM, Moreno-Gonzales J, Melchinger AE, Bauer E, Schön C-C, Meyer N, Giauffret C, Bauland C, Jamin P, Laborde J, Monod H, Flament P, Charcosset A, Moreau L: Maximizing the reliability of genomic selection by optimizing the calibration set of reference individuals: comparison of methods in two diverse groups of maize inbreds (Zea mays L.). Genetics. 2012, 192: 715-728. 10.1534/genetics.112.141473.
Ghaffari R, Cannon ES, Kanizay L, Lawrence C, Dawe RK: Maize chromosomal knobs are located in gene-dense areas and suppress local recombination. Chromosoma. 2012, 122: 67-75.
Crismani W, Girard C, Mercier R: Tinkering with meiosis. J Exp Bot. 2013, 64: 55-65. 10.1093/jxb/ers314.
Salome PA, Bomblies K, Fitz J, Laitinen RAE, Warthmann N, Yant L, Weigel D: The recombination landscape in Arabidopsis thaliana F2 populations. Heredity. 2012, 108: 447-455. 10.1038/hdy.2011.95.
Lynch M: The origins of eukaryotic gene structure. Mol Biol Evol. 2006, 23: 450-468.
Chia J-M, Song C, Bradbury PJ, Costich D, de Leon N, Doebley J, Elshire RJ, Gaut B, Geller L, Glaubitz JC, Gore M, Guill KE, Holland J, Hufford MB, Lai J, Li M, Liu X, Lu Y, McCombie R, Nelson R, Poland J, Prasanna BM, Pyhajarvi T, Rong T, Sekhon RS, Sun Q, Tenaillon MI, Tian F, Wang J, Xu X, et al: Maize HapMap2 identifies extant variation from a genome in flux. Nat Genet. 2012, 44: 803-807. 10.1038/ng.2313.
Beavis WD, Grant D: A linkage map based on information from four F2 populations of maize (Zea mays L.). Theor Appl Genet. 1991, 82: 636-644.
Esch E, Szymaniak JM, Yates H, Pawlowski WP, Buckler ES: Using crossover breakpoints in recombinant inbred lines to identify quantitative trait loci controlling the global recombination frequency. Genetics. 2007, 177: 1851-1858. 10.1534/genetics.107.080622.
Sandor C, Li W, Coppieters W, Druet T, Charlier C, Georges M: Genetic variants in REC8, RNF212, and PRDM9 influence male recombination in cattle. PLoS Genet. 2012, 8: e1002854-10.1371/journal.pgen.1002854.
Kong A, Thorleifsson G, Stefansson H, Masson G, Helgason A, Gudbjartsson DF, Jonsdottir GM, Gudjonsson SA, Sverrisson S, Thorlacius T, Jonasdottir A, Hardarson GA, Palsson ST, Frigge ML, Gulcher JR, Thorsteinsdottir U, Stefansson K: Sequence variants in the RNF212 gene associate with genome-wide recombination rate. Science. 2008, 319: 1398-1401. 10.1126/science.1152422.
Dole J, Weber DF: Detection of quantitative trait loci influencing recombination using recombinant inbred lines. Genetics. 2007, 177: 2309-2319. 10.1534/genetics.107.076679.
Dooner HK, He L: Maize genome structure variation: interplay between retrotransposon polymorphisms and genic recombination. Plant Cell. 2008, 20: 249-258. 10.1105/tpc.107.057596.
He L, Dooner HK: Haplotype structure strongly affects recombination in a maize genetic interval polymorphic for Helitron and retrotransposon insertions. Proc Natl Acad Sci U S A. 2009, 106: 8410-8416. 10.1073/pnas.0902972106.
Albert PS, Gao Z, Danilova TV, Birchler JA: Diversity of chromosomal karyotypes in maize and its relatives. Cytogenet Genome Res. 2010, 129: 6-16. 10.1159/000314342.
Brunner S, Fengler K, Morgante M, Tingey S, Rafalski A: Evolution of DNA sequence nonhomologies among maize inbreds. Plant Cell. 2005, 17: 343-360. 10.1105/tpc.104.025627.
Springer NM, Ying K, Fu Y, Ji T, Yeh C-T, Jia Y, Wu W, Richmond T, Kitzman J, Rosenbaum H, Iniguez AL, Barbazuk WB, Jeddeloh JA, Nettleton D, Schnable PS: Maize inbreds exhibit high levels of copy number variation (CNV) and presence/absence variation (PAV) in genome content. PLoS Genet. 2009, 5: e1000734-10.1371/journal.pgen.1000734.
Hammarlund M, Davis MW, Nguyen H, Dayton D, Jorgensen EM: Heterozygous insertions alter crossover distribution but allow crossover interference in Caenorhabditis elegans. Genetics. 2005, 171: 1047-1056. 10.1534/genetics.105.044834.
Zickler D: From early homologue recognition to synaptonemal complex formation. Chromosoma. 2006, 115: 158-174. 10.1007/s00412-006-0048-6.
McClintock B, Yamakake T, Blumenschein A: Chromosome Constitution of Races of Maize: its Significance in the Interpretation of Relationships Between Races and Varieties in the Americas. 1981, Colegio de Postgraduados, Escuela National de Agricultura, Chapingo: Chapingo, Mexico
Paape T, Zhou P, Branca A, Briskine R, Young N, Tiffin P: Fine-scale population recombination rates, hotspots, and correlates of recombination in the Medicago truncatula genome. Genome Biol Evol. 2012, 4: 726-737. 10.1093/gbe/evs046.
Housworth EA, Stahl FW: Crossover interference in humans. Am J Hum Genet. 2003, 73: 188-197. 10.1086/376610.
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE: A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE. 2011, 6: e19379-10.1371/journal.pone.0019379.
Kruglyak L: Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat Genet. 1999, 22: 139-144. 10.1038/9642.
de Roos APW, Hayes BJ, Goddard ME: Reliability of genomic predictions across multiple populations. Genetics. 2009, 183: 1545-1553. 10.1534/genetics.109.104935.
Hadad RG, Pfeiffer TW, Poneleit CG: Repeatability and heritability of divergent recombination frequencies in the Iowa Stiff Stalk Synthetic (Zea mays L.). Theor Appl Genet. 1996, 93: 990-996.
Illumina MaizeSNP50 Cluster File maizesnp50_b.egt. http://support.illumina.com/array/array_kits/maizesnp50_dna_analysis_kit/downloads.ilmn,
NCBI GEO: Intraspecific variation of recombination rate in maize. http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE50558,
de Givry S, Bouchez M, Chabrier P, Milan D, Schiex T: CARTHAGENE: multipopulation integrated genetic and radiation hybrid mapping. Bioinformatics. 2005, 21: 1703-1704. 10.1093/bioinformatics/bti222.
Falque M, Décousset L, Dervins D, Jacob A-M, Joets J, Martinant J-P, Raffoux X, Ribière N, Ridel C, Samson D, Charcosset A, Murigneux A: Linkage mapping of 1454 new maize candidate gene loci. Genetics. 2005, 170: 1957-1966. 10.1534/genetics.104.040204.
Haldane JBS: The combination of linkage values, and the calculation of distance between the loci of linked factors. J Genet. 1919, 8: 299-309.
MaizeGDB data record. http://www.maizegdb.org/cgi-bin/displayrefrecord.cgi?id=2804601,
Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.
Chakravarti A: A graphical representation of genetic and physical maps: The Marey map. Genomics. 1991, 11: 219-222. 10.1016/0888-7543(91)90123-V.
Berloff N, Perola M, Lange K: Spline methods for the comparison of physical and genetic maps. J Comput Biol. 2002, 9: 465-475. 10.1089/106652702760138565.
Alexander DH, Novembre J, Lange K: Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19: 1655-1664. 10.1101/gr.094052.109.
McPeek MS, Speed TP: Modelling interference in genetic recombination. Genetics. 1995, 139: 1031-1044.
We thank Christine Dillmann, Philippe Brabant, and Domenica Manicacci for discussing statistical approaches, and Christine Mézard and Raphaël Mercier for helpful comments on the manuscript. We thank Michael Seidel for providing the physical map coordinates of SNP markers in the B73 AGPv2 assembly, and Hans-Jürgen Auinger, Christina Lehermeier, and Valentin Wimmer for help with R scripts. We also thank Ruedi Fries and Hubert Pausch for processing of SNP arrays and Stefan Schwertfirm for excellent technical assistance. Results have been achieved in the framework of the Transnational (Germany, France, Spain) Cooperation within the PLANT-KBBE Initiative Cornfed, additionally supported by the project AMAIZING. The work was financed by grants from Agence Nationale de la Recherche ('ANR') to AC, MF, MM, and PF, grants from the Ministry of Science and Innovation (Ministerio de Ciencia e Innovación ('MICINN')) to JMG and PR, and grants from the Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung, 'BMBF') to TA, AEM, MO, and CCS.
The authors declare that they have no competing interests.
TA, AC, PF, AEM, MM, JMG, MO, PR, and CCS designed the project. TA, CB, CC, LC, AC, MF, PF, JMG, OCM, AEM, MM, MO, NR, PR, and WS developed the populations or contributed reagents or software tools. EB, CB, CC, PF, NM, NR, WS, and HW performed the experiments. EB, MF, OCM, and CCS designed the data analysis. EB, MF, OCM, RR, and HW analyzed the data. EB, MF, OCM, and CCS wrote the paper. All authors read and approved the manuscript.
Eva Bauer, Matthieu Falque, Olivier C Martin and Chris-Carolin Schön contributed equally to this work.
Electronic supplementary material
Additional file 1: Table S1: Maize lines used in this study, assignment to gene pools and origin of the lines. (XLSX 10 KB)
Additional file 5: Figure S1: Allele frequency of the central parent allele for all polymorphic markers (A) in all Dent populations and (B) in all Flint populations. The 10 chromosomes are represented along the same horizontal axis. The expected frequency of 0.5 is indicated by a dotted line, and surrounded by solid lines representing its 99% confidence intervals. The x-axis indicates physical coordinates in megabase pairs along the B73 genome. In the bottom of the figure, the heat map represents gene density (low for cold colors and high for hot colors). Gray horizontal line below the heat-map: sketch of the chromosome organization showing centromeres (cen), knobs, nucleolar organizer region (NOR), and known gametophytic factors (gax) (from ). Color filling of chromosome features is solid when the estimated boundaries of the region are known, and hatched when the box indicates only the extremities of the bin containing the region. (PDF 2 MB)
Additional file 6: Figure S2: Statistical comparison of chromsome-wide recombination rates between the 23 populations of the experiment. Chromosome 'All' (first page) corresponds to the genome-wide analysis. 'DentAll', 'FlintAll', and 'All' correspond, respectively, to pooled analyses of all Dent × Dent populations, all Flint × Flint populations, and all 23 populations together. Dark blue, light blue, green, yellow, and red correspond respectively to P ≥ 5.10-2, 10-3 ≤ P < 10-2, 10-4 ≤ P < 10-3, 10-5 ≤ P < 10-4, P < 10-5 where P is the P value of the pairwise comparison test, corrected for multiple testing (Bonferroni). Arrows pointing to the right (respectively to the bottom) indicate that the cross listed in the vertical axis (respectively the horizontal axis) has a higher recombination rate than the cross listed in the horizontal axis (respectively the vertical axis). Dendrograms indicate hierarchical clustering of -log10(P value) based on Euclidian distances, and were used to order the populations. (PDF 69 KB)
Additional file 7: Figure S3: Marey maps and recombination landscapes along the chromosomes for three examples illustrating the imputation of regions with missing or unreliable data: CFD01 chromosome 1, CFF07 chromosome 1, and CFF01 chromosome 3. The x-axis indicates physical position of the SNPs on the B73 physical map in megabase pairs. The left y-axis indicates genetic map position in centiMorgans. The right y-axis indicates recombination rate in cM/Mbp. Each black empty circle corresponds to a SNP. Red dots indicate the outlier markers removed from the smoothing analysis. Dark blue dotted line: smoothed Marey map. Red dotted line: first derivative of the smoothed Marey map. Hatched rectangles: regions masked when going from bare to masked Marey maps (see Materials and methods). Light blue solid line: imputed smoothed Marey map obtained after imputation in the excluded regions, using data from all maps pooled. Pink solid curve: recombination rate computed as the first derivative of the imputed smoothed Marey map. (PDF 72 KB)
Additional file 9: Figure S4: Recombination rates along maize chromosomes 1 to 10. The x-axis indicates physical position (Mbp) along chromosomes. The y-axis indicates genetic position (cM; top panel), recombination rate (cM/Mbp; middle panel), and pairwise parental similarity (frequency of identical SNP alleles in 10 Mbp sliding windows with a step size of 2 Mbp; bottom panel) for 6 of the 23 populations, after smoothing and imputation (see Materials and methods). Blue lines: Flint × Flint crosses. Red lines: Dent × Dent crosses. In both groups, the solid, dashed and dotted lines correspond to the population with the highest, median or lowest genome-wide recombination rate within its group, respectively. Gray parts of the lines correspond to regions where the information was missing or not reliable (IBD segments, non-colinearity with B73), and thus imputed from the other maps (see Materials and methods). Heat-maps below the curves of recombination rates indicate gene density (low for cold colors and high for hot colors). Gray horizontal line below the heat-map: sketch of the chromosome organization (from ) showing centromeres (cen), knobs, and nucleolar organizer region (NOR). Color filling of chromosome features is solid when the estimated boundaries of the region are known, and hatched when the box indicates only the extremities of the bin containing the region. (PDF 684 KB)
Additional file 10: Figure S5: Statistical comparison of genome-wide recombination landscapes between the 23 populations for all chromosomes pooled as well as for each chromosome. For each pairwise comparison test, both genetic map lengths were normalized to their average value, so the test is not affected by differences in the global recombination rate but only by differences in the shape of the recombination landscape. 'DentAll', 'FlintAll', and 'All' correspond, respectively, to pooled analyses of all Dent × Dent populations, all Flint × Flint populations, and all 23 populations together. Dark blue, light blue, green, yellow, and red correspond respectively to P ≥ 5.10-2, 10-3 ≤ P < 10-2, 10-4 ≤ P < 10-3, 10-5 ≤ P < 10-4, P < 10-5 where P is the P value of the pairwise comparison test, corrected for multiple testing (Bonferroni). Dendrograms indicate hierarchical clustering of -log10(P value) based on Euclidian distances, and were used to order the populations. (PDF 55 KB)
Additional file 11: Figure S6: Illustration of the statistical test used to compare recombination landscapes between the pooled data of all Dent × Dent (red) and Flint × Flint (blue) populations. Within regions excluded from the analysis for one population, the data were imputed from the other pool for conservativeness of the test (Additional file 8). Solid curves: Marey maps normalized to the average genetic length, so the comparison focuses on differences in the shape of the recombination landscapes and is not affected by differences in the values of chromosome genetic lengths. Dotted curves: first derivative of the normalized Marey maps, indicating the recombination landscape along the chromosome. The black rectangles show the 10 bins used for the analysis (from left to right: bins 1 to 10). Bin boundaries were chosen so each bin contained regions of the same genetic length. On top of each bar, a black vertical arrow indicates the difference between both populations in average recombination rates over the bin considered, and the error bars indicate the 95% confidence intervals of these average recombination rates. (PDF 248 KB)
Additional file 12: Figure S7: Statistical comparisons of interference intensity in pathway P1 (nu) between individual populations, for all chromosomes pooled together. 'DentAll', 'FlintAll', and 'All' correspond, respectively, to pooled analyses of all Dent × Dent populations, all Flint × Flint populations, and all 23 populations together. Dark blue, light blue, green, yellow, and red correspond respectively to P ≥ 5.10-2, 10-3 ≤ P < 10-2, 10-4 ≤ P < 10-3, 10-5 ≤ P < 10-4, P < 10-5 where P is the P value of the pairwise comparison test, corrected for multiple testing (Bonferroni). Arrows pointing to the right (respectively to the bottom) indicate that the cross listed in the vertical axis (respectively the horizontal axis) has a higher value of nu than the cross listed in the horizontal axis (respectively the vertical axis). Dendrograms indicate hierarchical clustering of -log10(P value) based on Euclidian distances, and were used to order the populations. (PDF 9 KB)
Additional file 13: Figure S8: Statistical comparisons between individual populations, of the fraction (p) of crossovers formed via the non-interfering pathway P2, for all chromosomes pooled together. 'DentAll', 'FlintAll', and 'All' correspond, respectively, to pooled analyses of all Dent × Dent populations, all Flint × Flint populations, and all 23 populations together. Dark blue, light blue, green, yellow, and red correspond respectively to P ≥ 5.10-2, 10-3 ≤ P < 10-2, 10-4 ≤ P < 10-3, 10-5 ≤ P < 10-4, P < 10-5 where P is the P value of the pairwise comparison test, corrected for multiple testing (Bonferroni). Arrows pointing to the right (respectively to the bottom) indicate that the cross listed in the vertical axis (respectively the horizontal axis) has a higher value of p than the cross listed in the horizontal axis (respectively the vertical axis). Dendrograms indicate hierarchical clustering of -log10(P value) based on Euclidian distances, and were used to order the populations. (PDF 10 KB)
Additional file 14: Figure S9: Correlation between interference intensity in pathway P1 (nu) and genome-wide recombination rate for the 10 chromosomes pooled together over all populations. (PDF 5 KB)
Additional file 15: Figure S10: Crossing scheme of the two Dent and Flint half-sib panels. In each panel, a central line was crossed to diverse founder lines and DH lines were developed from the resulting F1 plants using the in vivo haploid induction method. Dent lines are shown in red, Flint lines in blue. Red lines: Dent × Dent crosses. Blue lines: Flint × Flint crosses. Gray lines: Dent × Flint/Flint × Dent crosses. Two reciprocal crosses (CFD01, CFF01) connect the panels via the central lines F353 and UH007. Both panels are also connected by crossing the central parent to B73 (CFD02, CFF02). The panels consist of 11 CFD and 13 CFF full-sib families, respectively. (PDF 13 KB)
About this article
Cite this article
Bauer, E., Falque, M., Walter, H. et al. Intraspecific variation of recombination rate in maize. Genome Biol 14, R103 (2013). https://doi.org/10.1186/gb-2013-14-9-r103
- Quantitative Trait Locus
- Recombination Rate
- Doubled Haploid
- Single Nucleotide Polymorphism Marker