Volume 12 Supplement 1

Beyond the Genome 2011

Open Access

Utilizing whole genome sequences to study population genomics of gene networks: a case study of the Arabidopsis thalianaimmune-signaling network

  • Mridu Middha1, 2,
  • Yungil Kim3,
  • Peter Morrell4,
  • Chad Myers3 and
  • Fumiaki Katagiri2
Genome Biology201112(Suppl 1):P42


Published: 19 September 2011

Arabidopsis thaliana is a member of the mustard (Brassicaceae) family that is widely used as a model organism in plant biology. The 1001 Genomes Project [1] has been sequencing the genomes of Arabidopsis strains (accessions) and has made these sequences available. We selected the genomes of 30 Arabidopsis accessions with diverse geographical and environmental origins for our analysis. Using the TAIR8 annotation of the Arabidopsis reference genome, for the accession Col-0, we generated a dataset of approximately 27,000 protein-coding genes for all of the 30 genomes. With such population genomic data, it is feasible to ask whether a group of genes is under a different type of selection from the rest of the genome.

The plant immune-signaling network is robust to network perturbations. We hypothesized that genes that constitute a robust network tend to be under neutral selection because deleterious mutations in such genes do not strongly affect the immune phenotype owing to the robustness of the network. We identified the component genes of the plant immune-signaling network in a relatively unbiased manner by mining AraNet [2], which is a functional gene network built without using phenotype information. We compared population genetic summary statistics for the network component genes and those for all of the genes in the genome. For example, Tajima’s D is such a summary statistic, and positive, negative and zero values of Tajima’s D suggest diversifying, purifying and neutral selection, respectively, when the effective population size does not change. The Tajima’s D value distribution for all of the genes in the genome has a single clear peak with a negative value, suggesting that purifying selection is the genomic norm.

Our preliminary results showed that the plant immune-signaling network genes are significantly enriched with genes whose Tajima’s D values are near zero compared with all of the genes in the genome. This finding suggests that there is a lower level of purifying selection among the network component genes than other genes.

Authors’ Affiliations

Biomedical Informatics and Computational Biology, University of Minnesota
Department of Plant Biology, Microbial and Plant Genomics Institute, University of Minnesota
Department of Computer Science & Engineering, University of Minnesota
Department of Agronomy & Plant Genetics, University of Minnesota



© Middha et al; licensee BioMed Central Ltd. 2011

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.