Figure 1 | Genome Biology

Figure 1

From: The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties

Power-law behaviour is observed for many genomic properties. (a) The occurrence of DNA words, InterPro families and protein folds in the worm genome. Black diamonds, 6-mers; dark-gray diamonds, 7-mers; mid-gray diamonds, 8-mers; light-gray diamonds, 9-mers; open diamonds, 10-mers; red circles, gene families; open green squares, protein superfamilies; blue crosses, protein folds. The solid lines represent the best-fit power-law functions for each distribution. (b) The occurrence of pseudogene families (open green squares) and pseudomotifs (black crosses) in the worm intergenic regions. (c) The occurrence of InterPro families in M. genitalium (black diamonds); E. coli (dark-gray diamonds); S. cerevisiae (mid-gray diamonds); and D. melanogaster (open diamonds). (d) Other properties that follow the power law. Black crosses, the number of assigned functions for each fold; open blue squares, the number of protein-protein interactions each fold makes in the yeast two-hybrid experiment; open green circles, the number of transcripts of each fold during vegetative growth in yeast. (e) Best-fit functions for the occurrence of protein folds in the worm genome (blue crosses): linear ($y = a - bx$), exponential ($y = ae^{-bx}$), double-exponential ($y = ae^{-bx} + ce^{-dx}$), triple-exponential ($y = ae^{-bx} + ce^{-dx} + fe^{-gx}$), stretched-exponential, and power-law ($y = ax^{-b}$) functions. The residuals ($R$) between the functions and the genomic data are calculated as $R = \sum (N_{\text{folds, actual}} - N_{\text{folds, fitted}})^2$. (f) Properties that do not follow the power law. The occurrence of 3-mers (open blue squares); 4-mers (green crosses); and 5-mers (open dark-blue squares) in the worm genome. Open blue circles, the average composition of asparagine in different folds; open red diamonds, the number of residues involved in protein flexibility in different folds. The slopes (exponent $b$) are given on the plots.
The worm genome was taken from the database at the National Center for Biotechnology Information [41], the family assignments were obtained from the InterPro proteome database [42], and the fold assignments from the Partslist database [20]. Solid red line, best-fit line for worm InterPro families.
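
The comparison in panel (e) can be illustrated with a minimal sketch. It is not the authors' fitting procedure, which the caption does not specify. It assumes ordinary least squares on log-transformed axes as a simple stand-in, and it uses synthetic occurrence counts generated from a power law, not the worm data. The function names (linear_fit, fit_power_law, fit_exponential, residual) are hypothetical. The residual is the sum of squared differences between actual and fitted values, as defined in the caption.

```python
import math

def linear_fit(us, vs):
    """Ordinary least squares for v = intercept + slope * u."""
    n = len(us)
    mu = sum(us) / n
    mv = sum(vs) / n
    suu = sum((u - mu) ** 2 for u in us)
    suv = sum((u - mu) * (v - mv) for u, v in zip(us, vs))
    slope = suv / suu
    return mv - slope * mu, slope

def fit_power_law(xs, ys):
    """Fit y = a * x**(-b) via a straight line on log-log axes (assumed method)."""
    intercept, slope = linear_fit([math.log(x) for x in xs],
                                  [math.log(y) for y in ys])
    return math.exp(intercept), -slope

def fit_exponential(xs, ys):
    """Fit y = a * exp(-b * x) via a straight line on semi-log axes (assumed method)."""
    intercept, slope = linear_fit(list(xs), [math.log(y) for y in ys])
    return math.exp(intercept), -slope

def residual(actual, fitted):
    """R = sum over points of (actual - fitted)**2."""
    return sum((y - f) ** 2 for y, f in zip(actual, fitted))

# Synthetic data (an assumption for illustration): exact power law, a = 1000, b = 1.5.
xs = list(range(1, 21))
ys = [1000.0 * x ** -1.5 for x in xs]

a, b = fit_power_law(xs, ys)
r_power = residual(ys, [a * x ** -b for x in xs])

a_exp, b_exp = fit_exponential(xs, ys)
r_exp = residual(ys, [a_exp * math.exp(-b_exp * x) for x in xs])

print(f"power law:   a={a:.3f}, b={b:.3f}, R={r_power:.3e}")
print(f"exponential: a={a_exp:.3f}, b={b_exp:.3f}, R={r_exp:.3e}")
```

On data that really follow a power law, the power-law fit gives a much smaller R than the exponential fit, which is the kind of comparison panel (e) reports. Fitting on log-transformed axes weights small counts differently from a direct nonlinear fit, so the two approaches can give different parameters on real data.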
