Statistical methods for comparing the abundances of metabolic pathways in metagenomics

Liu, Bo; Pop, Mihai

doi:10.1186/gb-2010-11-s1-o7

Volume 11 Supplement 1

Beyond the Genome: The true gene count, human evolution and disease genomics

Selected oral presentation
Published: 11 October 2010

Genome Biology volume 11, Article number: O7 (2010)

Background

A major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile and the abundances for a sample can be estimated by mapping metagenomic sequences to the global metabolic network consisting of thousands of molecular reactions. Here we describe our development of statistical methods that can identify differentially abundant subnetworks between metagenomic samples.

Methods

First, we introduce a scoring function for an arbitrary subnetwork and find the maximum-weight subnetwork in the global network by greedy search. We then compute two p values, p_abund and p_struct, using nonparametric approaches to answer two statistical questions: (i) Is this subnetwork differentially abundant? (ii) What is the probability of finding a subnetwork this good by chance? Significant metabolic subnetworks are detected on the basis of these two p values.
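The abstract does not specify the scoring function or the nonparametric procedures, so the following Python sketch only illustrates the general shape of such a pipeline. Everything in it is an assumption made for illustration: the per-reaction t-like score, the size-normalized sum used as the subnetwork score, the label-shuffling test standing in for p_abund, the score-shuffling test standing in for p_struct, and all function names. The network is assumed to be an adjacency dictionary mapping each reaction to its neighbors.

```python
import random
from statistics import mean, stdev


def node_scores(abund_a, abund_b):
    """Per-reaction Welch-style t statistic (assumed score; positive = higher in A).

    Each group needs at least two samples per reaction for stdev().
    """
    scores = {}
    for r in abund_a:
        a, b = abund_a[r], abund_b[r]
        se = (stdev(a) ** 2 / len(a) + stdev(b) ** 2 / len(b)) ** 0.5
        scores[r] = (mean(a) - mean(b)) / se if se > 0 else 0.0
    return scores


def subnetwork_score(nodes, scores):
    """Assumed aggregate: sum of node scores divided by sqrt(subnetwork size)."""
    return sum(scores[n] for n in nodes) / len(nodes) ** 0.5


def greedy_subnetwork(graph, scores, seed):
    """Grow a connected subnetwork from `seed` while the score keeps improving."""
    current, best = {seed}, scores[seed]
    while True:
        frontier = sorted({m for n in current for m in graph[n]} - current)
        if not frontier:
            break
        cand = max(frontier, key=lambda m: subnetwork_score(current | {m}, scores))
        cand_score = subnetwork_score(current | {cand}, scores)
        if cand_score <= best:
            break
        current.add(cand)
        best = cand_score
    return current, best


def best_subnetwork(graph, scores):
    """Run the greedy search from every node and keep the highest-scoring result."""
    return max((greedy_subnetwork(graph, scores, s) for s in graph),
               key=lambda t: t[1])


def p_abund(abund_a, abund_b, nodes, n_perm=200, seed=0):
    """Assumed test: shuffle sample labels and rescore the fixed subnetwork."""
    rng = random.Random(seed)
    n_a = len(next(iter(abund_a.values())))
    pooled = {r: abund_a[r] + abund_b[r] for r in abund_a}
    n_total = len(next(iter(pooled.values())))
    observed = subnetwork_score(nodes, node_scores(abund_a, abund_b))
    hits = 0
    for _ in range(n_perm):
        idx = list(range(n_total))
        rng.shuffle(idx)  # one permutation of samples, shared by all reactions
        perm_a = {r: [v[i] for i in idx[:n_a]] for r, v in pooled.items()}
        perm_b = {r: [v[i] for i in idx[n_a:]] for r, v in pooled.items()}
        hits += subnetwork_score(nodes, node_scores(perm_a, perm_b)) >= observed
    return (hits + 1) / (n_perm + 1)


def p_struct(graph, scores, observed, n_perm=200, seed=0):
    """Assumed test: shuffle scores across nodes and rerun the greedy search."""
    rng = random.Random(seed)
    nodes, values = list(graph), [scores[n] for n in graph]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(values)
        hits += best_subnetwork(graph, dict(zip(nodes, values)))[1] >= observed
    return (hits + 1) / (n_perm + 1)
```

In this sketch the greedy search only maximizes the score, so it finds subnetworks enriched in group A; swapping the two groups would search in the other direction. The `(hits + 1) / (n_perm + 1)` form keeps the estimated p value strictly positive.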

Results

Simulated datasets. We randomly chose a metabolic subnetwork to be differentially abundant and then simulated the abundance values from Gaussian distributions. Figure 1 shows the performance of different methods in discovering the significant subnetwork.

Real metagenomic datasets. We analyzed gut microbiome data from obese and lean subjects [1] and from infant and adult subjects (Kurokawa et al., 2007), and found several interesting pathways. For example, five pathways in fatty acid biosynthesis are enriched in obese subjects, which confirms the result of a previous study that obese subjects have an increased capacity for dietary energy harvest. In addition, four and three homocysteine pathways are enriched in obese and infant subjects, respectively (Figure 2), indicating that they are highly correlated with the homocysteine levels in blood serum.
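The simulation setup described above (a randomly chosen subnetwork with Gaussian abundance values) can be sketched roughly as follows. The abstract gives no parameters, so the subnetwork size, the unit variance, the mean shift `effect`, the number of samples per group, and the function names are all hypothetical choices made for illustration.

```python
import random


def random_connected_subnetwork(graph, size, rng):
    """Grow a connected node set of the requested size from a random seed node."""
    nodes = {rng.choice(sorted(graph))}
    while len(nodes) < size:
        frontier = sorted({m for n in nodes for m in graph[n]} - nodes)
        if not frontier:
            break  # connected component exhausted before reaching `size`
        nodes.add(rng.choice(frontier))
    return nodes


def simulate_abundances(graph, planted, n_samples, effect, rng):
    """Draw Gaussian abundances; planted reactions get a mean shift in group A."""
    group_a, group_b = {}, {}
    for r in graph:
        shift = effect if r in planted else 0.0
        group_a[r] = [rng.gauss(shift, 1.0) for _ in range(n_samples)]
        group_b[r] = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    return group_a, group_b
```

A method can then be evaluated by checking how much of the planted subnetwork it recovers across repeated simulations.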

Conclusions

We have developed statistical methods to find differentially abundant metabolic pathways in metagenomic data. Their performance is better than that of previous approaches. Results from real metagenomic datasets confirm previous observations and also provide several new biological insights.

References

1. Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, Gordon JI: An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006, 444: 1027–1031. 10.1038/nature05414.

Author information

Authors and Affiliations

Center for Bioinformatics and Computational Biology, UMIACS, Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
Bo Liu & Mihai Pop

About this article

Cite this article

Liu, B., Pop, M. Statistical methods for comparing the abundances of metabolic pathways in metagenomics. Genome Biol 11 (Suppl 1), O7 (2010). https://doi.org/10.1186/gb-2010-11-s1-o7
